OpenStreetMap logo OpenStreetMap

Post When Comment
Welcome to the new Missing Maps

I always get a feeling of false dichotomy in these discussions. Sure, it is -possible- for a separate OSM and MM community to arise, but I don’t think that is the reality.

Traditionally, OSM has grown from the nerdy circles of open source enthusiasts and geography fanatics. As these people mostly know OSM already by now, I’ve heard people say that the community can’t grow much more. The lack of imagination is impressive. I see MM/HOT as a way to get people interested from a completely different background. My introduction about OSM at MM events is always something like “OSM is this crazy idea that makes unexpected things possible. It started with some people wanting some open data for England, and now you can drive from Alaska to Ushuaia with it. We also discovered that it’s the best way to get geodata to NGOs”. You cannot explain that MM is a crazy but possible idea without showing how the whole concept of OSM is completely outrageous - and working.

I also don’t see the work we do at a Mapathon as the most important. We’re just showing people what they can do with a few clicks of their mouse. So next time they use OSM and spot a mistake, they’ll be more likely to fix it. So at the next disaster, more experience mappers will be available for a fast response.

Letting your thinking about the (real) issues be guided by the idea of a dichotomy which is not real, makes the problems seem larger than they are.

Welcome to the new Missing Maps

to identify the best and brightest to receive additional training and mentoring to become local champions and event hosts

Why should the best mappers be the best event hosts? I’ll take myself as an example. I’ll be hosting my third MM mapathon soon, but I have hardly mapped with Missing Maps myself. To host, you need to be able to explain the project (both OSM in general and MM), to motivate and to get people started. But my own mapping is guided by what I feel like doing that day, not the tasking manager.

Where was I ?

QGIS will give you a lot more options for prettier visualization.

How easy is it to get this data in JOSM? (I would love to see my own map, but I have about a 1000 tracks)

First diary entry: notation, OSMTracker and iD editor.

I did not know you could copy-paste in iD, good to know!

In the old fashioned Web editor Potlatch, it is possible to select more than one thing, and add tags to all of them at once. But for simple things Iike benches, I map these directly on the phone, using Osmand. For better geometry, it helps to have a satellite overlay.

But if you like OSMtracker, I believe you’ll find your data easier to process in JOSM.

On language : the Belgian community often uses English to bridge our language devide. As said above, translating documentation is certainly possible and a good idea.

(the diary does not send me a message if there’s a new comment, so send me a direct message if you want to get in tocuh)

Data and community in the Belgian regions

Hi Mikel, a good introduction to my little project is [this diary post] (osm.org/user/joost%20schouppe/diary/26259) (but I’ve been writing about it for quite some time, [1] (osm.org/user/joost%20schouppe/diary/21826), 2, 3, 4).

In short : the kind of questions you ask are exactly the kind of questions I would like to answer, but for the whole world. I think I’m about ready to scale it up, however that remains to be seen. I would love to invest more time in this, but that would mean working less on things that earn money :)

Basic setup: take a full history dump, use a poly file to split the area you want with Mazdermind’s history splitter, then import it to Postgres with his history importer. Some basics I do within Postgres, but as I’m not strong on SQL I do most of the heavier analysis with SPSS.

I’ve always been more interested in the very basics (like evolution if number of active mappers, of road lengths, of road edits), but I did try things like identify people who work on bicycle infrastructure. That works, with some limitations. Lists of people could definitely be generated as a by-product of the yearly analysis I would like to do for all regions worldwide. It will probably take someone else to turn that into a service, however I would love to collaborate with more people on this project.

Statistical data of the Dutch OSM mappers.

When you do try to make a statistical comparison between people who did and did not get the message, be sure to use an appropriate statistical technique. Most simple would be to just compare something like number of people with a changeset between six months and a year after their first changeset (making sure this amount of time has passed for all cases studied). That way, you obviously lose a lot of cases that are still too young. The alternative is to use survival analysis. There even is a Kaplan Meier toolkit for Excel.

Re the surveying idea: could be very interesting, but the response rate will likely be very very small. Response is often a function of interest in the subject, and our study group are people with proven desinterest.

In the Belgian new contributor welcoming thing, we also register what their first edit was. Might be interesting over time to see if that has any predictive value.

I don’t know if we should automate this kind of thing too much. I believe the value of a message like this has much to do with how personal it is. What would be nice over time is to have a central way to register who is getting welcoming messages and who isn’t. So as to avoid that people watching their area of interest send duplicate messages to someone watching a larger area. And it would be a nice tool to highlight areas that - aren’t - getting messages.

New Users

In Belgium, we send a message to every new mapper, as picked up by Pascal Neis CSS feed, archived in a Google Doc. We do check the changesets, and sometimes personalise the welcome message based on that. There is no call to answer us, just an open invitation to join the community. With about 400 messages sent, we had a response in 11% of the cases.

I do have the feeling that personalisation increases chance of feedback, but then again feedback isn’t the goal. Maybe it would be more interesting to track link clicking from the message (but that might be off putting tonsome)

BTW our welcome message is available here for copying and improving. (scroll down for Dutch, French and English versions)

Natural language vs. abstract tags

Conifer=forestry is actually entirely logical in the UK. There are few examples of where a coniferous forest was not planted for forestry use, as this does not occur naturally in our climate (or most people don’t realize if it doesn’t),

So thus is exactly the kind of example you need: people think they know what words mean, but they don’t. The same with kiosk : maybe in England kiosks are either newsagents or ice-cream shops, but in other countries they might sell a range of items, and maybe evens specialize in both ice-cream and newspapers.

So both of these examples actually strengthen your point of describing aspects of a thing, not the thing itself. While I would agree that doing this radically might in fact discourage mappers, we will probably have to evolve more and more in this direction, just as we did with woodland tagging.

This will be a gradual evolution though, and not a revolution. That way tools can develop that put a layer of presets over things, in much the way that iD shows a description and JOSM the tags. And that way this necessary complexity is only introduced where it is really necessary.

The most useful thing to do, IMHO, is getting involved in specific discussions about tagging. The theoretical argument may add just the little weight there to tip the scale. But the “simple tagging model” has its worth too, and will not be abandoned just because it’s not logically elegant.

On the friture example, as a Belgian this is important to me :) I’m entirely satisfied with the fast food + cuisine=friture that is common practice. It might have the disadvantages of both tagging styles, but it is queriable - and no subset of tags could ever do a friture justice.

Myth of Newbie

I checked one or two hundred newbie changesets in our Belgian simple setup using the neis-one service with a Google spreadsheet. All of them got a welcome message.

I can certainly agree that to speak of newbies as a simple concept isn’t useful. The diversity in what they do is enormous : from changing one way streets to remapping entire universities. There is the occasional untagged way and other mistakes, though I haven’t got the statistic at hand.

Our little mapper-of-the-month team is planning a series of short interviews, which I hope to extend with some statistics derived from the welcoming project.

Maybe we can think of some kind of survey, I’d be happy to help (setting up surveys is part of my day job and education). The thing is, I don’t think it will be possible to get a high enough response rate to be representative. I only have anecdotal evidence of this (insert smiley), such as the 10% response rate to our welcome message (which doesn’t actually asks for response ) and the 0% response rate to our “questions for new mappers” after our last Missing Maps mapathon.

What comes first, Map or Database? Should we tell newbies the truth?

When I explain OSM to newbies, or when I see other people explain it, they always say something like “you might know Openstreetmap as the map on osm.org, but it is many many more things than that”. So the fact that it is a database is basically the second thing you say. I don’t see how that can be controversial, and why anyone would want to hide that from newbies. If you want to avoid the word “database”, you could say something like “OSM collects map data”.

OpenStreetMap Foundation Chairperson's Report for the Annual General Meeting

Agree with Davor and Vincent. There should not be a special procedure for special people. They’re not special, it’s just that their money is worth less when converted to pounds. Membership fee weighted by GDP/capita would be great. Of course, there can still be membership waiver program on top of that.

Data and community in the Belgian regions

Phillippe, in OSM, data is denser in Flanders too. Most of the above is about community size.

It might just be that in a couple of years Mapillary will have a larger community in Wallonia too, even if the data density stays higher in Wallonia.

And did you just call OSM old technology? I suddenly feel old now, too. :)

The history and completeness of OSM

I can compare Ecuador, Peru, Bolivia, Argentina and Chile from driving around there. I would say Argentina and Chile both have great maps, especially when it comes to completeness. But Argentina might be a more interesting case for you, as I think they didn’t have a large import. Ecuador is also pretty good. Peru and Bolivia have a lot of work left.

Data and community in the Belgian regions

Sander, I can understand the fear which might be very reasonable in some cases. The situation in Bolivia comes to mind, where we can make “formal” decisions if we agree among two or three people.

But I don’t think there’s anything to worry about in this case. This mapper’s interest is first and foremost landuse mapping. So yes, his view of landuse mapping will have a serious dominance in Flanders. But in this case, that’s a good thing, because he did a terrific job. So while it is a big lot of nodes, it’s only a quite narrow theme that he is dominant in.

The other thing about landuse is that it’s pretty stable. Mapping a few 100 square kilometers takes many many hours. But keeping it up to date is a completely different job, and a lot let labour intensive.

Should we teach JOSM to first-time mapathon attendees?

Hi Martin,

This is a slightly more elaborate answer than what I expected :)

Here’s some thoughts: - the longer term retention is worth keeping an eye on as numbers increase, keeping in mind selection and social effects of course - are you sure the difference in labour hours at the events is real and not an artifact of the way JOSM and iD save information to the database? For example, if you take the difference between first and last object saved as labour time, that might affect JOSM negatively. Did you (or could you try) look at the last save during an event minus the start time of the event? Probably difficult on a larger scale, but might be worth a check at a single event. - you filtered away the people who used both editors. It might be interesting to see if they started off using JOSM and gave up to fall back to iD or if they took the other way around. If it looks like the former, one can imagine more people giving up before having anything worth saving.

Thanks again. We’ll think about how we can experiment more at our events in Belgium.

The history and completeness of OSM

Link kaput, and you can’t edit your comments. Hope this works: http://imgur.com/2hxAXNk

The history and completeness of OSM

From what I understand, you look at the shape of the growth curve of the road network. The visual inspection is just a crosscheck. Here’s an example of the curve I’m guessing you use, from Flanders, Belgium.

growth

As you see growth flatten out over the years, you can make the assumption that the road network is “complete” in the sense of having the geometry of almost all roads.

A couple of things to keep in mind:

  • this only holds true if the community is large enough. Growth will also peter out if there are not enough good sat pics left to map or no more people willing to map. I just wrote in my diary about how even in a large community like Flanders, 44% of all nodes were mapped by just one guy. Now imagine how things might go in a place like Bolivia. It might show up in the data as a certain shakiness of the graph, but you do have a number for all countries.

  • as you can see in my example, one road isn’t the other. in Flanders, the growth curve for main roads (tertiary and up) is quite similar to the minor roads (road, unclassified, residential, living street). But that might not always hold. For example, in Africa, a correction of tagging might distort numbers when a lot of tracks suddenly become main roads.

  • I used the Chile map in real life. I don’t think I found any road at all that was missing there. But that brings me to something else: imports (as Chile did). They obviously distort the curve. I would imagine that the road network in the US also looks quite “complete” since many years, as the huge Tiger cleaning operation won’t have much effect on overal network length. Completeness does no necessarily implicate quality; making the measure most useful in countries without a large import.

I’d be very interested to hear more on how you went about this analysis.

How large are our national contributor communities and how are they developing?

I just did some calculations for Belgium based on the history files. I have 8900 contributors by january 2015 (I know, I should get some fresh data). You got to 7622 on more recent data. I think that’s pretty decent, a similar order of difference you get in these comparisons, like I checked here for Brasilia.

Dealing with incorrect tags in Africa

So to second Warin: don’t just “correct wrong tagging”, but get involved in local road tagging discussions. Otherwise you run the risk of getting your work reverted.

Dealing with incorrect tags in Africa

The Highway Tag Africa is highly relevant, however it does not talk about freeways. If there is any type of road we could agree on at a global level, it should be freeways. There is a simple list of physical properties that you can use to decide if it is a freeway or not.

For all other road levels this is much more complex. The African scheme takes into account only economic/practical importance measures, but in countries like Peru and Argentina the government classification has final say.