Wikipedia is changing to an OSM-compatible Creative Commons license. See http://meta.wikimedia.org/wiki/Licensing_update/Questions_and_Answers. Although the change isn't official yet, it is pretty clear that the licensing will be settled very soon. This means we will be able to use WP data in OSM.
We don't want to make OSM a POI dumping ground for every geocoded Wikipedia article, but some data makes sense to use as a source for permanent OSM map data. The Wikipedia infoboxes are moderately consistent codifications of data, and extracts of them are available at infochimps.org. These can be used, with some pain, to create data to load into OSM.
The first thing I'm looking at is airports. They tend to be permanent landmarks, and have map-worthy data of both name labels and runways, which makes them good candidates for import.
Using these extracts, I have been working on some scripts to turn the airport infobox data into OSM aerodrome nodes and approximately located runway ways. I create an aerodrome node with the designated lat/lon and related information. I download the surrounding existing OSM data, and merge the new node with any existing aerodrome node within 0.01 degrees. I keep name variations as alternate node names ("name_1", "name_2"). I check for runways nearby, and if there aren't any and the WP data has runway info, I create new runway ways. I use the length in meters and the angle from the runway name (runway "3/21" = 30 degrees/210 degrees), and center the runway on the designated point.
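As a rough illustration of the runway geometry described above, here is a minimal Python sketch. It is not the actual script: the function names (`runway_heading_deg`, `runway_endpoints`, `is_nearby`) are hypothetical, the Earth is assumed to be a sphere, and the 0.01-degree merge tolerance is assumed to be a simple per-axis comparison of lat and lon.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius; spherical approximation (assumption)


def runway_heading_deg(designation):
    """Heading in degrees from a runway name such as "3/21" or "09L/27R".

    Only the first end is used: the runway number times 10.
    """
    first_end = designation.split("/")[0].strip()
    digits = "".join(ch for ch in first_end if ch.isdigit())
    return (int(digits) * 10) % 360


def runway_endpoints(lat, lon, length_m, heading_deg):
    """Return the two end points of a runway centered on (lat, lon)."""
    half = length_m / 2.0
    theta = math.radians(heading_deg)
    # Offsets in meters; heading is measured clockwise from north.
    d_north = half * math.cos(theta)
    d_east = half * math.sin(theta)
    # Convert meter offsets to degree offsets at this latitude.
    d_lat = math.degrees(d_north / EARTH_RADIUS_M)
    d_lon = math.degrees(d_east / (EARTH_RADIUS_M * math.cos(math.radians(lat))))
    return (lat - d_lat, lon - d_lon), (lat + d_lat, lon + d_lon)


def is_nearby(lat1, lon1, lat2, lon2, tolerance_deg=0.01):
    """True if two points differ by at most tolerance_deg on each axis."""
    return abs(lat1 - lat2) <= tolerance_deg and abs(lon1 - lon2) <= tolerance_deg
```

For example, `runway_endpoints(lat, lon, length_m, runway_heading_deg("3/21"))` gives the two nodes of a way running along 30/210 degrees through the designated point. The result is only a starting position, since the heading comes from the runway name rather than from a survey.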
I process the airports five at a time, and load the results into JOSM. I use the aerial photos (Landsat or Yahoo, as available) to move each runway into the right place. The angles and locations are close, so it is not hard to get them into the right place. It is annoying that the runways are often (always?) named for magnetic north rather than true north, but with the aerial photos it is easy enough to line them up correctly anyway.
I've fixed 200 or so airports so far. Only a few thousand to go.
Discussion
Comment from RichardB on 13 May 2009 at 21:55
Have you discussed this on the list? Can we be happy that the Wikipedia data hasn't been taken from copyrighted sources?
Comment from ArtyCarty479831 on 13 May 2009 at 22:20
In short, yes, we are OK. All Wikipedia entries are required to be free of copyright encumbrances, and there are plenty of activities in WP around identifying and removing such problems. The current change will simply iron out some minor differences between Wikipedia's prior open license, GFDL, and their new open license, CC-BY-SA. Plenty of details are in the link. Thanks.
Comment from RichardB on 13 May 2009 at 22:33
There was a discussion recently on the talk@ list about the "Wikipedia:Obtaining geographic coordinates" page suggesting one of the ways that editors source coordinates is from Google Maps for example - something which OSM specifically does not condone. Wikipedia has a different idea of copyright to OSM in this instance. Normally OSM takes the view that we "play safe" where possible.
Usually, however, when we're doing a large-scale data import, this gets discussed on one of the talk@ lists.
Comment from ArtyCarty479831 on 13 May 2009 at 22:46
See for example http://lists.openstreetmap.org/pipermail/talk/2009-May/036617.html
Comment from ArtyCarty479831 on 13 May 2009 at 22:56
To clarify, I'm taking locations suggested by Wikipedia (which may, or may not, have come from Google Maps), and placing nodes and ways based on Landsat or Yahoo Aerial photos. The actual coordinates going into OSM are not the Wikipedia points, so not from Google Maps.
In the case of airports in the United States, which is as far as I have gotten so far, it is pretty clear that the Wikipedia information did not originate with Google Maps at all, but with a mass load from FAA public domain airport data. So for the US the point is moot. I have looked at a lot of Wikipedia airport pages so far and seen the same data as in the FAA data.
For airports in the rest of the world, I'm expecting there to be more shake-out on the issue before I get to them.
Comment from ArtyCarty479831 on 13 May 2009 at 23:59
I think this one is the best followup: http://lists.openstreetmap.org/pipermail/talk/2009-May/036661.html
Comment from geobase:acrosscanadatrails on 14 May 2009 at 05:53
Would be great to get the data directly from FAA.
It would help to have WP show the OSM slippy map as a default. Then WP users would know to go to OSM and manually add the waypoint, and would be reminded to back up their waypoint with facts. Our OSM facts are GPS traces, whereas WP facts are based on books and Google Maps.
My best advice is to set up a new user prefixed with something identifiable, and follow the generally accepted guidelines for "bulk import", so your data can be removed if you find out it's not needed.
Until WP stops showing Google Maps as a source for locating geodata, and only uses OSM, it's a never-ending discussion.
Or until Google Maps uses OSM instead of NavTeq. Whichever comes first :)
Great to see more progress in leaning this way :)
Comment from seav on 15 May 2009 at 16:08
> Or until Google Maps uses OSM instead of NavTeq
Nitpick: Google Maps dumped Navteq early last year (around the time Nokia bought Navteq). Since then, Google has relied on Tele Atlas and its own Map Maker.
Comment from axk on 29 May 2009 at 10:23
Would it be possible to limit your changes to a smaller bounding box? -133.418,-42.163,183.455,55.952 (or half the world) seems a little big for a change of 12 nodes and 3 ways [1]. As all your airport changes use such a big bounding box, the history for a small area (like mine [2]) is not meaningful anymore.
Thank you!
[1] osm.org/browse/changeset/1348370
[2] osm.org/history?bbox=124.216%2C10.72%2C125.002%2C11.29
Comment from ArtyCarty479831 on 30 May 2009 at 17:32
Thanks for your feedback. I did not realize there even was a history feature like that. Yes, I'll sort them together to make it more useful. Thanks for the tip.