OpenStreetMap logo OpenStreetMap

Post When Comment
New Host for OSM data , archive.org

yes, Well we will be able to check them all out.
My plan is to create a hierarchy of data, where each region (state) that contains another region(county) and so forth (relations and ways that contain each other).

If we find data that does not match or is crossing the border then it can be split up or marked for being manually fixed

Given a hierarchy of data, then we would match it based on the attributes of the EPA datapoints. Does the county match the county from tiger? Does the zipcode match the zipcode from the census.

The census said that they will not update this data, but we can. Given enough test data (zip code attributes) we can find all the ones that break the model and fix it.

Anyway, there is a huge market for this type of processing and I think that OSM or something like it is the right way to go.

I will not commit this data to osm, but keep the osm files on archive.org

if we get enough updates, we can out them into a git repository...

I am starting to think that the monster database idea is not a very good one anyway..

mike

If it turns out that the zip code from the zcta produces bad data,

New Host for OSM data , archive.org

Yes, I have been looking at the data. There are cases were the boundries are not exactly matching, this will have to all be reviewed.

My idea is to make a program to look for containment hierarchies in the data, this region contains this one and to flag errors...

mike

New Host for OSM data , archive.org

Yes, We have two levels. 3 digit ones and 5 digit ones.
the 3 digits contain the 4 digits.

Of course I can import them.... But I will first send a mail to the list.
mike

EPA Bulk Import

I am going to remove all the data that has not been updated by someone manually.
I am working atm on downloading and processing all the points and will setup a separate hosting for the datafiles.

mike

open letter to the EPA

it is not junk, if you think it is junk then revert the changesets and we dont need to talk about it anymore.

With a modify command, I will modify the data and adjust it. But I think the data is still usable as it is, not perfect but a good start.

mike

open letter to the EPA

I have started a wikipage. osm.wiki/EPAGeospatial please add in your comments there.

EPA Bulk Import

Hi tomh,

I understand you concerns. We will see how the community reacts. I have gotten mixed messages.

mike

Next Project for the EPA and Mine data

Well, of course. I am thinking about just using a standard module.

there are other things to do with these nodes :

1. looking for duplicates (pre existing)
2. looking for out of date information.
3. looking for better ways to render.

EPA Bulk Import

I have found alot more information on these sites.
If the data is in error, it can be reported.
Also there are date fields for the records.
There are population information fields, congress districts and more.

http://iaspub.epa.gov/Cleanups/RcraProfile.jsp?handler_id=NJ0000061846

EPA Bulk Import

I did not consult. But I am willing to put in the work to fix it.
It is not that hard to make an update to the data.

Adding the list of mines and quarries in the USA

http://en.wikipedia.org/wiki/Capitalization
http://search.cpan.org/dist/Text-Capitalize/Capitalize.pm
http://search.cpan.org/~summer/Lingua-EN-NameCase/

Next Project for the EPA and Mine data

Ok, the next steps will be to rename all the items removing the CAPSLOCK.
Also to add in the is_in information about the town, county etc.
thanks to goldfndr__ for the suggestions.

Adding the list of mines and quarries in the USA

Ok, I have done a big update on the names
osm.org/browse/changeset/3348720 and others.

Please report any problems.
mike

Adding the list of mines and quarries in the USA

I have reworked a dirt hack version of my merger.
What it does is use the note field to uniquely define what nodes are unique. It takes an osm file that contains the badly named node with ids and the orginal nodes with negative ids in it.
http://bazaar.launchpad.net/~kosova/%2Bjunk/openstreetmapkosova/revision/96#merge_duplicates_formines.pl

Denied Persons List with Denied US Export Privileges

Well, I just found it as possible one after searching for data sources on data.gov. Obviously it is not relevant. I have found better sources since.

Adding the list of mines and quarries in the USA

osm.wiki/Potential_Datasources#National I have documented the sources that I found interesting on data.gov and will be fixing the names today.

Adding the list of mines and quarries in the USA

Yes, I am working on fixing this import. The names will be updated. We can reverse them and I will reimport. Otherwise just wait and i am fixing my merge script. Sorry about that, josm killed the names on me!

Denied Persons List with Denied US Export Privileges

http://www.bis.doc.gov/enforcement/unverifiedlist/unverifiedlist.txt
Unverifed List of Foreign Persons Red Flagged for Exporting Actions

Historical International Station Catalogue

I am talking about this :
International Station Catalogue

http://badc.nerc.ac.uk/data/radiosglobe/stations_sorting_lists/stnlist-historical.html

I found out about the list from the leaked data, but the list is there and public.

Argeltown is now also in openstreetmap

Ok I removed the Chuthulu one as well.