OpenAddresses: A collection of open address data
In this talk, we'll give an overview of OpenAddresses, a constantly growing collection of address-level geographic data from around the world. We collect data from over 1,100 data sources, lightly modify it to pick out the fields we need, and create a CSV with over 215 million rows of data. The output of our community-built system is used by commercial and government entities to build geocoding tools. A community of 70+ people contribute to this dataset via GitHub, finding new data sources and submitting pull requests for data while a background processing system downloads and manipulates the data in real time. This same setup can easily be used to fetch and merge similar sets of data in a collaborative way and we can show you how it works.