Don’t Hide Your Data in Description Fields – My Plea to KML and GeoRSS Creators
I was pulling together some data for a customer following Hurricane Irene today and kept running into the same problem. Folks creating KML and GeoRSS feeds with awesome statistical data, but leaving it all mixed up with text in a description field. The folks at Google put up a really nice Hurricane Irene response map and had links to many of the layers for download. Downloadable data is awesome, but less awesome when the best data is trapped. Take for instance the Pepco power outage layer from the map; when you load up the data on how many people are affected by the power outage it is trapped in the description field with a bunch of other text. That means you can’t use that data for any kind of analysis unless you want to go parse it out as a new field. This is not Google’s fault. Pepco generated their GeoRSS with the numeric data in the description field. USGS does the same thing with their stream gauge data as you can see here. The majority of KML/GeoRSS feeds we see in the wild fall into this bad habit.
Bill Dollins tweeted out an old school power outage map for SMECO in Southern Maryland. While it lacks a sweet zoomy map it did have a table with zip codes and number of people effected. So, I could copy and paste it into a CSV and load it into GeoCommons no problem. The result was a nice thematic map of people effected by outages by zipcode. Once the numeric data is broken free you can have all sorts of statistical fun with it. So, as we get past red dot fever and just making pretty pictures online, let’s start making data portable for analysis. Get your numeric values in their own field not all junked up with your text. My plea to the KML/GeoRSS data creators of the world.
5 Responses to Don’t Hide Your Data in Description Fields – My Plea to KML and GeoRSS Creators
Leave a Reply Cancel reply
About Us
Welcome to the Esri DC Development Center blog. We write about features of our work on big data analytics, open platforms, and open data, what is new and exciting in the Esri and community, and general industry thought leadership and discussions of geospatial data visualization and analysis.
Please explore what we're working on and let us know if you have any questions or ideas!
New GeoCommons Maps- Pets in Montreal doggyday
- Pennsylvania Counties steve@bikemap.com
- Texas/Arizona Birding Road Trip 2013 afournier
- Info hamiam21
- Tennessee golf courses dgang@tennessean.com
- San Carlos sbutt92
New GeoCommons Datasets- kerentanan
- Kawasan Resiko Bencana Banjir Kabupaten Ponorogo
- Filtered Features of 'Merge of 'Montreal census tracts' into 'Languages spoken at home -montreal'' based on 'MontrealImpactFC2013'
- Intersect_risiko2 REQUIRED: The person responsible for the metadata information.
- Info
- Tennessee Public Courses
Recent Comments
- プラダ 財布 on World Bank’s Mapping for Results updates
- buy twitter follower on Cell phone service providers: Who's on top?
- shops on Dataset of the Day: Mega Millions!!!!
- fashion on Dataset of the Day: U.S. Census Bureau Annual Population Estimates
- outlet on If You Were Sec. Paulson for a Day: A Foreclosure Clearing House?





I’ve run into this quite a bit as well–but I’ve come to the conclusion that this is sort of baked into the cake. KML is not a data format so much as it is a data presentation format. As long as people are using KML as a basic data standard, I think this is going to be a problem. (GeoRSS I can’t speak to as much.)
Right on. The need to scrape data wasn’t what people meant when they said KML was “the HTML of the GeoWeb.”
Timothy – actually KML has this capability as ExtendedData. That allows you to put in structured data in KML. All KML out of GeoCommons uses ExtendedData so you can ensure you still have the underlying data attributes in a structured way.
KML is often referred to as a “presentation format” but it’s more structured than an HTML page and when used properly is an excellent data exchange format.
GeoRSS by comparison is very simple and while you can always extend it, these customizations make it more difficult to generally share around without custom code.
I’ve used six different technology options to parse and convert KML and none of them can support every variety of KML that I come across.
I hate the format for reasons including the one mentioned in this post. It seems that there
is plenty of room in the ‘standard’ for different interpretations about what is valid and even
more permutations from users interpreting how to structure their data.
At least the shapefile has been around for 10 years and every major technology can work
with it.
Günümüzde en çok yapılan estetik diş işlemlerinin başında gelen diş beyazlatma yöntemleri bugün birçok kişi tarafından tercih edilmekte.