Geocoding should be easy. Enter a street address and get a location defined by latitude and longitude.
But if it is so simple, why do consumer apps, and even some commercial apps, return geocodes as much as a quarter mile off the actual location?
Dots on the map represent the coordinates returned by a consumer geocoder and by a commercial geocoder for the same address. The lines illustrate the distance between them. Every dot and line represents a business problem and an expense to manage.
The client that asked us that was struggling with inaccurate data that appeared to be accurate— until it was time to deliver to a location that couldn’t be found!
- There were 6 decimal points in the coordinates.
- There were no flags from the app that the data was questionable,
- And no reason to believe the geocoded data was anything but accurate.
So what went wrong?
1. Not all of the addresses entered for geocoding were properly cleansed and validated. Bad data in equals bad data out. (While geocoding is NOT the same as Address Cleansing and Validation, our software does include an internal cleansing process to return the most accurate data possible. This process, however, is happening behind the scenes. If you want to update your dataset with the cleansed and validated address information, consider adding our Address Validation software to your solution.)
2. The geocoder they used relied on interpolated geocoding rather than point geocoding, and while that may be good enough for some business processes, it was disastrous for theirs.
Interpolated geocoding uses data from a street geographic information system where the street network is mapped within the coordinate space. Each street is assumed to have a certain number and type of addresses and the location is “mapped” in an assumed location along the street – that may or may not reflect its actual location.
3. The client’s geocoder might have:
- Lacked coverage in their targeted areas
- Had outdated, inaccurate, or single-sourced datasets
- Or had weak interpretation and matching algorithms for matching pieces of their input to the underlying databases.
…So in some cases, it “fell back” to a higher level, such as postal code, to produce a geocode, without the client’s approval. He saw six decimal places and assumed it was highly accurate.
INTERPOLATED AND FALL BACK DATA
Interpolated data can geocode a location at quite a distance from its actual location. Sometimes, that’s good enough, but what if you needed to asses this property for flood risk? Is the interpolated data “good enough?”
Geocoders “fall back” when they are unable to verify a location at a certain level of accuracy. For example, if a system cannot locate a specific house number, it will fall back to a street, and from a street to a zip code.
Do you know what level of accuracy is good enough for your application? Or what level of accuracy you are currently getting?
Match Code: What the Geocoder had to do to get a match with the reference dataset. What parts of an address matched and what didn’t.
Location/Geoconfidence Code: Identifies the geographic precision of the coordinate for a matched address.
A better geocoder would have provided Match and Location Confidence codes and allowed our client to control the level of geographic precision it used. Automatic fall back is not nearly good enough for many business processes. That’s why our system gives you control over the fall back.
Our client couldn’t trust his data because he couldn’t trust his geocoder. Can you?
Better geocoders produce better, more precise results more often; get better over time (lesser geocoders operate at the whim of the USPS); and produce consistent results so historical comparisons and other downstream processes work better.
With clean and consistent data, it is possible to surface relevant business insights by understanding the relationships between people, places and things.
Spectrum Geocoding On Demand™ offers rapid deployment, easy integration and cost-effective, highly accurate results with a cloud-based SaaS platform.
- User error is mitigated with on-the-fly run through address correction before geocoding.
- It covers 73 countries at full point address accuracy and another 149 countries at street level accuracy.
- Underlying the results is the latest mapping data from TomTom, HERE, USPS, and more. The highest quality option has data from NINE sources. These datasets are updated automatically to ensure you have the latest location information available.
- It provides Match and Location Confidence Codes.
- In addition, you have the power to control the match process through custom settings to control Precisely’s battle-tested matching algorithms in the default settings.
- And yes, it provides batch processing.
Ready to test your data?
Are you trying to implement Artificial Intelligence, Machine Learning, Single View, MDM (Master Data Management)? Or are you just trying to match items in Excel or Access?
Data Quality for location data is difficult. The Albuquerque list shows variations of just the city name. Imagine the issues with full addresses and the consequences of having combinations of problematic or ambiguous data in multiple fields. You may have addresses that cannot be reliably found, that are actually duplicates, or are very similar to other addresses.
Also, you can’t rely on matching coordinates because the same physical location may have geocoded to a different place – very close or even somewhat distance.
No less than state of the art is required for every step in the process. You need consistent, validated, and standardized address and geocoding data PLUS…
- Advanced matching algorithms to analyze, parse and standardize your data.
- And software with machine learning capabilities to incorporate exceptions and provide flexible, easy to control data governance that makes the process of ensuring data quality as quick and painless as possible.
Our geocoding performs address validation and correction as part of the geocoding process! If you only need address validation and correction, we have that, too. We also cover you when you need the full features of address validation and correction, including detailed diagnostic information about your addresses. All available in batch, interactive, SaaS or self-hosted.