views:

706

answers:

3

I have detailed US county maps, from the TIGER LINE data sets. How might I sample, smooth, or degrade the data so that I get straighter, more boxy, less "noisy" shapes to represent the geographical features -- in this case just county boundaries and state lines, but maybe also in the general case?

The sampling could happen at rendering time if that can be done efficiently, or a parallel data set could be generated and stored. I am using PostGIS, and the lines are multi-polylines generated by shp2pgsql -- but any solution where you take a squiggly line and reduce it to a smoother line of roughly the same significance to a human interpreter would be very useful.

+3  A: 

Here's a simple iterative smoothing algorithm:

for each three sequential points on any path, if the middle point has no intersections and is within some small threshold angle of the direct path between the two outer points, remove it.

Repeat until satisfied.

Will
+6  A: 

The problem with simply throwing away points is that you can quickly distort the shape of the original polygon. A better approach is to come at it from the other direction; start with a basic approximation of the polygon and then refine it upwards towards your complex shape.

An excellent example of this approach is the Douglas-Puecker algorithm. You start with two vertices drawn from the full polygon. Add a third vertex by selecting the one that lies farthest from an edge drawn between the first two vertices. Continue adding points until you have something that sufficiently resembles your original polygon.

ire_and_curses
+8  A: 

Douglas-Peucker is definitely the right approach. There are some simple ways to access implementations of it in PostGIS and QGIS that I thought I would add here for those who come across this post with a similar question. The goal is to start with something like this:

alt text

and end up with something like this:

alt text

In PostGIS Douglas-Peucker is implemented as simplify, the syntax, detailed here at bostongis.org, is some variant of:

SELECT transform(simplify(transform(the_geom, 2249), 500),4326) from the_geo_table

This worked very well even on the full national dataset, with some few errors that seem due to bad underlying data. It also turns out that in QGIS the menu item Tools > Geometry Tools > Simplify Geometries will export a simplified shapefile of any geometry and add it as a layer to your current project.

This is a pretty fundamental tool-set and I asked the question at too low a level, though it was nice to learn the underlying math, there is a good explanation of that here: http://www.mappinghacks.com/code/PolyLineReduction/, along with sample code that turns out not to be too necessary!

bvmou
+1 Excellent. Thanks for taking the time to report this back here with the extra implementation info. The example you give is very nice!
ire_and_curses