Choropleth map

A choropleth map (from Greek χῶρος "area/region" and πλῆθος "multitude") is a type of thematic map in which areas are shaded or patterned in proportion to a statistical variable that represents an aggregate summary of a geographic characteristic within each area, such as population density or per-capita income.

A choropleth map that visualizes the fraction of Australians that identified as Anglican at the 2011 census

Choropleth maps provide an easy way to visualize how a measurement varies across a geographic area or show the level of variability within a region. A heat map or isarithmic map is similar but does not use a priori geographic areas. They are the most common type of thematic map because published statistical data (from government or other sources) is generally aggregated into well-known geographic units, such as countries, states, provinces, and counties, and thus they are relatively easy to create using GIS, spreadsheets, or other software tools.

Overview

The earliest known choropleth map was created in 1826 by Baron Pierre Charles Dupin.[1] They were first called "cartes teintées" (coloured map in French). The term "choroplethe map" was introduced in 1938 by the geographer John Kirtland Wright in "Problems in Population Mapping".[2]

Choropleth maps are based on statistical data aggregated over previously defined regions (e.g., counties), in contrast to area-class and isarithmic maps, in which region boundaries are defined by data patterns. Thus, where defined regions are important to a discussion, as in an election map divided by electoral regions, choropleths are preferred.

Choropleth maps are generalizations of an areal distribution where real-world patterns may not conform to the regional unit symbolized.[3] Because of this, issues such as the ecological fallacy and the modifiable areal unit problem (MAUP) can lead to major misinterpretations of the data depicted, and other techniques are preferable.[4] Similarly, the size and specificity of the displayed regions depend on the variable being represented. While the use of smaller and more specific regional units such as census tracts, zip codes, or county boundaries can decrease the risk of an ecological fallacy and MAUP, it may inadvertently cause the map (or the data displayed) to appear more sophisticated than reality. Although representing specific data in large regions can be misleading, it can make the map clearer and easier to interpret and remember.[5] The choice of regions will ultimately depend on the map's intended audience and purpose.

The dasymetric technique can be thought of as a compromise approach in many situations. Broadly speaking, choropleths represent two types of data: spatially extensive or spatially intensive.

  • Spatially extensive data are things like populations. The population of the United Kingdom might be 65 million, but it would not be accurate to arbitrarily cut the UK into two halves of equal area and say that the population of each half of the UK is 32.5 million.
  • Spatially intensive data are things like rates, densities, and proportions, which can be thought of conceptually as field data that is averaged over an area. For example, though the United Kingdom's 65 million inhabitants occupy an area of about 240,000 km2 with a population density of about 250/km2, arbitrary halves of an equal area would not possess equal population densities.

Normalization

Normalization:the map on the left uses total population to determine color. This causes larger polygons to appear to be more urbanized than the smaller dense urban areas of Boston, Massachusetts. The map on the right uses population density. A properly normalized map will show variables independent of the size of the polygons.

Another common error in choropleth maps are the use of raw data values to represent magnitude rather than normalized values to produce a map of densities.[6] This is problematic because the eye naturally integrates over areas of the same color, giving undue prominence to larger polygons of moderate magnitude and minimizing the significance of smaller polygons with high magnitudes. The problem with using data in total counts arises when the polygons are not all the same size (in area or total population), as in the figure at right. Because a single color, representing a single value, is spread over the entire area of the district, large areas will be more dominant in the visual hierarchy than they should be, and are commonly misinterpreted as having larger values than smaller districts with the same color. To solve this issue, one can normalize the variable by dividing it by the total area, thus deriving density, which is a field. Another solution is to represent total amounts using a proportional symbol map.

Other valid forms of normalization for choropleth maps can be derived by computing ratios between two total amounts, such as rates of change (e.g. population growth = 2010 population / 2000 population) and mean allocations (e.g. mean family income = total income / total families), or other descriptive statistics such as the median or standard deviation.[7]

Classification

Color progression

When mapping quantitative data, a specific color progression should be used to depict the data properly. There are several different types of color progressions used by cartographers. The following are described in detail in Robinson et al. (1995)[8]

Single hue progression

Single-hue progressions fade from a dark shade of the chosen color to a very light or white shade of relatively the same hue. This is a common method used to map magnitude. The darkest hue represents the greatest number in the data set and the lightest shade representing the least number.

Two variables may be shown through the use of two overprinted single color scales. The hues typically used are from red to white for the first data set and blue to white for the second, they are then overprinted to produce varying hues. These type of maps show the magnitude of the values in relation to each other.

Bi-polar color progression

Bi-polar progressions are normally used with two opposite hues to show a change in value from negative to positive or on either side of some either central tendency, such as the mean of the variable being mapped or other significant value like room temperature. For example, a typical progression when mapping temperatures is from dark blue (for cold) to dark red (for hot) with white in the middle. When one extreme can be considered better than the other (as in this map of life expectancy[9]) then it is common to denote the poor alternative with shades of red, and the good alternative with green.

Complementary hue progressions are a type of bi-polar progression. This can be done with any of the complementary colors and will fade from each of the darker end point hues into a gray shade representing the middle. An example would be using blue and yellow as the two end points.

Blended hue color progression

Blended hue progressions use related hues to blend together the two end point hues. This type of color progression is typically used to show elevation changes. For example, from yellow through orange to brown.

Partial spectral color progression

Partial spectral hue progressions are used to map mixtures of two distinct sets of data. This type of hue progression will blend two adjacent opponent hues and show the magnitude of the mixing data classes.

Full-spectral color progression

Full spectral progression contains hues from blue through red. This is common on relief maps and modern weather maps. This type of progression is not recommended under other circumstances because certain color connotations can confuse the map user.

Value progression

Value progression maps are monochromatic. Although any color may be used, the archetype is from black to white with intervening shades of gray that represent magnitude. According to Robinson et al. (1995).[10] this is the best way to portray a magnitude message to the map audience. It is clearly understood by the user and easy to produce in print.

Qualitative color progression

A Qualitative progression is often used when working with nominal or qualitative data. The colors shown on the map seem unrelated to one another or are arbitrarily chosen. For example, a choropleth map of "most prevalent religion" would be best shown with this type of scheme.

Usability

A choropleth map (top) and a dasymetric map (bottom) of the population of the San Francisco Bay Area in 2000.[11]

When using any of these methods, there are two important principles: first is that darker colors are perceived as being higher in magnitude; second is that, while there are millions of color variations, the human eye is limited as to how many colors it can easily distinguish. Generally, five to seven color categories are recommended. The map user should be able to easily identify the implied magnitude of the hue and to match it with the legend.

Additional considerations include color blindness and various reproduction techniques. For example, the red–green bi-polar progression described in the section above is likely to cause problems for dichromats. A related issue is that color scales which rely primarily on hue with insufficient variation in saturation or intensity may be compromised if reproduced in black and white. Conversely, if a map is legible in black and white (lightness-based), then a prospective user's perception of color is irrelevant.

Color can greatly enhance the communication between the cartographer and their audience, but poor color choice can result in a map that is neither effective nor appealing to the map user. A simpler correlation between color-properties and underlying values is better.[12] With considerations for black-and-white rendering, the color mapping function should be chosen to make sure the lightness of the color is still monotonic, or the uneven change would make it hard to interpret levels, for both normal and colorblind viewers. One commonly-mentioned offender is the "rainbow" palette, the default palette in many statistical applications that has a back-and-forth change in lightness.[13]

See also

Footnotes

  1. "Milestones in the History of Thematic Cartography, Statistical Graphics, and Data Visualization". datavis.ca. Retrieved 5 February 2020.
  2. John Kirtland Wright (1938). "Problems in Population Mapping" in "Notes on statistical mapping, with special reference to the mapping of population phenomena".
  3. Jenks, George F.; Caspall, Fred C. (June 1971). "Error on Choroplethic Maps: Definition, Measurement, Reduction". Annals of the Association of American Geographers. 61 (2): 217–244. doi:10.1111/j.1467-8306.1971.tb00779.x. ISSN 0004-5608.
  4. T. Slocum, R. McMaster, F. Kessler, H. Howard (2009). Thematic Cartography and Geovisualization, Third Edn, pages 85–86. Pearson Prentice Hall: Upper Saddle River, NJ.
  5. Rittschof, Kent (1998). "Learning and Remembering from Thematic Maps of Familiar Regions". Educational Technology Research and Development. 46: 19–38. doi:10.1007/BF02299827.
  6. Mark Monmonier (1991). How to Lie with Maps. pp. 22–23. University of Chicago Press
  7. T. Slocum, R. McMaster, F. Kessler, H. Howard (2009). Thematic Cartography and Geovisualization, Third Edn, page 252. Pearson Prentice Hall: Upper Saddle River, NJ.
  8. Robinson, A.H., Morrison, J.L., Muehrke, P.C., Kimmerling, A.J. & Guptill, S.C. (1995) Elements of Cartography. (6th Edition), New York: Wiley.
  9. Patricia Cohen (9 August 2011). "What Digital Maps Can Tell Us About the American Way". New York Times.
  10. Robinson et al. (1995), op. cit.
  11. "Western Geographic Science Center". Retrieved 13 November 2015.
  12. Light; et al. (2004). "The End of the Rainbow? Color Schemes for Improved Data Graphics" (PDF). Eos. 85 (40): 385–91. doi:10.1029/2004EO400002.
  13. Stauffer, Reto. "Somewhere over the Rainbow". HCL Wizard. Retrieved 14 August 2019.

References

  • Dent, Borden; Torguson, Jeffrey; Hodler, Thomas (21 August 2008). Cartography Thematic Map Design. McGraw-Hill. ISBN 978-0-072-94382-5.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.