Initially our software was designed for international comparisons and we only supported country-level data. It was a major aim of our current release to support more-fine-grained data, yet when we built the new capabilities we anchored exclusively on US data, easily available through the Census. Before release, I wanted to see if we could handle other sub-national boundaries, which led me to Afghanistan. Finding population data by province was not too hard; parsing it into an appropriate format took a bit more time, which explains the word “clean” in the image titles below — you are seeing my file name notation.
When I was finally able to load the data I was hungry to see the fruit of my labor.
And when this first image loaded up, I confess I was disappointed.
Honestly, I had been hoping for more activity; I had not expected the flatness of all provinces outside of major cities. Then I realized, the very thing I was unhappy about was new information about Afghanistan, new for me at least: Kabul is a large city; so is Ghazi; Kandahar, in orange, is substantially smaller but still a major population center.
In addition, from exploring other data I knew that if the highest value(s) are clipped, one can see a layer below the initial image. In some cases clipping values aids in removing outliers deemed inaccurate. While that step should always be taken cautiously, remembered, and ideally noted, in this case I just wanted to see Afghan population aside from the major cities, the image of which I had already absorbed.
After removing Kabul, new texture showed up in the image:
Next I clipped Ghazni, a city I had never heard of, although maybe I ought not admit such things. And, after removing Kandahar also, I could see even more demographic detail:
Looking at the value labels for each province, I was surprised to see many with a population between 50,000 and 250,000 persons.
From the first image I thought that maybe Afghanistan was mainly barren, with all of the population clustered in the cities, but after drilling in farther I saw many population centers of middling size, spread almost evenly across the country, with some concentration in the north, and with only a small fraction of provinces having extremely sparse population.
On reflection, an image that seemed initially unappealing has taught me a few essentials about the demography of a place otherwise distant and foreign. Still I will wrestle with questions about which image is more accurate: the one that shows how far the the largest cities stand out, or the one which portrays the breadth of general settlement. As with many things, I suspect the answer is that both are valid perspectives; and that maybe even better is having both images at once, a task which maybe we can entrust to our memories.
Last, a word of encouragement to anyone experimenting with the software…
Please, Dive In!
We built the software to make it easy to explore and experiment with different ways of viewing whatever data interests you. And I would be remiss if I didn’t invite you to share what you find. Almost as new parents, we are eager for screen shots that show how our software handles any data that you decide to explore.