--- layout: default title: "The Emergence of Regional Cultures in the United States" description: "Interactive paper viewer for Jaworski and Kimbrough." extra_css: /css/paper-viewer.css extra_js: /js/paper-viewer.js ---

The Emergence of Regional Cultures in the United States

Taylor Jaworski (University of Colorado Boulder and NBER)
Erik O. Kimbrough (Chapman University)

This paper recovers the cultural geography of the United States from first names in complete-count census data spanning 1850 to 1930. Using unsupervised hierarchical clustering of county-level name distributions, the paper identifies spatially coherent cultural regions that align with historically recognized settlement patterns and remain stable across eight decades of economic and institutional change. The deepest division separates North from South, but finer groupings (New England, the Mid-Atlantic, Appalachia, the Deep South) emerge as nested subregions.

Important context

  • The paper tests whether the forces of convergence in nineteenth-century America (the market revolution, westward migration, railroad construction, institutional standardization) homogenized regional cultures or whether distinct cultural regions persisted.
  • It engages with David Hackett Fischer's thesis (Albion's Seed, 1989) that four British colonial folkways (Puritans, Royalists, Quakers, Borderers) established durable regional cultures. The clustering results are consistent with Fischer's account, but recovered from data rather than assumed.
  • First names serve as a cultural marker because naming conventions are transmitted within families and communities, are not directly governed by markets or institutions, and are observable in census microdata at scale.
  • The paper uses only white populations due to severe undercounting of Black and immigrant populations in earlier censuses.

Data and methods

  • Complete-count US census microdata for 1850, 1880, and 1930 (via IPUMS)
  • County-level first-name distributions for white individuals
  • Cosine similarity on county-level name-share vectors to measure cultural distance
  • Hierarchical clustering based on Ward's criterion
  • Results presented for 2, 4, and 7 clusters

Key results

  1. At two clusters, a clear North-South divide emerges in all three census years. The boundary does not follow the free-slave line; it tracks settlement-origin patterns more closely.
  2. At four clusters, the North subdivides into eastern and western regions; the South separates into Upper South and Deep South.
  3. At seven clusters, New England, the Mid-Atlantic, and Appalachia emerge as distinct subregions nested within the broader North-South partition.
  4. Temporal persistence is high: 94% of counties retain their two-cluster assignment between 1880 and 1930; 67% retain it from 1850 to 1930.
  5. Northern naming conventions consolidate over time, while the South remains internally differentiated.