Chinese Journal of Sociology ›› 2015, Vol. 1 ›› Issue (3): 333-355. doi: 10.1177/2057150X15593710

Statistical coherence of primary schooling in IPUMS-International integrated population samples for China, India, Vietnam and ten other Asia-Pacific countries

Robert McCaa, Lara Cleveland, Patricia Kelly-Hall, Steven Ruggles and Matthew Sobek   

  1. Minnesota Population Center, Minneapolis, USA
  • Online: 2015-09-30 Published: 2015-09-30
  • Contact: Robert McCaa, Minnesota Population Center, 50 Willey Hall, 225 19th Ave. S., Minneapolis, MN 55455, USA. Email: rmccaa@umn.edu

Abstract:

IPUMS-International disseminates harmonized census microdata for more than 80 countries at no cost, although access is restricted to bona fide researchers and students who agree to the stringent conditions-of-use license. Currently over 270 samples are available, totaling more than 600 million person records. Each year, 15–20 additional samples are released, as more countries cooperate with the IPUMS initiative and the integration of 2010 round census samples is completed. With so much microdata so readily available, questions of data quality naturally arise. This article focuses on statistical coherence over time for a single variable, primary schooling completed. From an analysis of the percentage completing primary schooling by birth year for pairs of samples for 13 Asia-Pacific countries, outstanding coherence is found for four countries (China, Mongolia, Vietnam and Indonesia), with mean differences of less than 0.5 percentage points, regression coefficients (b) ranging from 0.93 to 1.07 and R² = 0.99. For the 13 countries as a group there is considerable variation, with mean absolute differences as high as 16 percentage points, b ranging from 0.62 to 1.44 and R² from 0.65 to 0.99. As a whole, statistical coherence of primary schooling is outstanding. Nonetheless, to make expert use of the harmonized microdata, researchers are cautioned to study carefully the IPUMS integrated metadata as well as the original source documentation. National Statistical Offices that are not currently cooperating, or that have not yet entrusted 2010 round census microdata, are invited to do so.

Key words: Primary schooling, statistical coherence, IPUMS-International, population census samples, integrated microdata, microdata access, China, India, Vietnam, Asia, Pacific, Bangladesh, Cambodia, Fiji Islands, Indonesia, Kyrgyz Republic, Malaysia, Mongolia, Pakistan, Philippines, Thailand
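
The coherence measures named in the abstract (mean difference, regression coefficient b and R²) can be illustrated with a minimal sketch. It assumes that each census sample has already been reduced to a mapping from birth year to the percentage completing primary schooling, and that the later sample is regressed on the earlier one; the abstract does not state the direction of the regression or any weighting, so both are assumptions here. The function name `coherence` and the figures in the usage example are hypothetical and are not taken from the article.

```python
def coherence(earlier, later):
    """Compare two samples' percent-completing-primary series by birth year.

    `earlier` and `later` map birth year -> percentage (0-100).
    Only birth years present in both samples are compared.
    Assumption: `later` is regressed on `earlier` with ordinary least squares.
    """
    years = sorted(set(earlier) & set(later))
    if len(years) < 2:
        raise ValueError("need at least two birth years common to both samples")

    x = [earlier[year] for year in years]
    y = [later[year] for year in years]
    n = len(years)

    mean_x = sum(x) / n
    mean_y = sum(y) / n

    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("percentages must vary across birth years")

    diffs = [yi - xi for xi, yi in zip(x, y)]

    return {
        "n_birth_years": n,
        "mean_diff": sum(diffs) / n,                      # later minus earlier
        "mean_abs_diff": sum(abs(d) for d in diffs) / n,
        "b": sxy / sxx,                                   # OLS slope
        "r2": (sxy * sxy) / (sxx * syy),                  # squared correlation
    }


# Made-up percentages for illustration only (not from any IPUMS sample).
sample_a = {1950: 40.0, 1951: 42.5, 1952: 45.0, 1953: 47.0, 1954: 50.5}
sample_b = {1950: 40.5, 1951: 42.0, 1952: 45.5, 1953: 47.5, 1954: 50.0}

for name, value in coherence(sample_a, sample_b).items():
    print(name, round(value, 3))
```

Under these assumptions, a pair of samples with a mean difference near zero, b near 1 and R² near 1 would count as highly coherent in the sense the abstract describes.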