i Copyright Notice The copyright of this thesis vests in the University of the Witwatersrand, Johannesburg, South Africa, in accordance with the University‘s Intellectual Property Policy. No portion of the text may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, including analogue and digital media, without prior written permission from the University. Extracts of or quotations from this thesis may, however, be made in terms of Sections 12 and 13 of the South African Copyright Act No. 98 of 1978 (as amended), for non- commercial or education purposes. Full acknowledgement must be made to the author and the University. An electronic version of this thesis is available on the Library webpage (www.wits.ac.za/library) under ―Research Resources‖. For permission requests, please contact the University Legal Office or the University Research Office (www.wits.ac.za). LEARNER MOBILITY IN JOHANNESBURG-SOWETO, SOUTH AFRICA: DIMENSIONS AND DETERMINANTS Julia Ruth de Kadt A thesis submitted to the Wits School of Education, Faculty of Humanities, University of the Witwatersrand in fulfilment of the requirements for the degree of Doctor of Philosophy. Johannesburg, 2011 iii Abstract and Keywords Abstract: Many South African school children are known to travel fairly long distances to school each day, in pursuit of the best possible educational opportunities in a schooling system that is known to vary greatly in quality. This thesis documents the dimensions and determinants of the daily, education- related travel of primary school aged children in Johannesburg-Soweto, South Africa. It uses data on a sample of 1428 children drawn from the Birth to Twenty cohort study to provide the first population-based data on the extent of learner mobility in contemporary urban South Africa. Learner mobility is measured in three different ways: firstly by the straight line distance between a child‘s home and his or her school; secondly by whether the child‘s school falls into the same geographical area as his or her home; and thirdly by whether the child attends his or her nearest, grade-appropriate school. The thesis provides clear evidence for extensive mobility using all three of these approaches to measurement. Over 25% of children were found to be travelling more than 5km each way to school and back on a daily basis. Almost 60% of children attended a school outside of the Census 2001 Sub-Place (roughly equivalent to a suburb) in which they lived, and fewer than 20% of children attended the grade-appropriate school nearest to their home. Counter to expectations, these figures were fairly stable over time, suggesting that educational mobility does not increase substantially as children age or transition to high school. Mobile children attended significantly more well- resourced and well-performing schools than their non-mobile peers, and the quality of schools attended increased with distance travelled. This substantiates the assumption that children and families make use of educational mobility to improve the quality of education that they are able to access. The analyses presented in the thesis suggest that two distinct patterns of mobility, with different determinants, are in use in the Johannesburg-Soweto area. The first relates primarily to travel from townships to historically advantaged schools in suburban Johannesburg, and typically requires substantial economic investment and extensive parental involvement. The second form of mobility operates at a more local level, and relates to children and families making choices between a number of relatively local schools. This form of mobility is less resource intensive. Children engaging in the first form of mobility were more likely to attend a particularly advantaged school, and to have a well-educated mother. By contrast, children engaged in the second form of mobility were more likely to live in a disadvantaged area, and come from households with moderate SES levels. iv The findings of this thesis provide important insights into the nature of school choice in South Africa, which have implications for educational policy, and the understanding of the nature of urban poverty as experienced by South African children. They also contribute to the international school choice literature, by providing novel information about the implications of relatively unregulated school choice for educational inequality and segregation in the South African context. Keywords: Birth to Twenty; cohort data; Johannesburg; learner migration; learner mobility; primary school; quantitative analysis; school choice; travel to school; secondary analysis; South Africa; Soweto v Candidate’s Declaration I declare that this thesis is my own unaided work. It is being submitted for the degree of Doctor of Philosophy at the University of the Witwatersrand, Johannesburg. It has not been submitted before for any degree or examination at any other University. ____________________________ Julia Ruth de Kadt 11th day of May in the year 2011 vi To my parents, in thanks for their support. vii Publications or Presentations Emanating from this Research De Kadt, J., Richter, L., Fleisch, B. & Norris, S. (2010) Measuring learner mobility in Johannesburg-Soweto, South Africa. 2nd Isibalo Young African Statisticians Conference, Pretoria, South Africa, December 1-3. De Kadt, J. (2009). Learner Migration and socio-economic status in Johannesburg, South Africa. United Kingdom Forum for International Education and Training (UKFIET) Conference, Oxford, UK, 15-27 September. viii Acknowledgements My sincere thanks to Prof. Brahm Fleisch, for supervising this thesis, and for his ongoing enthusiasm on the topic. Prof. Linda Richer played an essential role in enabling me to complete this thesis. She provided me with funding for fees, travel to conferences, and a challenging but fulfilling work environment. Despite her frantically busy schedule, she has always found time to read drafts of my work, and to provide guidance, mentorship, encouragement and support. Prof. Shane Norris facilitated my access to the Birth to Twenty data, and has since tirelessly read drafts of my work. He has been a reliable source of valuable advice and suggestions, and his input has strengthened this thesis immeasurably. The whole Birth to Twenty team deserves special thanks, not just for providing the data to me, and following up my endless queries, but also for the dedication that has allowed the collection of such complex data over so many years. Similarly, without the commitment of all the cohort members over many years, this rich and valuable dataset could not exist. The National and Gauteng Departments of Basic Education, and Statistics South Africa, all provided data, without which this thesis would not have been possible. My family has been a source of strength while working on this thesis. My father, Raphael de Kadt, provided ongoing support and encouragement, despite my stubborn independence. My mother, Elizabeth de Kadt, has somehow always known what I needed from her throughout this process, whether it was warm encouragement, a delicious dinner, or a stern talking to. She has provided support in more ways than I can count. My brothers deserve thanks too: Daniel for help with editing, motivating, and providing a steady supply of cute pictures of kittens and puppies; and Chris for generous gifts, dependable advice, and lots of sushi. My involvement in IkamvaYouth introduced me to a group of wonderful young people, about the same age as the Birth to Twenty sample members, and made me realise how important addressing the crises in South Africa‘s school system is. Thank you to the organization, and all its members, for playing an important part in my life, and for understanding when I needed to become less involved. Special thanks to Joy Olivier & Khona Dlamini. Thanks to Anita Craig, and her husband Denham, for allowing me to escape to their lovely home in Cape Town from time to time, and spoiling me with food, wine and all sorts of ideas. ix Colleagues at the Developmental Pathways for Health Research Unit at Wits, from my period at the HSRC, and before that from Princeton, have been a consistent source of encouragement and ideas. Extra special thanks to Dean Janse van Rensburg, for sharing his wizardry with Stata and MS Access, and providing less technical assistance in the form of friendship. The best group of housemates I‘ve ever had also deserve a mention. Thanks to Tawanda Makusha, Rutendo Jaison and Thabo Letsoalo for making my home a wonderful place to be these past few years, and for always being around to listen, talk, and de-stress. Finally, a special thank you to all my friends, scattered all over the world, who have kept me company in one way or another as I worked on this thesis. x Table of Contents Copyright Notice ............................................................................................. i Abstract and Keywords ................................................................................. iii Candidate‘s Declaration ................................................................................. v Publications or Presentations Emanating from this Research ...................... vii Acknowledgements ..................................................................................... viii Table of Contents............................................................................................x List of Figures .............................................................................................. xix List of Tables .............................................................................................. xxii Glossary ..................................................................................................... xxix Chapter 1: Introduction ....................................................................................... 1 1.1 Introduction .......................................................................................... 1 1.2 Defining learner mobility ..................................................................... 3 1.3 Core research questions ....................................................................... 6 1.3.1 What is the extent of learner mobility in South Africa? ............... 8 1.3.2 What are the patterns of learner mobility in South Africa, particularly with respect to socio-economic determinants? .................. 8 1.4 Methodological approach .................................................................. 10 1.5 Rationale ............................................................................................ 11 1.5.1 Rationale for a focus on learner mobility ................................... 11 1.5.2 Rationale for a focus on school choice more broadly ................ 13 1.5.3 Rationale in terms of practical and policy implications ............. 15 1.6 Outline of the thesis ........................................................................... 17 1.7 Conclusion ......................................................................................... 18 xi Chapter 2: Literature review ............................................................................. 20 2.1 Introduction ........................................................................................ 20 2.2 Major forms of school choice policy ................................................. 21 2.3 School choice in South Africa ........................................................... 23 2.3.1 The context of school choice in South Africa ............................ 23 2.3.2 The practice of school choice in South Africa ........................... 26 2.4 Debates in the international school choice literature ......................... 28 2.4.1 School choice and quality ........................................................... 29 2.4.2 School choice and equality ......................................................... 29 2.5 School choice and equality in South Africa ....................................... 37 2.6 Methodological approach .................................................................. 39 2.7 Conceptual framework ....................................................................... 42 2.8 Conclusion ......................................................................................... 47 Chapter 3: Methods .......................................................................................... 48 3.1 Methodological approach: quantitative secondary data analysis ....... 48 3.2 Dataset selection ................................................................................ 51 3.2.1 Birth to Twenty ........................................................................... 52 3.3 Ethical considerations ........................................................................ 54 3.4 Overview of data and variables used ................................................. 56 3.4.1 Child level variables ................................................................... 57 3.4.2 Household and maternal level variables ..................................... 59 3.4.3 Community level variables ......................................................... 63 3.4.4 School variables .......................................................................... 67 3.5 Sample selection and creation of the analytical database .................. 74 3.5.1 Sample selection ......................................................................... 74 3.5.2 Creation of the analytical dataset ............................................... 76 3.6 Operationalization of learner mobility ............................................... 79 xii 3.6.1 A distance-based definition of mobility ..................................... 79 3.6.2 An area based definition of mobility .......................................... 81 3.6.3 Mobility defined by attendance at the nearest school ................ 82 3.7 Analysis ............................................................................................. 84 3.8 Conclusion ......................................................................................... 85 Chapter 4: Sample descriptive statistics and representativity .......................... 86 4.1 Introduction ........................................................................................ 86 4.2 Sample descriptive statistics .............................................................. 86 4.2.1 Child level variables ................................................................... 87 4.2.2 Household level variables ........................................................... 88 4.2.3 Community level variables ......................................................... 90 4.3 Relationships between variables ........................................................ 91 4.4 Study sample representativity ............................................................ 91 4.4.1 How representative is Birth to Twenty? ..................................... 92 4.4.2 How representative is the study sub-sample? ............................. 99 4.4.3 Sample selection & bias: Conclusion ....................................... 107 4.5 Descriptive schools data: all Gauteng schools ................................. 108 4.5.1 School types and sectors ........................................................... 109 4.5.2 School Quintile ......................................................................... 109 4.5.3 Section 21 Status ...................................................................... 110 4.5.4 School enrolment ...................................................................... 111 4.5.5 Percentage of black learners ..................................................... 112 4.5.6 School fees ................................................................................ 117 4.5.7 Historical racial status of the school ......................................... 118 4.5.8 Matric pass rate ......................................................................... 120 4.5.9 Descriptive schools data: discussion ........................................ 123 4.6 Conclusion ....................................................................................... 124 xiii Chapter 5: Measuring the extent of learner mobility in contemporary urban Johannesburg-Soweto ..................................................................................... 125 5.1 Introduction ...................................................................................... 125 5.2 Distance-based operationalization of learner mobility .................... 126 5.2.1 Actual straight-line distance from home to school ................... 126 5.2.2 Binary definitions of mobility .................................................. 132 5.3 Area-based operationalization of learner mobility .......................... 135 5.4 Nearest school based operationalization of learner mobility ........... 137 5.5 Conclusion: ...................................................................................... 140 Chapter 6: Individual, family and community characteristics of mobile learners ........................................................................................................................ 142 6.1 Introduction ...................................................................................... 142 Child level characteristics ........................................................................... 143 6.2 Race ................................................................................................. 143 6.2.1 1997 .......................................................................................... 143 6.2.2 2003 .......................................................................................... 145 6.2.3 Race and mobility discussion ................................................... 147 6.3 Gender .............................................................................................. 147 6.3.1 1997 .......................................................................................... 148 6.3.2 2003 .......................................................................................... 150 6.3.3 Gender and mobility discussion ............................................... 153 6.4 Age at first school enrolment ........................................................... 153 6.4.1 1997 .......................................................................................... 154 6.4.2 2003 .......................................................................................... 155 6.4.3 Age at first enrolment and mobility discussion ........................ 156 6.5 School phase in 2003 ....................................................................... 157 6.5.1 1997 .......................................................................................... 160 xiv 6.5.2 2003 .......................................................................................... 162 6.5.3 School phase in 2003 and mobility discussion ......................... 164 6.6 Grade repetition ............................................................................... 165 6.6.1 1997 .......................................................................................... 166 6.6.2 2003 .......................................................................................... 167 6.6.3 Grade repetition and mobility discussion ................................. 169 Household level characteristics .................................................................. 169 6.7 Maternal education .......................................................................... 169 6.7.1 1997 .......................................................................................... 170 6.7.2 2003 .......................................................................................... 173 6.7.3 Maternal education and mobility discussion ............................ 175 6.8 Maternal Marital status .................................................................... 177 6.8.1 1997 .......................................................................................... 177 6.8.2 2003 .......................................................................................... 178 6.8.3 Maternal marital status and mobility discussion ...................... 180 6.9 Household SES ................................................................................ 180 6.9.1 1997 .......................................................................................... 181 6.9.2 2003 .......................................................................................... 183 6.9.3 Household SES and mobility discussion .................................. 186 Community level characteristics ................................................................ 187 6.10 Residential area poverty ............................................................... 187 6.10.1 1997 ........................................................................................ 188 6.10.2 2003 ........................................................................................ 195 6.10.3 Discussion of residential area poverty and mobility ............... 199 6.11 Conclusion .................................................................................... 201 Chapter 7: School characteristics associated with mobile learners ................ 205 7.1 Introduction ...................................................................................... 205 xv 7.2 Schools attended by study sample members, and grade-appropriate schools closest to study sample members‘ homes ...................................... 206 7.2.1 Unweighted data ....................................................................... 206 7.2.2 Weighted data ........................................................................... 210 7.3 Which children attend which schools? ............................................ 217 7.3.1 Child level variables ................................................................. 217 7.3.2 Family & household variables: ................................................. 225 7.3.3 Community level variables: ...................................................... 231 7.3.4 Child, household and community level variables associated with school enrolment patterns: Discussion .............................................. 236 7.4 Relationships between school attributes and mobility behaviours .. 237 7.4.1 School sector ............................................................................ 237 7.4.2 School quintile .......................................................................... 238 7.4.3 Section 21 status ....................................................................... 240 7.4.4 School Enrolment ..................................................................... 241 7.4.5 Proportion black learners .......................................................... 242 7.4.6 School fees ................................................................................ 244 7.4.7 Historical DET .......................................................................... 245 7.4.8 Matric pass rate ......................................................................... 246 7.4.9 Relationships between school attributes and mobility behaviour: Discussion ......................................................................................... 247 7.5 Conclusion ....................................................................................... 248 Chapter 8: Changes in educational mobility over time .................................. 250 8.1 Introduction ...................................................................................... 250 8.2 Changing schools ............................................................................. 250 8.2.1 Correlates of school change during primary schooling ............ 251 8.2.2 Correlates of school change associated with the transition to high school ................................................................................................ 255 8.3 The nature of changes in mobility ................................................... 257 8.3.1 Straight-line distance ................................................................ 257 xvi 8.3.2 Census geography ..................................................................... 265 8.3.3 Nearest school .......................................................................... 268 8.4 Correlates of type of mobility change for primary school school- changers ...................................................................................................... 270 8.4.1 Straight-line distance ................................................................ 271 8.4.2 Census geography ..................................................................... 272 8.4.3 Nearest school .......................................................................... 275 8.4.4 Conclusion: primary school school-changers ........................... 276 8.5 Correlates of type of mobility change for children transitioning to high school .................................................................................................. 277 8.5.1 Straight-line distance ................................................................ 277 8.5.2 Census geography ..................................................................... 278 8.5.3 Nearest school .......................................................................... 282 8.5.4 Conclusion: mobility change associated with transition to high school ................................................................................................ 282 8.6 Conclusion ....................................................................................... 283 Chapter 9: Modelling educational mobility .................................................... 285 9.1 Introduction ...................................................................................... 285 9.2 Straight-line distance ....................................................................... 286 9.2.1 1997 .......................................................................................... 286 9.2.2 2003 .......................................................................................... 289 9.3 Census area geography .................................................................... 291 9.3.1 Sub-Place level ......................................................................... 292 9.3.2 Main Place level ....................................................................... 294 9.3.3 Census area mobility discussion ............................................... 297 9.4 Nearest school analysis .................................................................... 297 9.5 Conclusion ....................................................................................... 300 xvii Chapter 10: Conclusion .................................................................................. 303 10.1 Introduction .................................................................................. 303 10.2 Overview of key findings ............................................................. 303 10.2.1 Developing approaches to measuring learner mobility ........... 305 10.2.2 Measuring the extent of learner mobility in Johannesburg- Soweto ....................................................................................................... 306 10.2.3 Potential child, household and community-level determinants of learner mobility ................................................................................. 307 10.2.4 Potential school-level determinants of learner mobility ......... 309 10.2.5 Changes in mobility as children age .......................................... 310 10.2.6 Predicting mobility ...................................................................... 311 10.2.7 Developing an evidence based conceptual framework for the study of learner mobility ................................................................... 314 10.3 Key contributions ......................................................................... 315 10.3.1 Methodological contributions ..................................................... 315 10.3.2 Empirical contributions ............................................................... 316 10.3.3 Theoretical contributions ............................................................ 318 10.4 Contextual relevance .................................................................... 320 10.4.1 Relevance to South Africa .......................................................... 320 10.4.2 International relevance ................................................................ 322 10.5 Project limitations and future work .............................................. 323 10.5.1 Sample composition..................................................................... 323 10.5.2 Study end point and longitudinal analysis ................................ 323 10.5.3 Methodological approach ............................................................ 324 10.5.4 Future work ................................................................................... 325 10.6 Conclusion .................................................................................... 328 Appendix A: Alternative data sources considered for the thesis .................... 329 A.1 Cape Area Panel Study (CAPS) ............................................... 329 A.2 Kwa-Zulu Natal Income Dynamics Survey (KIDS)................. 330 xviii A.3 National Income Dynamics Survey (NIDS) ............................. 331 A.4 Agincourt Health and Demographic Surveillance System (Agincourt) ........................................................................................ 332 A.5 Africa Centre for Health and Population Studies ..................... 332 Appendix B: Letter confirming approval of ethics clearance for thesis, received from the School of Education, University of the Witwatersrand ................... 333 Appendix C: Relationships within the study sample between variables hypothesized to act as determinants of learner mobility ................................ 336 C.1 Race and other variables .................................................................. 336 C.2 Gender and other variables .............................................................. 337 C.3 Age at first enrolment and other variables ....................................... 337 C.4 Phase of education in 2003 and other variables ............................... 338 C.5 Grade repetition and other variables ................................................ 338 C.6 Maternal education and other variables ........................................... 338 C.7 Maternal marital status and other variables ..................................... 339 C.8 Household SES and residential area poverty levels ......................... 339 C.9 Conclusion ....................................................................................... 340 References ...................................................................................................... 341 xix List of Figures Figure 2.1: Theoretical framework, based on De Jong (2000) and Hanson and Litten (1982) ......................................................................................... 44 Figure 3.1: Location of study variables within the conceptual framework presented in Chapter 2 .......................................................................... 56 Figure 3.2: Flow chart illustrating selection of sub-sample for use in thesis. .. 76 Figure 4.1: Distribution of Gauteng schools by the proportion of their learners who are black ...................................................................................... 113 Figure 4.2: Distribution of Gauteng schools by the proportion of their learners who are black ...................................................................................... 114 Figure 4.3: Distribution of the proportion of black learners for public primary and secondary schools in Gauteng ..................................................... 115 Figure 4.4: Kernel density plots of pass rates at primary and secondary schools ............................................................................................................ 121 Figure 5.1: Kernel density plots of the distribution of distances to school for the small and large samples, curtailed at a maximum distance of 60km, for both 1997 and 2003. ...................................................................... 128 Figure 5.2: Kernel density plot overlaid on histogram illustrating the distribution of distances travelled by sample members in 1997. The log transformation of the distribution is also provided. ............................ 130 Figure 5.3: Kernel density plot overlaid on histogram illustrating the distribution of distances travelled by sample members in 2003. The log transformation of the distribution is also provided ............................. 131 Figure 5.4: Cumulative density plot of distance between home and school, up to 10km, laid over a histogram illustrating the density distribution of distance ............................................................................................... 134 Figure 6.1: Kernel density plot of distance to school in 1997, on the basis of race ..................................................................................................... 144 Figure 6.2: Kernel density plot of distance to school in 1997, on the basis of race ..................................................................................................... 146 xx Figure 6.3: Kernel density plot of 1997 distance from home to school, by gender ................................................................................................. 149 Figure 6.4: Kernel density plot of 2003 distance from home to school, by gender ................................................................................................. 151 Figure 6.5: Kernel density plot of 1997 distance from home to school, by age at first school enrolment ..................................................................... 154 Figure 6.6: Kernel density plot of 2003 distance from home to school by age at first school enrolment ......................................................................... 156 Figure 6.7: Kernel density plot of 1997 distance from home to school by 2003 schooling phase .................................................................................. 161 Figure 6.8: Kernel density plot of 2003 distance from home to school by phase of education in 2003 ........................................................................... 162 Figure 6.9: Kernel density plot of 2003 distance from home to school by gender and phase of education in 2003 .............................................. 164 Figure 6.10: Kernel density plot of 1997 distance from home to school by grade repetition ................................................................................... 167 Figure 6.11: Kernel density plot of 2003 distance from home to school by grade repetition ................................................................................... 168 Figure 6.12: Kernel density plot of 1997 distance from home to school by maternal education level. .................................................................... 171 Figure 6.13: Kernel density plot of 2003 distance from home to school by maternal education level ..................................................................... 174 Figure 6.14: Kernel density plot of 1997 distance from home to school by maternal marital status ........................................................................ 178 Figure 6.15: Kernel density plot of 2003 distance from home to school, by maternal marital status ........................................................................ 179 Figure 6.16: Kernel density plot of 1997 distance from home to school, by 1997 household SES quintile .............................................................. 182 Figure 6.17: Kernel density plot of 2003 distance from home to school, by 2003 household SES quintile .............................................................. 184 xxi Figure 6.18: Kernel density plot of 1997 distance from home to school, by SAL poverty quintile .......................................................................... 190 Figure 6.19: Kernel density plot of 1997 distance from home to school, by SP poverty quintile ................................................................................... 192 Figure 6.20: Kernel density plot of 1997 distance from home to school, by MP poverty quantile .................................................................................. 194 Figure 6.21: Kernel density plot of 2003 distance from home to school, by SAL poverty quintile .......................................................................... 196 Figure 6.22: Kernel density plot of 2003 distance from home to school, by SP poverty quintile ................................................................................... 197 Figure 6.23: Kernel density plot of 2003 distance from home to school, by MP poverty quantile .................................................................................. 199 Figure 8.1: Kernel density plot of the change in distance from home to school between 1997 and 2003 for all sample members with full mobility information ......................................................................................... 259 Figure 8.2: Kernel density plot of the change in distance from home to school between 1997 and 2003 for all sample members who changed schools ............................................................................................................ 259 Figure 8.3: Distribution of change in distance from home to school for children travelling less far in 2003, by schooling phase ................................... 262 Figure 8.4: Distribution of change in distance from home to school for children travelling further in 2003, by schooling phase ................................... 262 Figure 10.1: Conceptual framework revised on the basis of study findings .. 315 xxii List of Tables Table 3.1: Variables used in the creation of SES scores .................................. 62 Table 3.2: Sources for school-level variables used .......................................... 68 Table 4.1: Breakdown of study sample members by race ................................ 87 Table 4.2: Breakdown of study sample members by gender ............................ 87 Table 4.3: Breakdown of study sample members by age at first school enrolment .............................................................................................. 87 Table 4.4: Breakdown of study sample members by phase of schooling in 2003 .............................................................................................................. 88 Table 4.5: Breakdown of study sample members by whether or not they have repeated at least one grade between 1997 and 2003 ............................. 88 Table 4.6: Breakdown of study sample by highest level of maternal education attained at the time at which the study sample member was born ....... 89 Table 4.7: Breakdown of study sample members by maternal marital status .. 89 Table 4.8: Breakdown of study sample by household SES in 1997 ................. 89 Table 4.9: Breakdown of study sample members by household SES in 2003 . 90 Table 4.10: Breakdown of study sample members by the poverty level of the SAL in which they live ......................................................................... 90 Table 4.11: Breakdown of study sample members by the poverty level of the SP in which they live ............................................................................ 91 Table 4.12: Breakdown of study sample members by the poverty level of the MP in which they live ........................................................................... 91 Table 4.13: Differences between cohort members included in the study sample, and those excluded from the study sample with regards to all available demographic variables collected at birth ............................................ 102 Table 4.14: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to variables collected at birth ................................................................................. 103 Table 4.15: Availability of 1997 SES data for members of the non-attrition sample included in and excluded from the study sample ................... 105 xxiii Table 4.16: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to SES in 1997 .................................................................................................... 106 Table 4.17: Availability of 2003 SES data for members of the non-attrition sample included in and excluded from the study sample ................... 107 Table 4.18: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to SES in 2002/2003 ........................................................................................... 107 Table 4.19: Numbers of schools in each quintile in Gauteng province .......... 110 Table 4.20: Average number of learners for different types of schools in Gauteng ............................................................................................... 111 Table 5.1: Comparison of findings on distances travelled from home to school for different datasets, for 1997 and 2003. ........................................... 127 Table 5.2: Distribution of distances and log distances travelled by sample members ............................................................................................. 130 Table 5.3: Distribution of distances and log distances travelled by sample members ............................................................................................. 131 Table 5.4: Numbers and percentages of children classified as mobile in 1997 and 2003, for each binary definition of mobility considered. ............ 133 Table 5.5: Number and percent of children who live and attend school in the same SP, MP and MN areas, in 1997 and 2003 ................................. 136 Table 5.6: Correlation coefficients between distance-based and area-based measures of mobility for 1997 ............................................................ 137 Table 5.7: Correlation coefficients between distance-based and area-based measures of mobility for 2003 ............................................................ 137 Table 5.8: Number and percentage of learners attending the school closest to their home in 1997 and 2003, and the mean and maximum distances to the schools nearest to sample members‘ homes ................................. 139 Table 6.1: 1997 Distance between home and school on the basis of race ...... 144 Table 6.2: 1997 mobility at different levels of census geography, by race .... 145 xxiv Table 6.3: Children attending their nearest grade-appropriate school, by race, for public schools only, and for all schools ........................................ 145 Table 6.4: 2003 Distance between home and school on the basis of race ...... 146 Table 6.5: 2003 Mobility across different levels of census geography, by race ............................................................................................................ 147 Table 6.6: 2003 Children attending their nearest grade-appropriate school, by race, both for public schools only, and for all schools ....................... 147 Table 6.7: 1997 distance from home to school, by gender ............................. 148 Table 6.8: Gender breakdown of 1997 categories of distance from home to school .................................................................................................. 149 Table 6.9: 1997 mobility across different levels of census geography, by race ............................................................................................................ 150 Table 6.10: 1997 distance from home to school, by gender ........................... 151 Table 6.11: Gender breakdown of 2003 categories of distance from home to school .................................................................................................. 152 Table 6.12: 2003 mobility across different levels of census geography, by gender ................................................................................................. 152 Table 6.13: 1997 distance from home to school, by age at first enrollment .. 154 Table 6.14: 2003 distance from home to school, by age at first enrolment ... 155 Table 6.15: 1997 distance from home to school, by schooling phase in 2003 ............................................................................................................ 160 Table 6.16: 2003 distance from home to school, by progression to high school by 2003 ............................................................................................... 162 Table 6.17: 2003 distance from home to school, by gender and phase of education in 2003 ............................................................................... 163 Table 6.18: 1997 distance from home to school, by grade repetition status .. 166 Table 6.19: 2003 distance from home to school, by grade repetition status .. 168 Table 6.20: 1997 distance from home to school, by maternal education level ............................................................................................................ 170 Table 6.21: 1997 mobility across different levels of census geography, by maternal education level ..................................................................... 172 xxv Table 6.22: Children attending their 1997 nearest grade-appropriate public school, by maternal education ............................................................ 173 Table 6.23: 2003 distance from home to school, by maternal education level ............................................................................................................ 173 Table 6.24: 2003 mobility across different levels of census geography, by maternal education level ..................................................................... 175 Table 6.25: Children attending their 2003 nearest grade-appropriate school, by maternal education .............................................................................. 175 Table 6.26: 1997 distance from home to school, by maternal marital status . 177 Table 6.27: 2003 distance from home to school, by maternal marital status . 179 Table 6.28: 1997 distance from home to school, by 1997 household SES quintile ................................................................................................ 181 Table 6.29: 1997 mobility across different levels of census geography, by 1997 household SES .................................................................................... 183 Table 6.30: Children attending their 1997 nearest grade-appropriate school, by 1997 household SES ........................................................................... 183 Table 6.31: 2003 distance from home to school, by 2003 household SES quintile ................................................................................................ 184 Table 6.32: 2003 mobility across different levels of census geography, by 2003 household SES .................................................................................... 185 Table 6.33: Children attending their 2003 nearest grade-appropriate school, by 2003 household SES ........................................................................... 186 Table 6.34: 1997 distance from home to school, by SAL poverty quintile .... 189 Table 6.35: 1997 distance from home to school, by SP poverty quintile ....... 191 Table 6.36: 1997 distance from home to school, by MP poverty quantile ..... 193 Table 6.37: 2003 distance from home to school, by SAL poverty quintile .... 195 Table 6.38: 2003 distance from home to school, by SP poverty quintile ....... 197 Table 6.39: 2003 distance from home to school, by MP poverty quantile ..... 198 Table 7.1: Quintiles of schools attended by and nearest to sample members‘ homes .................................................................................................. 208 xxvi Table 7.2: Schools attended by sample members (note: schools classified as combined are included in both columns for 2003) ............................. 212 Table 7.3: Proportion of sample members closest to and attending independent schools ................................................................................................ 212 Table 7.4: Distribution of sample members across schools by school quintile rating ................................................................................................... 213 Table 7.5: Proportion of schools without Section 21 status nearest to and attended by sample members .............................................................. 213 Table 7.6: Size of schools nearest to and attended by sample members ........ 214 Table 7.7: Proportion of black learners at schools nearest to and attended by sample members ................................................................................. 215 Table 7.8: Fees charged by schools nearest to and attended by sample members ............................................................................................................ 216 Table 7.9: Proportion of schools nearest to and attended by sample members that were historically under the DET .................................................. 216 Table 7.10: Pass rates of schools nearest to and attended by sample members. ............................................................................................................ 217 Table 7.11: Relationship between child race (black and coloured children only) and properties of the school he or she attends .................................... 219 Table 7.12: Relationship between child gender and properties of the school he or she attends ...................................................................................... 220 Table 7.13: Relationship between child age at first school enrolment and school properties ................................................................................. 222 Table 7.14: Relationship between grade repetitions between 1997 and 2003, and school properties .......................................................................... 224 Table 7.15: Relationship between maternal education and school properties 226 Table 7.16: Relationship between maternal marital status and school properties ............................................................................................................ 228 Table 7.17: Relationship between 1997 household SES and school attributes in both 1997 and 2003 ............................................................................ 229 xxvii Table 7.18: Relationship between 2003 household SES and school attributes in 1997 and 2003 .................................................................................... 231 Table 7.19: Relationship between residential SAL poverty and school attributes in 1997 and 2003 ................................................................ 232 Table 7.20: Relationship between SP poverty and school properties ............ 234 Table 7.21: Relationship between MP poverty and school properties ........... 236 Table 7.22: Relationship between sector of attended school and learner mobility ............................................................................................... 238 Table 7.23: Relationship between quintile of attended school and learner mobility ............................................................................................... 240 Table 7.24: Relationship between Section 21 status of attended school and learner mobility .................................................................................. 241 Table 7.25: Relationship between size of attended school and learner mobility ............................................................................................................ 242 Table 7.26: Relationship between proportion of black students at attended school and learner mobility ................................................................ 244 Table 7.27: Relationship between fees of attended school and learner mobility ............................................................................................................ 245 Table 7.28: Relationship between historical DET status of attended school and learner mobility .................................................................................. 246 Table 7.29: Relationship between quintile of attended school and learner mobility ............................................................................................... 247 Table 8.1: Proportion of sample members remaining in the same school from 1997 to 2003, and proportion changing schools (note that information on schooling phase is only available for 1196 individuals) ............... 251 Table 8.2: Distribution of sample members with full residential and schooling data for both 1997 and 2003 across schooling phases, and stability of mobility behaviour .............................................................................. 257 Table 8.3: Changes in distance from home to school between 1997 for all sample members, disaggregated by schooling phase and school change status ................................................................................................... 258 xxviii Table 8.4: Counts of sample members who live closer to or further from their school in 2003 as compared to 1997, disaggregated by phase of schooling ............................................................................................. 261 Table 8.5: Children attending schools in the same categories of distance from their homes in 1997 and 2003, disaggregated by schooling phase ..... 263 Table 8.6: Numbers of children moving between schools in different geographical areas, disaggregated by schooling phase ...................... 265 Table 8.7: Changes in distance from home to school experienced by children moving between schools in different geographic areas, disaggregated by schooling phase ............................................................................. 266 Table 8.8: Children attending school in the same area as their home, for three different levels of geography, disaggregated by schooling phase ...... 268 Table 8.9: Children attending their nearest grade-appropriate public school in 1997, 2003 and at both points, disaggregated by schooling phase ..... 269 Table 9.1: 1997 regression results. Figures in parentheses are standard errors. ............................................................................................................ 289 Table 9.2: 2003 regression results. Figures in parentheses are standard errors. ............................................................................................................ 291 Table 9.3: 1997 & 2003 SP mobility regression results. Figures in parentheses are standard errors. ............................................................................. 294 Table 9.4: 1997 and 2003 MP mobility regression results. Figures in parentheses are standard errors. .......................................................... 296 Table 9.5: 1997 and 2003 nearest school attendance logistic regression results. Figures in parentheses are standard errors. ......................................... 300 Table 9.6: Summary of variables associated with increased mobility in the regression models presented in this chapter ....................................... 301 Table 10.1: Overview of the key findings presented in this thesis ................. 305 Table 10.2: Summarized results for the models of mobility developed in Chapter 9. Results presented are for regressions with robust errors. . 313 Table A.1: Significance of relationships between household SES and residential area poverty levels ............................................................ 340 xxix Glossary ASS – Annual Schools Survey Bt20 – Birth to Twenty Cohort Study DET – Department of Education and Training DOE – Department of Education EEA – Employment of Educators Act EMIS – Education Management Information System GDE – Gauteng Department of Education GIS – Geographical Information System MN – Municipality level MP – Main place level NEPA – National Education Policy Act PCA – Principal Components Analysis SAL – Small area level SASA – South Africa Schools Act SES – Socio-economic status SGB – School Governing Body SP – Sub-place level SRN – School Register of Needs 1 Chapter 1: Introduction 1.1 Introduction Much recent work has highlighted the generally poor state of education in South Africa. Schools, particularly those serving the less privileged, tend to be poorly performing, and the skill levels of South African children are notoriously low (Reddy 2006; Fleisch 2008; Spaull 2011; van der Berg, Burger et al. 2011). Simultaneously, the schooling system is also known for the highly variable resource levels enjoyed by different schools within the public sector, and the enormous variations in school performance that tend to accompany this. School quality is closely connected to South African history, with most well performing schools being those that historically educated white1 children. This means that well performing schools are usually located in historically white areas, and that school quality is closely related to geography (Fiske and Ladd 2004; Woolman and Fleisch 2006; Spaull 2011; van der Berg, Burger et al. 2011). Within this context, and in the aftermath of Apartheid‘s Bantu education policies, education is very highly valued by many South Africans. Completing secondary school and reaching tertiary education are core goals held by many young people, regardless of their backgrounds. That there is a strong relationship between the quality of the school attended, and future opportunities, is widely accepted. As a result, children and families are often willing to go to great lengths to ensure the best possible educational opportunities (Woolman and Fleisch 2006; Lombard 2007). 1 In this thesis, the four race groups defined by the Apartheid-era government will be used to categorize individuals, due to South Africa‘s unique historical context, and the ongoing relevance of these categories to the life experiences and educational opportunities of young South Africans. 2 This thesis explores some of the ways in which children and families in post- Apartheid Johannesburg-Soweto pursue the highest-quality educational opportunities possible. In particular, it focuses on school choice, and the associated phenomenon of learner mobility. Learner mobility is used to denote the travel of learners to attend schools other than those closest to their homes. Anecdotal evidence suggests that learner mobility is both widespread and substantial in contemporary South Africa, but very little is actually known about its dimensions, drivers and implications. Although several studies have identified its occurrence, often at fairly substantial levels, and some have even discussed it in relative depth (Fiske & Ladd, 2004; Nelson Mandela Children's Fund, 2005; Paterson & Kruss, 1998; Soudien & Sayed, 2003; Woolman & Fleisch, 2006) there has been only one attempt to construct a broader, data- driven picture (Sekete, Shilubane, & Moila, 2001). Although this study provides valuable baseline data for a critical period of history, it was not conducted on a population based sample, meaning that the applicability of its findings to a broader urban population is not clear. Given the apparently high level of learner mobility in urban South Africa, and its potentially significant implications for the South African educational system, updating and deepening our understanding of this phenomenon is an urgent need. While we know that learner mobility appears to be extensive, and that due to the distribution of educational opportunities in contemporary South Africa it is likely to have substantial importance to the life chances of young urban South Africans, we know very little about the actual extent of engagement in school choice and learner mobility. Although we have reason to believe that most learner mobility involves travelling to historically advantaged schools, we know very little about where children actually tend to enrol in school, relative to where they live. Our knowledge about the determinants of school choice and learner mobility is also very limited. This thesis uses secondary quantitative analysis of longitudinal data from the Birth to Twenty (Bt20) study, which tracked a sample of 3273 young people born in the highly urbanized 3 Johannesburg-Soweto area in a six week period in 1990, to address these questions. Establishing the dimensions, patterns and correlates of learner mobility in South Africa, and beginning to understand its implications, will fill significant gaps in the local and international literatures on school choice, and have important implications for the design and implementation of policies to ensure genuinely equitable access to high quality education in South Africa. 1.2 Defining learner mobility Before detailing the research questions addressed by this thesis, it is critical to clearly define learner mobility. The term ―learner mobility‖ is used to refer to the daily travel of learners to a school that is not the school nearest to their home (Karlsson 2007). It is derived from the phrase ―learner migration‖, which has previously been used in the South African literature on the travel patterns of learners (Sekete, Shilubane et al. 2001; Lombard 2007). This term, however, has not obtained widespread usage outside of this fairly limited literature. In addition, it does not adequately differentiate between daily travel or commuting, and genuine migration in which learners spend nights away from their permanent family residence for the purpose of attending a specific school. As both of these practices are believed to be widespread in contemporary South Africa, it is essential to distinguish between them, particularly as their implications for individual learners, the educational system and society more broadly, along with appropriate policy responses, are likely to differ substantially. In addition, the learners making use of them are expected to represent different groups, particularly with regard to residential location and family socio-economic status (SES) (Paterson and Kruss 1998). As daily travel is the focus of this dissertation, learner mobility is used as a more appropriate descriptor. It should be stressed that learner mobility is only one expression of school choice in South Africa. Other, and likely widespread, expressions of choice 4 occur when a family chooses their residence on the basis of proximity to specific schools, when a child leaves the public sector to attend a private school, regardless of location, or when a child lives with people other than his or her family in order to attend a specific school. These types of choice are excluded from the above definition of mobility for various reasons, including that it is not possible to measure them with the data available for this project, and that they can be understood to operate differently from the types of movement described above in terms of the resources required and used, the perceived benefits, the population groups most likely to be taking advantage of them, and their implications for public policy. While all these forms of school choice in South Africa certainly warrant further examination, they do not fall within the scope of this thesis. This thesis takes several different approaches to the measurement of learner mobility, in order to capture a range of different aspects of the phenomenon. The first approach involves measuring the straight-line distance between a child‘s home, and his or her school. This is selected because it is theoretically sound, capturing to some extent the level of investment required by mobility, and because it can be accurately measured using the available data. Additionally, and for these same reasons, it has been widely used in existing work on learner mobility, although in some cases distance is supplemented or replaced by travel time (Sekete, Shilubane et al. 2001; South African Human Rights Commission 2004; South African Human Rights Commission 2006; Pendlebury and Rudolph 2008). The second approach involves determining whether the learner is travelling to a school inside or outside of the area in which he or she lives. Unfortunately, identifying in a consistent and useful way whether a school and a home address are in fact located in the same area is very challenging in contemporary South Africa. Educational districts, as used by the Gauteng Department of Education (GDE) for administrative purposes, do not generally align with electoral wards, 5 census districts, or neighbourhoods. Most schools do not have a clearly defined catchment area, unless it has been defined by the school itself. Electoral wards and census districts have also undergone recent and substantial changes, and in many cases bear limited resemblance to historically defined neighbourhoods. In this context, the most feasible way to measure travel between areas proved to be by locating both school and home within their relevant census districts, using GIS coordinates. Although imperfect, this data does at least provide a preliminary measure of whether a child attends school in the area in which he or she lives. Finally, the third approach to measuring mobility is to identify whether or not a child is attending the grade-appropriate school nearest to his or her home. This is used primarily as an indicator of whether or not a child and his or her family are engaging in school choice at all. Of course, this measure is imperfect, as a child attending the school closest to his or her home may have chosen this school deliberately, while a child attending a school further from home may do so for completely involuntary reasons. Nonetheless, this measure is used as the best available indicator to provide an approximate measure of the extent of engagement in school choice amongst children in post-Apartheid Johannesburg-Soweto. It is critical to note that many other definitions of learner mobility are of course possible, and in various contexts may indeed be more appropriate. For example, when parents and learners make school enrolment choices, they certainly take into account a wide range of factors other than distance. Given South Africa‘s long and racially-defined history of learner migration and differential educational opportunities, the historical racial designations of schools is one factor that is likely to play a significant role (Paterson and Kruss 1998; Lombard 2007). Factors such as travel time, travel cost, safety, school reputation, and school fees, among others, are also likely to be of particular importance (Chisholm 2004; Maile 2004; Lemon 2005). 6 The measures of learner mobility used in this thesis, and discussed above, include both a continuous measure (distance from home to school), and binary measures (attending a school in the area in which a child lives, and attending the school nearest to the child‘s home). This is appealing, as mobility can be thought of as both binary – occurring or not occurring – and as a continuum – with the extent of mobility determined by the distance travelled to school. In addition, combining both binary and continuous measures allows for the application of a broader range of different analytical approaches. 1.3 Core research questions The literature review, presented in Chapter 2, explores and defines the context in which school choice, including learner mobility, occurs in contemporary urban South Africa. This provides the background against which the study‘s research questions can be addressed. Current school choice practices in South Africa have emerged in the context of rapid, global, changes in educational systems. Key elements of these changes have included increasing demands for higher levels of education along with universal access to basic education, a focus on improving the quality of education, which is often accompanied by increased levels of testing and ICT use, and pressure to keep public sector spending to a minimum through cost recovery and privatization (Carnoy 1999). During the 1990s there were widespread debates around the extent to which market forces and an increased private role in educational production could allow these otherwise somewhat contradictory goals to be met, and establishing school choice was viewed as a critical component of getting educational markets to work properly (Hoxby 2002; Hoxby 2003). While these debates have now died down, most of the pressures that spurred them have not gone away, and some parts of the response to these pressures, such as school choice, have become entrenched (Greene, Loveless et al. 2010). Unfortunately, this entrenchment has not been combined with evidence that school choice is 7 an effective response to these pressures, nor, indeed that these pressures should be accepted in the first place. Nonetheless, school choice, whether official or unofficial, regulated or not, is now accepted – or at least tolerated – in most countries. In South Africa, the issue of school choice is complicated further by the country‘s history of massive racial inequalities. Under the Bantu Education Act, children‘s educational opportunities, along with the resources devoted to these, were determined entirely by the colour of their skin (Fedderke, de Kadt et al. 2000). While this policy is now a thing of the past, it has left behind a very persistent set of geographically defined inequalities in educational infrastructure and resources (Fiske and Ladd 2004; Fiske and Ladd 2005; Spaull 2011; van der Berg, Burger et al. 2011). As residential desegregation is occurring only slowly, for most black learners the only opportunity to attend a high quality school comes through a willingness and an ability to travel in order to reach one. While government policies do not legislate against school choice, they also do not facilitate choice as an option for the most disadvantaged learners, nor do they protect these learners from any harmful effects of their inability to express choice (Maile 2004; Woolman and Fleisch 2006). The ability to express choice is therefore expected to remain closely linked to SES, even as SES gradually dissociates from race. School choice, and particularly learner mobility, has become entrenched and broadly accepted in contemporary South Africa, even though relatively little is known about its extent. In this context, it is critical to explore this phenomenon to develop an understanding of its implications, as well as how any negative effects can be mitigated. This thesis documents learner mobility between 1997 and 2003. This period of time is thought to be critical to the development and establishment of new, post-Apartheid patterns of interaction with educational opportunity. The most substantial policy changes thought to have an impact on mobility had already been made by this period. 8 I will now move through the two major research questions addressed in this thesis, providing critical context for each, and exploring key hypotheses. 1.3.1 What is the extent of learner mobility in South Africa? Given good reason to believe that learner mobility, as an expression of school choice, exists at significant levels in South Africa, the next task becomes to measure it (Karlsson 2007). Determining the extent of learner mobility in post- Apartheid Johannesburg-Soweto forms the first core research question in this thesis. To date, although there is ample evidence that learner mobility is fairly widespread, there is little concrete information about just what this means. This thesis measures learner mobility in the Johannesburg-Soweto metropolitan area, using the three different definitions of learner mobility described above. The actual dimensions of mobility are both of practical and theoretical importance. Practically, the extent of learner mobility – in each of the forms described – has implications for educational policy and planning. Theoretically, the extent of the phenomenon is important as it feeds into major, ongoing debates, both domestic and international, around the implications of school choice, particularly when unregulated, for both academic and societal outcomes. Although some work has been done to measure the extent of learner mobility and school choice in particular populations (Paterson and Kruss 1998; Sekete, Shilubane et al. 2001; Msila 2005; Karlsson 2007; Lombard 2007), to my knowledge there is no existing work which provides a measurement of the extent of mobility at a population level in contemporary urban South Africa. 1.3.2 What are the patterns of learner mobility in South Africa, particularly with respect to socio-economic determinants? There appear to be two major groups of determinants of school choice. The first set of determinants relates to the broader context in which school choice occurs, and includes factors like policy, the nature of the schooling system, the 9 distribution of educational opportunities, and so on. This is covered in this study‘s literature review, and has been fairly comprehensively dealt with by the existing South African school choice literature (Woolman and Fleisch 2006). The second set of determinants relate to the attributes of individual children, their families, and the communities in which they live, as well as the properties of the schools available to them. International work has documented the importance of these determinants, but very little theoretical work, and even less empirical work, has been done on this for South Africa. As a result, this area is where the bulk of the analytical work conducted in this thesis is focussed. This second research question addressed by the thesis therefore revolves around identifying who, primarily, is involved in learner mobility, and exploring the patterns evident in this mobility. The evidence that is available suggests that patterns of mobility are highly mediated by race, class, gender, age and geographic location (Paterson and Kruss 1998; Sekete, Shilubane et al. 2001; Fiske and Ladd 2004; Nelson Mandela Children's Fund 2005; Karlsson 2007). The relationship between each of these variables and mobility will be explored in this thesis, as will the relationship between mobility and a range of attributes of the school a child lives nearest to, and the school which that child attends. As the majority of documented learner mobility seems to involve children living in historically disadvantaged urban African communities (Sekete, Shilubane et al. 2001; Maile 2004), this thesis focuses particularly on the urban Johannesburg-Soweto area. In line with data provided by other work (Sekete, Shilubane et al. 2001; Chisholm 2004; Fiske and Ladd 2004), the most substantial portion of the mobility identified is hypothesised to be learners travelling from townships and informal settlements to schools in historically more advantaged areas. In addition, some evidence of choice and mobility 10 within the more disadvantaged urban areas are also expected, with children choosing to attend schools other than those nearest to their homes, and sometimes those outside of their immediate residential area. These forms of intra-urban learner mobility are hypothesised to be linked to family race, SES, and maternal education. Of those involved in learner mobility, the most advantaged learners and those with the most educated parents are expected to travel furthest, to attend schools in the most advantaged areas (typically, historically white schools). Somewhat less advantaged learners, and those with somewhat less educated parents, are expected to travel somewhat shorter distances to attend schools in more moderately advantaged areas (typically, historically Coloured or Indian schools). Travel to attend schools in other highly disadvantaged areas (typically, historically Black township schools) is expected to be restricted to the least advantaged among mobile learners. The thesis also explores changes in the mobility behaviour of children over time. Children are expected to become increasingly mobile as they age, with far higher levels of mobility anticipated at the secondary school level than at the primary school level. The final analytical component of the thesis ties together all of the potential determinants of mobility analysed, and provides preliminary models for the prediction of mobility behaviours in children in the Johannesburg-Soweto area. 1.4 Methodological approach The methodological approach taken in this thesis is quantitative secondary analysis. This is a widely accepted research method (Bryman 2004), and is well suited to answering the research questions outlined above. The thesis makes use of data on a sub-sample (n=1428) of the Bt20 cohort, which is comprised of all (n=3273) children born in a 6 week window in 1990 in the 11 Johannesburg-Soweto metropolitan area. In order to allow for the consideration of variables relating to schools and communities in the analysis, and the incorporation of spatial effects, the Bt20 data is combined with schools data from the Department of Education (DOE), and census district data from Census 2001. As detailed previously, a number of approaches to the measurement of mobility are taken. This is followed by a range of bivariate analyses, exploring the relationships between learner mobility and hypothesized determinants of mobility. Change in mobility over time is also examined. Finally, a regression approach is used to combine all relevant variables, and generate preliminary models of learner mobility. 1.5 Rationale 1.5.1 Rationale for a focus on learner mobility As mentioned previously, learner mobility is only one form of school choice in South Africa. Framing this study of learner mobility as a contribution to the international school choice literature, and as a contribution to understanding school choice and its implications for policy and practice in South Africa, requires answering two questions. Firstly, why only study a part of the phenomenon, and not the entire phenomenon? Secondly, why focus on learner mobility, as opposed to any other expression of school choice? The argument for addressing only a single form of school choice stems primarily from the complexity of school choice in contemporary South Africa, combined with the current paucity of data on the phenomenon. The review of the international, empirical literature on school choice (presented in Chapter 2) will demonstrate that the implications of school choice depend heavily on both the context of choice, as well as the exact nature of the choice that is available. If multiple forms of school choice are combined in a single study, the study will need to differentiate between these different forms of choice, and their 12 differing implications. Because we currently know so little about school choice in South Africa, any empirical study will need to begin by identifying and measuring each particular form of choice, before it is possible to move on to examine implications. Providing such a thorough and deep examination of multiple forms of choice would be ideal, but is simply beyond what is feasible in this study. Of the various forms of school choice in evidence in contemporary South Africa, there are several reasons for this study to focus specifically on learner mobility. Firstly, learner mobility appears to be extensive. Although the literature on learner mobility itself is extremely limited, data from studies of various other educational topics suggest that it is likely to be quite prevalent. For example, Karlsson (2007) provides evidence that the single largest group of commuters in contemporary South Africa is school children. Additionally, in a representative survey of Grade 12 learners nationwide in 2001, 24.9% were found to live more than 10km away from their school, a proportion which excluded those at boarding schools (Cosser and du Toit 2002). Even if some of this mobility can be explained by children in rural areas who have poor access to schooling, it still suggests that voluntary mobility in urban areas is likely to be prevalent. Secondly, learner mobility is of particular theoretical interest, due to its potentially significant role in decreasing the racial segregation of schools, but simultaneously increasing their socio-economic segregation, with substantial implications for educational access and equality. This is because the costs associated with learner mobility tend to be fairly substantial, meaning that it is likely to be shaped primarily by socio-economic status, rather than race. Recent work suggests that although the role of race in determining educational outcomes has been falling in post-Apartheid South Africa, the role of SES has been becoming increasingly important (van der Berg, Burger et al. 2011). Understanding the relationship between SES and school choice is a critical 13 precursor to understanding the nature of the shifting relationship between race, SES and educational outcomes. Thirdly, learner mobility is in many ways the most measurable form of non- private school choice, and can in fact be measured, at least for samples of the population, using data which already exists. This means that considerable information can be obtained without requiring the time-consuming and expensive collection of specialised data. In addition, learner mobility has implications for a number of areas of educational policy, including school financing and governance. The information generated by this study will allow for a reassessment of key elements of South African educational policy, as well as existing academic work based on the assumption of limited or no learner mobility. It will also highlight the practical questions that need to be asked about South Africa‘s current approach to school choice, and provide some guidance regarding potential alternatives. 1.5.2 Rationale for a focus on school choice more broadly As will be illustrated in Chapter 2, and as alluded to above, the South African literature on school choice is fairly limited, and of the work that does exist, only a very small proportion is empirical. This sparse empirical literature is in itself a strong justification for an empirical investigation of the actual extent of school choice in a fairly large urban population in South Africa. Additionally, most existing empirical work tends to be either extremely local or highly aggregated. While each of these types of work is very useful, particularly given the limited state of existing knowledge, this does leave substantial space for work at a more intermediate level, which allows for both the contextualization of data, but also the identification of more generalizable patterns. The international literature makes very clear that school choice policy and practice are very closely connected to issues of educational quality and educational equality. Understanding the determinants of access to high quality 14 education is of great relevance to national developmental goals of rapid and ongoing economic growth, job creation, racial equality, and socio-economic justice. Particularly in the context of the huge variations in educational quality present in South African public education, it is critical to understand who has access to good education, who does not, and how to best improve access for this second group. Although there is a fairly broad, and growing, literature on educational quality in South Africa, there is as yet very little exploration of how school quality shapes patterns of choice, and vice versa, or of the distribution of access to high quality educational opportunities. While this project will not provide definitive information about the implications of school choice for educational quality or outcomes, it will provide preliminary information about the relationship between engagement in school choice, and the quality of the educational opportunities a child is likely to be able to access. This will facilitate the development of data-driven hypotheses to be tested by future work. Understanding patterns of learner mobility will also provide useful information around questions of access to quality education, and potential patterns of investment to address quality improvement. Existing South African work on educational equality has tended to focus on resource and quality differentials across schools (Chisholm 2004; Fiske and Ladd 2004; Kanjee 2007). South Africa‘s highly unequal educational system has been strongly shaped by the country‘s history of racial segregation, and the race-based allocation of educational resources (Motala 1995; Fedderke, de Kadt et al. 2000; Motala, Dieltiens et al. 2009). Racial inequalities in the educational sector remain extensive, and rightfully receive a substantial amount of attention, both politically and in the academic literature (Fiske and Ladd 2004; Nkomo, McKinney et al. 2004). However, as socio-economic inequalities within race groups begin to grow (van der Berg, Wood et al. 2002), direct attention to the implications of socio-economic status for 15 education becomes increasingly important. In focussing closely on the correlates of learner mobility, this study will bring to light a great deal of information about the extent to which SES and access to high quality schooling are related, and by extension, whether it is important to begin focussing more attention on segregation across schools on the basis of SES rather than race. Although the South African literature on educational equality is much broader than that on school choice, once again relatively little of it is empirical, and concerns about the lack of empirical work at an intermediate level also hold. A great proportion of the existing work on equality and desegregation has also focussed on those schools generally thought of as higher quality, or more desirable – that is, generally, former Model C and independent schools. By contrast, schools which are perceived as low quality have been to a large extent ignored, even though they make up a far greater proportion of schools nationally. This project adds valuably to the literature because it will provide reliable information about who actually attends these very numerous, lower quality schools. In addition to contributing to the local understanding of an important but poorly understood phenomenon, and the local school choice literature, this thesis has also been designed to help fill several important gaps in the international school choice literature. Firstly, it will provide insight into the extent to which school choice can become accepted in developing country context, even without the existence of intentionally pro-school-choice policies. Secondly, it will explore the implications of unplanned and largely unregulated school choice for educational equality and segregation, in a context of extremely high inequality. 1.5.3 Rationale in terms of practical and policy implications The information generated in this project will assist in the assessment of current educational policies, most of which are based on the assumption that 16 the large majority of children attend local, neighbourhood schools. For example, if mobility levels are substantial, and mobility is related to SES, this means that the school poverty rankings which determine the allocation of governmental funds are likely to be inaccurate, leading to underfunding of some learners, and overfunding of others. Similarly, high levels of learner mobility would imply problems with current community-centred school governance policies, in both the schools which lose learners and those which receive them. For example, if schools in disadvantaged areas are enrolling only the most disadvantaged of the children living in these areas, the managerial and fund-raising capacity of their parent bodies, and by extension their School Governing Bodies (SGBs), will probably be far more limited than would otherwise be the case. Furthermore, if more advantaged children are attending schools relatively far afield, their parents, who might otherwise be eager to play an active role in school governance, will experience greater logistical barriers to engagement in those schools. Additionally, in light of current school fee policies, which require schools to grant fee exemptions to disadvantaged learners, the extent of mobility is likely to have implications for the revenue streams of both more advantaged schools enrolling relatively disadvantaged children from further afield, and more disadvantaged schools enrolling fewer of the local children with any ability to pay fees. Information on the extent to which school choice is currently practiced, and by whom, will also allow for data-driven reflection on the appropriateness of those very policies which currently serve to shape and constrain choice. At the level of learner outcomes, learner mobility has potential implications for academic performance, drop-out and repetition. Acting directly on learners, mobility can be hypothesised to have either positive implications, due to access to enhanced educational opportunities, or negative implications, due to travel time, sub-optimal resource allocation, and cultural differences. Mobility may also operate on learners indirectly, through its impact on the school environment. A question of particular interest in this regard is the extent to 17 which more advantaged children do actually leave their neighbourhood schools, and what implications this has for the children remaining in those schools. While these questions are not answered explicitly in this thesis, the information generated about mobility will allow for the development of clearer, data-driven hypotheses regarding the likely implications of learner mobility for both individual children, and for South Africa‘s schooling system more broadly. 1.6 Outline of the thesis Chapter 1 has provided an overview of the topic of this thesis, along with the specific research questions to be addressed, and preliminary hypotheses regarding anticipated findings. In addition, it has provided a brief rationale for the research project presented in this thesis. In Chapter 2, the international and South African literature relating to both school choice and learner mobility is reviewed. A particular focus is placed on the literature documenting the relationship between school choice policies and practices, and educational equity outcomes. The argument is made that the study of learner mobility in South Africa can provide valuable information about the ways in which school choice may impact on educational opportunities, specifically in the context of a developing country in which choice is largely unregulated. Chapter 3 provides an overview of the methodological approach taken in the study. Chapter 4 explores issues around sample representativity, and provides descriptive statistics for the sample. Chapter 5 uses three different approaches to measure learner mobility, and presents data on the extent of this mobility. In Chapter 6 a range of bivariate analyses are conducted to explore the relationships between mobility behaviours, and variables at the levels of 18 individual children, their families and households, and the communities in which they live. Chapter 7 extends these analyses to document the relationships between mobility behaviours and the attributes of the schools which children attend, as well as the schools closest to their homes. Chapter 8 documents the ways in which mobility behaviours change over time, as children age. Chapter 9 combines variables considered in the previous three chapters to generate preliminary models for each of the measures of learner mobility that have been discussed. Distance between home and school is modelled using OLS regression, while travel between areas, and attendance at the nearest school are modelled using logistic regression. Finally, chapter 10 provides a brief overview and discussion of the findings of the study, and concludes the thesis. 1.7 Conclusion This chapter began by providing some brief insight into the context of school choice and learner mobility in contemporary South Africa. It then defined learner mobility, and provided three different measures that will be used to document its extent in post-Apartheid Johannesburg-Soweto. Firstly, the straight-line distance between home and school will be measured. Secondly, Census 2001 district data will be used to determine whether or not a child lives and attends school in the same geographical area. Thirdly, whether or not a child attends the grade-appropriate school nearest to his or her home will be documented. The chapter then moved on to document the core research questions to be addressed in this thesis. These relate to measuring the extent of learner mobility in contemporary urban South Africa, and to identifying the correlates of engagement in learner mobility at the level of the child, the household, the community, and the school. This was followed by a brief description of the methodological approach taken in the chapter, and a rationale for the selection of the topic on both theoretical and practical, policy- 19 relevant grounds. The chapter then concluded by providing an outline and overview of the chapters to follow. 20 Chapter 2: Literature review 2.1 Introduction This chapter serves to place the key questions addressed by this thesis in the context of the relevant academic literature, and to outline the methodological and theoretical approaches used in addressing those questions. As learner mobility is a particular form of school choice, this literature comprises the international literature on school choice, and more specifically that part of this literature which explores the relationships between choice, quality, and equality. Due to learner mobility‘s potential implications for South African educational equality, the literature on South African inequality and segregation in the post-Apartheid era is also relevant. The first section of this chapter outlines various forms of school choice found in internationally, and argues that three key dimensions of choice policies are the extent to which they are officially legislated, the extent to which they include provisions to protect vulnerable groups from potential harm, and the extent to which they include provisions to allow members of vulnerable groups to actively engage in choice. The second section of the chapter outlines both the policy and practice relating to school choice in post-Apartheid South Africa, and makes the argument that the emergence of learner mobility in South Africa can best be understood as the outcome of an unplanned, and largely unregulated, school choice system, with very few protections for vulnerable groups. Having described and situated South African learner mobility within the context of school choice more broadly, the review then moves on to explore two key debates in the international school choice literature: firstly, the debate about whether or not school choice improves performance in educational systems, and secondly the debate about whether school choice increases inequality and segregation in educational systems. Following this, the literature on educational equity and desegregation in South Africa is reviewed, with a 21 particular focus on the very limited body of work which relates these issues to school choice. This illustrates that by providing data on the relationship between choice, mobility and educational equality, this thesis will make a useful contribution to the scholarly literature both locally and internationally. The next section of the chapter places this study‘s methodological approach within the context of the existing scholarly literature, and illustrates the methodological contributions this thesis makes. In the final section, the conceptual framework used as the foundation of the analyses presented in this thesis is described, with reference to how it has been shaped by the literature, both South African and international. 2.2 Major forms of school choice policy Although there is a broad and varied literature on school choice, it is a highly ideological literature, with much heated debate but fairly limited empirical data. Much of the available work has been commissioned by interested parties, or is driven by individuals with strong ideological positions (for a discussion of this phenomenon with respect to the UK school choice literature, see Gorard and Fitz (2006)). Additionally, the large majority of existing literature, particularly empirical literature, relates to school choice as implemented in the developed world, while work on school choice in the developing world is far sparser. As a result, this overview of the major forms of school choice policy draws primarily on examples from the developed world. School choice is generally understood to be occurring when families are able to make a decision about which school a child will be enrolled in. Globally, public schooling systems range from those where there is almost complete choice (such as New Zealand and the Netherlands), to those where children are required to attend particular schools, usually on the basis of residential location (such as Cuba, France and Japan). Most systems are located somewhere 22 between these extremes, with a global trend towards increasing levels of choice (Goldhaber and Eide 2002; Plank and Sykes 2003). There are some forms of school choice which are virtually impossible to control. The most significant of these is when people select their residences on the basis of access to particular schools (Holmes 2002). Another significant expression occurs when people leave public schooling systems to attend private schools, although this is obviously limited to those systems in which private schooling is accepted (Rinne, Kivirauma et al. 2002). The extent to which people can exercise these types of choice depends on their location, their flexibility in terms of location, and, critically, their wealth (Hoxby 1998). Importantly, these forms of choice interact with various different legislated systems of choice. Forms of choice provided by legislated systems of choice may include provisions for choice between a number of local schools, magnet school programmes which are accessible to any student on the basis of a lottery or academic performance, establishment of charter schools or multiple small schools in particular areas to provide alternatives to traditional schools, or simply unfettered enrolment at any school, regardless of location. Each form of choice will evidently have different implications in terms of equality of access, and consequences for particular population groups (Teske and Schneider 2001; Hoxby 2002). While some school choice legislation simply provides a legal right to a certain amount of choice between different public schools, others are designed to counter inequalities associated with school choice, particularly when expressed through access to private schools. Most widespread here are various voucher systems, common in the US, but also emerging in other countries such as Chile, which pay a varying proportion of private school fees for less advantaged learners (Goldhaber and Eide 2002; Elacqua, Schneider et al. 23 2006). In addition, provisions to protect particular groups may also be included in any type of school choice policy. They might, for example, require that schools enrol a certain proportion of students from particular ethnic or economic backgrounds, or might provide free transportation for certain students to the school of their choice (Holmes 2002). In summary, the key variations in school choice policies, shaping which people, and how many of them, can engage in school choice are the extent to which these policies: are planned or legislated, as opposed to unplanned or unofficial; include provisions to protect vulnerable groups from potential harm; and include provisions to ensure that vulnerable groups have equitable access to choice. I now explore school choice policy in South Africa, with particular attention to its properties in these regards. 2.3 School choice in South Africa 2.3.1 The context of school choice in South Africa Although the school choice literature on South Africa is quite limited, particularly with regards to empirical work, there is some strong theoretical literature documenting the ways in which existing policies shape school choice in the country. School choice in South Africa is regulated primarily through the National Education Policy Act (NEPA), the South Africa Schools Act (SASA), and the Employment of Educators Act (EEA) (Pampallis 2003; Maile 2004; Woolman and Fleisch 2006), although the more recent classification of a number of schools as no-fee schools also seems likely to be important (Ahmed and Sayed 2009). 24 While at first glance South African policy appears to constrain learners to attend neighbourhood schools on the basis of their residence, it combines with national history to provide both motivation and means for parents to choose their children‘s schools. Huge variations in both empirical and perceived school quality mean that many parents are extremely motivated to ensure that their children attend specific schools. Policies around school financing and the provision of teaching staff mean that schools are motivated to enrol as many children as possible. As public funding is limited, fee-charging schools are particularly eager to enrol large numbers of children who are able to pay fees. Decentralisation of managerial functions to school governing body (SGB) level means that schools do have some control over the design and enforcement of their admissions policies, and by extension over which learners they enrol, although this control is often de facto rather than de jure, and is subject to some legal constraints. In addition, school choice through residential selection continues to operate, as many advantaged schools prioritize the enrolment of local children. The growing independent schooling sector also provides parents with further choice. As a result, the opportunities for school choice in South Africa are substantial, but come at a fairly marked financial cost to parents. There is widespread evidence that significant numbers of parents are none-the- less exercising this choice (Sekete, Shilubane et al. 2001; Maile 2004; Lemon 2005; Nelson Mandela Children's Fund 2005; Johnson 2007; Lam, Ardington et al. 2008). Although the policies mentioned above have played a central role in shaping the way that school choice has developed in South Africa, it should be noted that, perhaps with the exception of legislation around independent (private) schooling, this role was generally not intended (Woolman and Fleisch 2006) – instead, school choice was a largely accidental outcome of policies developed for other reasons2. The system of school choice in South Africa can therefore 2 Although the fact that there have been no major efforts to reduce levels of choice in the system does suggest that the existence of choice appears to suit the political and social elite. 25 be understood as one which is generally, although not entirely, unplanned and unofficial. Due to the substantial freedom of choice that seems to exist, as well as the considerable inequality in both South Africa‘s schooling system and income distribution, the question of whether there are any protections in place for vulnerable groups in the context of large-scale school choice is clearly important. As the major determinant of the ability to exercise choice seems to be the ability to pay higher fees and pay for additional transportation, it is probable that the ability to exercise choice is strongly linked to socio-economic status. As a result, the major group at risk of an inability to engage in choice, or even at risk of harm by being left in the most poorly performing schoos, are those of lower socio-economic status (Pampallis 2003; Fiske and Ladd 2004). Potential protective policies might therefore include additional support for schools which primarily attract children from disadvantaged contexts, or the provision of incentives to advantaged schools to enrol less advantaged learners. However, given that there are no explicit government policies on school choice, there are also none of these types of provisions to protect or support vulnerable populations. Similarly, with regards to providing vulnerable groups with access to school choice, for example by paying their school fees or providing free transportation, there are also no policies in place3. This means that South Africa‘s school choice policy could be considered as one that is unplanned, unofficial, and unregulated, with few protective measures, while simultaneously allowing quite extensive levels of choice to certain sectors of the population. 3 Although a school fee exemption policy exists, which exempts disadvantaged learners from the obligation to pay fees, this only applies once children have been granted admission to the school in question. Additionally, implementation is generally acknowledged to be poor. Veriava, F. (2005). Free to learn: A discussion paper on the School Fee Exemption policy. Cape Town, South Africa, Children's Institute, University of Cape Town. 26 2.3.2 The practice of school choice in South Africa Due to the limited literature on South African school choice, the description of the practice of school choice in contemporary South Africa presented in this section draws primarily on the theoretical literature, although empirical studies related to school choice are cited where they exist. Currently, school choice in South Africa appears to take four major forms: residential, private, intra- area, and inter-area, where intra- and inter-area choice corresponds to learner mobility as defined in this thesis. Residential school choice occurs when parents select homes on the basis of their proximity to particular schools. Exercising residential school choice generally requires a relatively high level of income and parental education. Due to the geographic distribution of good schools in South Africa, with most good schools located in affluent, historically white areas with high property prices, the constraints on parental ability to exercise residential choice are likely to be particularly high. Due to the private nature of residential location decisions, this type of school choice is also extremely difficult to measure. Private school choice occurs when parents decide to exit the public schooling system altogether, instead sending their child to an independent (private) school. In South Africa, although growing, the independent schooling sector remains relatively small, accommodating just over 3 percent of learners (du Toit 2003; Hofmeyr and Lee 2004; Centre for Development and Enterprise 2010). While increasing numbers of independent schools offer relatively low fees, and access appears to have expanded greatly in recent years, these schools still serve only relatively small numbers of children (Centre for Development and Enterprise 2010). Most high quality independent schools charge high fees, and often select learners on the basis of academic capability. While the sector is increasingly diverse, racially and socio-economically (du Toit 2003; Hofmeyr and Lee 2004), choosing an independent school is still not an option for the large majority of less-advantaged parents, due to the small 27 size of the sector. In addition, finding a space for a disadvantaged child in an independent school is likely to require fairly substantial knowledge and effort on the part of a parent, again making it less of an option for most disadvantaged families. Intra-area choice occurs when parents are able to choose between a number of schools within their residential area, and make enrolment decisions themselves, on the basis of any particular set of factors. This type of choice is very difficult to measure, as there is no easy way to distinguish between learners who are simply attending the school most accessible to their home, and those who choosing to attend a particular school among those closest to their home for other reasons. In addition, because most residential areas in South Africa remain fairly homogenous, the extent to which this type of migration is likely to matter to socio-economic segregation may be relatively limited. Nonetheless, there is evidence from a small number of studies that parents do distinguish between local schools, and that even within disadvantaged areas, schools with better reputations tend to charge slightly higher fees and attract slightly more advantaged learners (Fiske and Ladd 2004; Msila 2009). Finally, inter-area choice occurs when parents choose a school outside of the area of their residence. This form of choice appears to be fairly wide-spread in South Africa, with large numbers of learners in various contexts reporting that they attend school relatively far from home (Sekete, Shilubane et al. 2001; Cosser and du Toit 2002; Nelson Mandela Children's Fund 2005). In some cases, particularly in rural areas, this travel may be due to children not having schools close to their homes, rather than choice, but in urban areas this is typically not a concern. In practice, the line between intra- and inter-area school choice is quite fuzzy, particularly in the South African context where, unlike in most developed countries, there are no consistently defined school catchment areas, and the 28 geography of school districts rarely meshes with that of residential areas. Nonetheless, because inter-area choice generally requires parents to access some additional information, as well as fund additional travel and potentially higher fees, it seems probable that parents accessing inter-area choice are likely to be somewhat more advantaged than those only able to access intra- area choice. On the other hand, inter-area choice is also unlikely to be used by the most advantaged families, as they tend to already be living in the areas with the strongest schools. This description of school choice practices in contemporary South Africa makes clear that the expression of school choice in the country is extremely complex and multi-layered. In some cases, multiple forms of choice may co- occur, for example with learners travelling substantial distances to attend independent schools, or parents sending a child to a local primary school, and a distant high school. As this example also illustrates, multiple forms of choice may be evident at different times during a single child‘s education. Each form of school choice is governed by different constraints, particularly with respect to the socio-economic attributes of those who are able to exercise them. 2.4 Debates in the international school choice literature Two major topics of debate are evident in the international literature around school choice. The first is the relationship between choice and educational quality, and the second is the relationship between choice and educational equality. Although historically these debates have been largely theoretical, studies drawing on data are increasing in number, and these two debates will be discussed with particular reference to this type of empirical work. Nonetheless, the highly ideologically driven nature of much of the literature, and the polarized nature of these debates, does make a clear and unbiased interpretation of this literature challenging. I will explore the debates around equality in some depth, as this is the area to which this thesis contributes most 29 directly, but will first touch briefly on the discussions around the relationship between school choice and educational quality. 2.4.1 School choice and quality Proponents of school choice argue strongly that choice creates markets, or at the least, quasi-markets, in the educational arena, resulting in competition between schools for students and resources, and by extension, for better academic performance (Coleman 1992; Hoxby 2002; Hoxby 2003). Critics, by contrast, argue that the market in public education is too imperfect to result in the type of competition which would improve performance overall. Instead, they argue, choice will lead to a growing divide between well-resourced schools attracting good students, and poorly resourced schools attracting the weakest learners (Levin 1991; Astin 1992). These students and their schools are often termed ―those left behind‖ in the school choice literature, and studies on how school choice policies affect them are particularly inconclusive (Teske and Schneider 2001). To date, research findings on the quality implications of school choice, both broadly and for specific populations, remain mixed. Much of the variation evident in the quality outcomes of school choice policies relates to differences in policy design, implementation, and evaluation methodologies (Henig 1994; Goldhaber 2000; Teske and Schneider 2001; Goldhaber and Eide 2002). Despite some contemporary claims of consensus around the notion that choice improves quality, an increase in the implementation of choice programmes, and a dramatic falloff in the quantity of research on the topic, there in fact remains very little agreement on this issue (Lubienski, Weitzel et al. 2009). 2.4.2 School choice and equality The second major area of debate, closely related to the discussions around quality, is around the implications of school choice for educational equality. School choice policies have potential to impact equality of opportunity, 30 through shaping access to particular schools, equality in school performance, equality for people from different backgrounds by way of approaches to diversity, and finally, equality with regards to whether and how parents may exercise choice in the educational realm (Godwin, Kemerer et al. 1998). While all of these aspects of equality are important, particularly in the context of rapid societal change, equality of opportunity is the most salient in contemporary South Africa, and it is here that this review is focussed. Levin (1991) argues that while school choice may provide private benefits to individuals in the form of access to better schools, this is likely to be accompanied by social harm. Equality of educational opportunity is closely related to student sorting, which has long been recognized as an outcome of any form of school choice (Hoxby 2003). Sorting along racial and socio- economic lines, which may lead to racial or economic segregation, has been a particular focus of the literature. Segregation between schools is a well- documented phenomenon both in South Africa (Chisholm 2004) and internationally, across educational systems with a range of policies towards choice (Coleman 1992). Critics of school choice argue that choice tends to increase segregation through two key mechanisms. Firstly, the ability and willingness to take advantage of school choice varies along with demographic variables; and secondly, the bases on which choices are made also vary with demographic variables (Holmes 2002; Ladd 2003; Denessen, Driessena et al. 2005; Elacqua, Schneider et al. 2006). By contrast, proponents of school choice tend to argue that choice has a positive effect on the equality of educational opportunity, decreasing segregation along lines of race and class by allowing disadvantaged students to escape from badly-performing neighbourhood schools. They acknowledge that segregation along the lines of performance may increase, but argue that this will simply increase incentives for competition between schools, improving outcomes across the system (Hoxby 2003). While some segregation is inevitable, systems must strive to avoid segregation on the basis of race or class, and use school choice as a tool 31 to this end (Coleman 1992). Critics counter that given the clear evidence of a close connection between class, race and academic performance, even segregation purely on the basis of performance will increase segregation on the basis of class and race (Astin 1992). To date, the evidence as to whether, and how, school choice might influence segregation levels in schools remains mixed (Goldhaber and Eide 2002; Viteritti 2005; Godwin, Leland et al. 2006). Given that the outcomes of school choice depend very strongly on both the context of implementation, as well as the design of the policy, this is hardly surprising (Henig 1994; Hoxby 2003). Different research methods have also played a role in the variable outcomes of research into school choice and segregation. For example, in the area of parental decision-making, the bulk of research has been conducted through surveys and interviews. Studies of actual enrolment patterns, however, have demonstrated that how parents claim to make enrolment decisions does not always correspond with how they actually make enrolment decisions (Elacqua, Schneider et al. 2006). Similarly, parental answers to questionnaires may be markedly different to their answers during in-depth interviews (Bagley 1996). Data constraints are another problem. For example, many studies use data only from a single point in time, which makes it impossible for them to demonstrate a relationship between particular polices, and changes in segregation levels (Gorard and Fitz 2000). Finally, the deep-seated beliefs around social justice or free market competition held by many involved in this debate has meant that research has not always been entirely objective, making it still more difficult to unravel the genuine implications of school choice policies for inequalities and segregation (Gorard and Fitz 2006). Nonetheless, it is worth presenting a brief overview of what has been found internationally, to provide an understanding of the current state of knowledge on the issue. In particular, these studies provide insight into how various types of school choice interact with various types of school system structure, to 32 shape outcomes related to segregation. Particularly in the absence of much empirical work on school choice in South Africa, exploring these international variations is critical to understanding how South Africa‘s unplanned and unregulated choice system is likely to play out in the context of its already extremely segregated and highly varied schooling system. Empirical investigations of the implications of choice policies for segregation While perhaps least relevant to South Africa, the evidence from small and fairly homogenous countries, or those with a single, national policy on choice, tends to be the most straightforward. This is particularly so when, as is the case in many European countries, education policy and curricula are nationally defined, and private education is minimal. For example, in 2000 in Sweden, school admissions decisions nationwide were switched from the basis of residential location to academic performance. This resulted in a growth of differentiation between schools in terms of achievement, but also, significantly, substantial growth in segregation on the basis of race, SES and immigration status (Daun 2003; Soderstrom and Uusitalo 2005). One of the key reasons for these types of outcomes appears to be that parents often take the racial, religious or socio-economic composition of a school‘s student body into account when making school choices for their children – although they are very rarely willing to admit this. School segregation in the Netherlands, for example, is clearly related to parental decision making (Karsten, Felix et al. 2006). It is not only advantaged parents who take ethnicity and class into account in decision making about schools, however. In the Netherlands, decision making by members of the typically disadvantaged Muslim minority appears to play an important role in shaping segregation (Denessen, Driessena et al. 2005). Likewise, in Germany, the provision of school choice at the primary level has been associated with ethnic segregation 33 due to the different patterns of decision making around schooling exhibited by both German and Turkish families (Kristen 2005). Even in these comparatively simple contexts interpretations of the data can vary substantially. Scholars concur that the introduction of school choice policies in New Zealand between 1989 and 1993 has had highly variable results for different schools. But while some argue for a clear underlying theme of increasing segregation along racial, and to a lesser extent, socio-economic, lines between schools (Waslander and Thrupp 1995; Fiske and Ladd 2000), others argue that this is not the case, and that segregation has actually decreased markedly (Gorard and Fitz 2006). The UK‘s schooling system lies somewhere between most European systems and the American system, both in terms of the centralization of educational decision making and planning, and in terms of the level of choice that has traditionally been available to parents, and therefore in the complexity of the analysis of school choice outcomes. In some ways, more so in terms of policy than resource levels or history, the schooling system in the UK resembles the South African system quite closely. In particular, they both combine extremely local school management and decision making with extremely centralized curriculum planning and policy making. The debate on the implications of changes to school choice policy in the UK is particularly heated (Gorard and Fitz 2006). Some evidence of a national, short-term decrease in socio- economic segregation in response to increased school choice has been presented (Gorard and Fitz 2000; Gorard, Fitz et al. 2001; Bradley and Taylor 2002), although the implications for racial segregation as well as long-term effects are less clear (Bagley 1996; Noden 2000). Other scholars argue, however, that even the evidence for a short-term decrease in socio-economic segregation is not clear, and that segregation has actually increased with the introduction of greater parental choice. In addition, regional variations in the effects of school choice policy appear to have been very high, and aggregate 34 changes in segregation at the national level may in fact have very little to do with choice policy (Gorard and Fitz 2000; Noden 2000). Evidence from the UK also suggests that parents take race and class into account when choosing schools. For example, white parents have been found to avoid schools with large numbers of non-white children, although the evidence for this claim is largely qualitative (Bagley 1996). The large majority of available research evidence on the implications of school choice comes from the US. While this evidence is particularly mixed, which is to be expected given the country‘s wide range of choice policies as well as its substantial demographic variations, it is also particularly rich. The first form of explicit choice made available in the US was through magnet school programmes. Originally designed as a tool to combat racial segregation, there is some evidence of their working effectively in this regard. However, there is also evidence that in a number of cases magnet schools have not decreased segregation; in some cases they may have increased it, or added a new dimension, such as SES, to existing racial segregation (Henig 1994; Goldring and Hausman 1999; Saporito 2003). Another type of choice programme designed to combat segregation, though in this case with a greater focus on socio-economic segregation, are voucher programmes. While voucher programmes were implemented starting in the 1970s, the data on their implications for segregation is extremely limited. It is clear that there are socio-economic and demographic differences between those parents who do and do not participate in voucher programmes, but what these differences mean for segregation levels is not clear. While desegregation in the private schools receiving voucher-bearing students can be expected, there are concerns about those schools ‗left behind‘, and the students they educate (Bridge and Blackman 1978; Capell 1981; Henig 1994; Witte and Thorn 1996; Levin 1998; Goldhaber 1999; Hoxby 2003; Peterson, Howell et al. 2003). With the 2002 Supreme Court decision that voucher programmes including religious 35 schools are constitutional, and the subsequent growth in voucher programmes, it seems likely that clearer information will gradually become available. Currently, a particularly widespread form of choice in the US is various charter school programmes, which provide parents with a way to opt out of local public schools. Again, those who choose to participate in this choice programme differ from the general population. Participating parents tend to be better educated and somewhat better off. Race is generally found to be a strong determinant of which charter school a student will choose to attend, and there is some evidence that charter schools have had the most appeal to black parents, meaning that charter schools do in fact tend to be fairly highly segregated (Weiher and Tedin 2002; Bifulco and Ladd 2006; Garcia 2008). A final set of choice programmes has been those operating at the intra-district level, allowing parents some choice between the different schools within a particular school district. However, this type of policy can only be effectively implemented in fairly densely populated, urban, areas, and again, outcomes have been mixed, with reports of both increased and decreased segregation (Henig 1994; Godwin, Leland et al. 2006). Across all forms of school choice, there is clear evidence that American parents do tend to take concerns about racial and socio-economic composition of schools into account, even if they do not admit this explicitly (Schneider, Marschall et al. 1998; Holmes 2002; Schneider and Buckley 2002; Saporito 2003). Furthermore, even when parents from different backgrounds vary in their school-choice preferences in terms of variables other than race or socio- economic composition, the end effect may remain one of segregation or sorting (Schneider, Marschall et al. 1998). While the US literature is both substantial and diverse, a few clear themes stand out. The first is that in shaping outcomes, details are important, both in policy design and in the context of implementation (Henig 1994; Hoxby 2003; 36 Greene, Loveless et al. 2010). The second is that even when choice policies are designed to attain a particular outcome such as desegregation, their actual effects are difficult to predict. And thirdly, we do not yet have any definitive answers to questions about what choice means for educational equality. Studies providing empirical data on school choice in developing countries are relatively few and far between. In terms of racial and socio-economic inequalities, resource levels in the educational system more broadly, and capacity to implement policy, studies from these countries are likely to be particularly relevant to South Africa. The structure of the schooling systems in most of these countries, which generally rely very heavily on private education, does however differ quite substantially from that found in South Africa. One of the most well-documented cases of developing world school choice is the national voucher plan implemented in Chile in 1980, which spurred a rapid growth in private sector educational provision (Carnoy and McEwan 2003). There is clear evidence that those parents making use of the voucher programme to send their children to the private religious schools which produce the best educational outcomes, tend to be more advantaged, both educationally and economically, than those who do not. In addition, Chilean parents also take the socio-economic composition of a school‘s student body into account when making school choices, and more advantaged parents are particularly likely to enrol their children in schools with other advantaged children, prioritizing socio-economic composition over academic performance (Carnoy and McEwan 2003; Elacqua, Schneider et al. 2006). However, the extent to which the voucher programme is actually responsible for the high levels of socio-economic segregation in Chilean schools remains debated (Narodowski and Nores 2002). 37 In China, school choice is beginning to emerge, after a long period of entirely state-controlled schooling, but remains limited primarily to the more advantaged members of society (Tsang 2003). Evidence of increasingly choice-oriented schooling systems in post-Soviet countries is also beginning to emerge, but information on their likely implications for segregation is not yet available (Filer and Munich 2003). While this review presents a very mixed picture, a few points are clear. Firstly, there is potential for school choice policies to influence racial and socio- economic segregation, in a range of different contexts, and in a number of different ways. Secondly, the exact nature of this influence is highly dependent on the specificities of the context of implementation, as well as on the details of policy design. This review therefore supports the contention that examining the implications of school choice in South Africa for racial and socio-economic segregation is likely to provide useful information, both for those involved in managing and improving the nation‘s education system, as well as for those interested in understanding more clearly the interactions between context, policy, choice and segregation. 2.5 School choice and equality in South Africa Although the literature on school choice in South Africa is limited, there is a fairly well developed literature which speaks more broadly to the inequalities inherent in the country‘s educational system. This literature forms the core of the review presented below, although work with an explicit focus on choice is referred to whenever appropriate. In contemporary South Africa, numerous factors shape the access of individuals to high quality educational opportunities (Soudien, Carrim et al. 2004). For many reasons, race is the most salient of these factors, and has received a great deal of academic attention. There is now clear evidence that historically advantaged South African schools are becoming increasingly heterogeneous, although to varying degrees, with 38 regards to race and language (Sekete, Shilubane et al. 2001; Maile 2004; Sujee 2004; Lemon 2005; Johnson 2007). While concerns remain about the inclusivity of this integration, and very few schools reflect the race distribution of the country or its regions, there has certainly been a marked change since 1994. However, while these changes in the racial composition of schools have been well-documented, much less work has looked at school socio-economic composition, either statically or over time. The information that is available tends to suggest that substantial differences exist across schools in terms of their socio-economic composition (Maile 2004; Chisholm 2005; Lemon 2005; Reschovsky 2006; Motala 2009; Hunter 2010). Given increasing levels of socio-economic inequality in South African society more broadly, it seems likely that socio-economic segregation at the school level has the potential to increase relative to racial segregation. A major concern about the literature on school choice and equality in South Africa is closely related to this point. While the importance of race should not be understated, its high salience has tended to obscure, to some extent at least, other dimensions of inclusion/exclusion and discrimination. Particularly important in the context of school choice is SES. Given the geographically unequal distribution of good schools in South Africa, the huge variations in public school cost, and the strong relationship between school cost and quality, SES is likely to be strongly related to the quality of education a learner can access. While SES and race remain inextricably linked in South Africa, the primary pathways through which limitations on school choice operate now appear to have shifted away from race, and towards socio-economic status. This makes the paucity of the literature examining the relationship between school choice and socio-economic status particularly interesting, and alarming. Understanding whether school choice is genuinely linked to SES, and how any related negative implications of choice can be mitigated, requires urgent study. 39 This question is particularly critical to those historically disadvantaged communities where populations are racially fairly homogenous, but increasing socio-economic diversity is becoming evident. In part, this is because of the risk that certain schools and learners will be ‗left behind‘ by school choice. While the current system of largely unregulated choice may enable well- performing schools to attract more resources and further improve their performance, it may occur at the cost of schools which are struggling due to historically low resource levels, and learners who do not have the resources to exercise their genuine preferences for schools (Motala 2009). This division of schools and learners is of even more concern in light of the extremely high levels of socio-economic inequality in South Africa. This thesis makes an important contribution to the South African literature on school choice firstly by documenting how wide spread school choice is in the Johannesburg-Soweto metropolitan area. Additionally, it fills theoretical and empirical gaps by examining a range of potential contributors to school choice, including both race and SES. Finally, by examining SES explicitly, and including both advantaged and disadvantaged individuals, it will shed light on how broadly school choice is available as an option for members of all racial and socioeconomic groups. In the following section, the methodological contributions of thesis are discussed with reference to the existing body of work. 2.6 Methodological approach As already mentioned, empirical work on school choice in South Africa is extremely limited. With a few exceptions, those empirical studies which do touch on either school choice or on educational inequality tend to belong to one of two extremes. Either, they focus on a small sample, providing deep, rich data, usually but not always qualitative (Soudien 2003; Msila 2005; Msila 2009; Bray, Gooskens et al. 2010; Hunter 2010), or they are quantitative and 40 extremely broad, providing only highly aggregated statistics (van der Berg, Wood et al. 2002; Sujee 2004). Each of these types of study clearly has great value, particularly when knowledge about a phenomenon is limited. However, as knowledge increases, particularly in a country as diverse as South Africa, it becomes important that a more rounded literature develops and provides information at an intermediate level of aggregation, allowing data to become increasingly contextualized. Very little is currently available at this level, and even when it is, it tends to focus on either school choice or on educational inequalities (Sekete, Shilubane et al. 2001). To my knowledge, no studies exist which combine the examination of choice and educational inequality at an intermediate level. This is important gap, as it prevents us from understanding, for example, potentially substantial variations between rural, urban and peri- urban areas, or between different parts of the country at substantially different levels of development. In light of this gap, there is considerable scope for an empirical analysis of the relationship between school choice and socio-economic status, and particularly one which makes use of a relatively large sample from a clearly specified context. The two core questions addressed by this thesis, relating to the scope and the correlates of learner mobility in Johannesburg-Soweto, South Africa, seek to fill this gap. Although these questions are fundamentally empirical in nature, addressing them requires that this thesis also tackles the methodological and theoretical gaps evident in the literature, and proposes some novel solutions. In addition, as will have become clear from the overview of international literature, this thesis will also enrich the international debate around school choice, and provide particularly valuable information for the discussions about how school choice, inequality and segregation interact. This study offers three further methodological contributions. Firstly, most existing empirical work around learner mobility, whether quantitative or qualitative in nature, comes from school-based studies (Fiske et al., 2004; 41 Sekete et al., 2001). While the school-based approach has many advantages, particularly with regards to understanding how levels of mobility impact school performance and functioning, it does also have drawbacks, most notably with respect to understanding population-based levels and patterns of mobility. This is partly because mobility appears to be highly clustered around specific schools. Once a school first begins to enrol learners from outside the local community, particularly when those learners are from a different race group, members of the local community tend to begin to avoid that particular school (Fiske et al., 2004). Focusing on particular schools, rather than on particular populations or communities, may therefore provide either inflated or deflated data, depending on whether the school is one that caters primarily to mobile learners or not. In order to overcome some of these difficulties, this thesis makes use of population level data. As far as I am able to determine, it is the first study of learner mobility in South Africa to do so. Unfortunately, the type of national data necessary to generate this understanding is not available for South Africa. The best available alternative is to draw on large-scale datasets that sample populations in particular parts of the country, and use this information to draw conclusions at an intermediate level. This can in turn guide future data collection and analysis, toward a more complete understanding of the phenomenon. In addition, the use of data at an intermediate level has advantages of its own, such as allowing for control for geographic and other associated variation. The reasons for the selection of the Birth to Twenty dataset are documented in Chapter 3. Using this type of data, however, provides an additional advantage, which is that changes in mobility can be explored over time. Again, this thesis is the first study of which I am aware which looks at mobility behaviour at more than one time point. A second methodological innovation is that the project makes use of data from a number of different sources, in order to overcome the limitations imposed by 42 each individual data source. This allows for the simultaneous examination of school-level, child and family-level, and community-level determinants of mobility. Again this is the first empirical South African study of which I am aware that is able to examine this full range of potential determinates of mobility. Thirdly, this thesis takes a completely novel approach to the measurement of learner mobility. Whereas previous studies have used only one approach to measuring mobility, typically either travel distance or travel time, this study uses three different measures of mobility, each capturing a different aspect of the phenomenon. These are straight-line distance between home and school, whether a child attends a school in the same area in which he or she lives, and whether a child attends the grade-appropriate school nearest to his or her home. This is critical in that it allows for the unpacking of different forms of choice and mobility, as well as their varying determinants, and raises questions as to whether learner mobility in South Africa is a unitary concept. These measures are documented more fully in Chapter 3. 2.7 Conceptual framework Although the international literature on school choice is fairly large, there is a very limited body of work which actually explores, empirically, the determinants of the choices made by individual learners and their families (Bosetti 2004). Where empirical data does exist, it is often focused on choice of a particular type of school, for example independent or religious, as opposed to the choice of a particular school (Le and Miller 2003; Elacqua 2006). While a range of variables have fairly consistently been found to be important – particularly those relating to social class, as discussed above – there is no clear model which is systematically used to predict engagement in choice. As a result, I draw on work in other areas to develop the conceptual framework used in this thesis. In particular, the framework draws on work on decision-making 43 around mobility more broadly (De Jong 2000), and the literature on the process by which students choose the higher education institutions they apply to (Hanson and Litten 1982). The framework that I have derived, presented in Figure 2.1 below, therefore also serves as a contribution to the theoretical literature around school choice in South Africa. The framework conceives of school choice as always occurring within a particular historical, geographical and policy context. This is guided by the evidence presented in the literature review above that context is critical in determining the ways in which school choice play out. Examples of relevant aspects of historical context in contemporary South Africa include the enormous variability of school quality, and the ways in which access to resources, including education, are distributed across the population, on the basis of both race and class (Fiske and Ladd 2004; Msila 2005; Msila 2009; Spaull 2011; van der Berg, Burger et al. 2011). Geographical context includes the variable nature of different residential areas, and the ways in which these are physically located, as well as the geographical distribution of schools of differing levels of quality (Hunter 2010; van der Berg, Burger et al. 2011). Finally, examples of the ways in which policy shape school choice include the high levels of variation in the cost of attending different public schools, the ways in which policy allows and constrains school choice, and the extent to which families have access to reliable information about particular schools (Fiske and Ladd 2004; Woolman and Fleisch 2006). 44 Figure 2.1: Conceptual framework, based on De Jong (2000) and Hanson and Litten (1982) In this framework, variables at the level of the individual child, the family, and the community in which the family lives, all interact in shaping each other, and simultaneously all feed into the decision making process (De Jong 2000). In this thesis, a range of variables at each of these levels are tested for their relationship with mobility. The selection of variables for testing is guided by the international and local literatures on school choice where available. Potentially important child-level variables include gender, race, school performance and academic aptitude, psychological adjustment, and child location within the family (for example, number and relative ages and genders Decision making process - Desired educational outcomes - Investment required for desired outcomes - Investment constraints Residential location Child Household & Family Maternal Historical, geographical and policy context School selection - School quality - Distance -‘Localness’ 45 of siblings). In this thesis, due to data limitations, race, gender, and a few indicators of academic aptitude are examined. Race is suggested for inclusion by the wide range of international literature suggesting that it is a strong predictor of school choice behaviours (Glazerman 1998; Fiske and Ladd 2004; Fiske and Ladd 2005; Karsten, Felix et al. 2006; West and Hind 2007). Gender is included due to the literature documenting differential parental investment in children‘s education on the basis of their gender (Alderman and King 1998; Klasen 2002; Unterhalter 2005), although concerns about this in the contemporary South African context are fairly limited. Academic aptitude is also included as there is some reason to believe that it may also shape parental willingness to invest in education, or contribute to school choice in other ways (Glazerman 1998; Zietz and Joshi 2005; West and Hind 2007). At the household level, variables such as education levels of members of the household, the structural stability of the household, the wealth of the household, and the household‘s residential stability are considered. Additionally, maternal attributes are likely to be of particular importance to schooling decisions. Relevant variables, very similar to household level variables include the mother‘s marital status, and the stability of her relationship status, her education level, her income, and her age. In this thesis, the variables tested are maternal education, maternal marital status (as a proxy for household stability), and household SES. Maternal education is included on the basis of evidence for a relationship between maternal education and educational choices made for children (Magnuson 2007; Andrabi, Das et al. 2009; Greenberg). Marital status is included as a proxy for household stability, as there is evidence that both indicators are related to child outcomes and wellbeing (Osborne and McLanahan 2007). Finally, household SES is included due to evidence for a relationship with school choice (Glazerman 1998; Msila 2005; Andrabi, Das et al. 2009; Msila 2009; Hunter 2010). 46 At the community level, relevant variables might include the quality and attributes of local schools, the affluence and education levels of the community, as well as the coherence of the community itself, and the extent to which it suffers from problems such as crime. This thesis includes a range of measures of school quality, such as poverty quintile rating, school fees charged, matric examination performance, and historical level of advantage, along with the racial composition of the student body and the size of the school, all of which have been demonstrated or hypothesized to relate to school choice in the South African context (Paterson and Kruss 1998; Fiske and Ladd 2004; Fiske and Ladd 2005; Msila 2005; Woolman and Fleisch 2006; Lombard 2007; Msila 2009). The thesis also includes a measure of community poverty, as this may influence the willingness of parents to send children to local schools (Msila 2005; Lombard 2007). All of these variables at the child, family, community levels are expected to feed into the decision making process, in which children and families weigh their desired outcomes in terms of school level attributes, with the investment required, and the constraints that they face. Required investments are likely to depend on desired outcomes, the geographical location of both the household and the schools considered, and the socio-economic status of the household. Constraints on investment are likely to depend on household access to human capital, social and economic resources, the structure of the household, and the extent to which the household prioritizes educational or other forms of investment. This decision making process is likely to result in the identification of a small group of schools which the child and family considers appropriate and feasible. The school at which a child finally enrols is likely to be shaped by some extent by school-level constraints, such as whether the school still has space available, and also probably by some degree of chance. 47 2.8 Conclusion This chapter has provided an overview of school choice practices and policy, both internationally and in South Africa, and has also provided a review of the school choice literature relevant to educational equality and segregation. It has identified a number of gaps in the scholarly literature, which this thesis aims to fill. At the international level, these gaps include the general lack of empirical data relating firstly to the determinants of school choice, and secondly to the implications of school choice for educational segregation and equality, and the absence of an appropriate conceptual framework for the empirical investigation of these issues. At the South African level, gaps include the absence of information about the dimensions of learner mobility, as well as a general shortage of methodological tools with which to explore the issue. The chapter has also highlighted the ways in which this thesis will make original contributions towards filling these gaps, at methodological, theoretical and empirical levels. Methodologically, contributions will include the use of a longitudinal, population-based dataset, at a level that provides both some generalizability, but also fairly detailed information at the level of the individual, the use of data from a range of different sources to explore potential determinants of mobility at a range of different levels, and finally the use of three different operational definitions of learner mobility to allow the exploration of a range of different dimensions of the phenomenon. Theoretically, the contribution will include a preliminary theoretical model of the determinants of school choice at the child, household, community and school levels. In addition, it will provide data and insight regarding long- standing debates around the relationship between school choice and educational equality and segregation. Finally, empirically, it will provide the first population-level data documenting the scope and dimensions of learner mobility and school choice in post-Apartheid urban South Africa, along with preliminary data on the determinants of these phenomena. 48 Chapter 3: Methods 3.1 Methodological approach: quantitative secondary data analysis This project makes use of quantitative, secondary analysis of pre-existing data to explore questions related to the extent and nature of learner mobility in contemporary urban South Africa. Secondary analysis of pre-existing data is a well-accepted research method with a long history of use in educational research as well as the study of mobility (McMillan and Schumacher 2005; Smith 2006; Fleisch and Schindler 2008). As will be described later, taking a quantitative approach to secondary analysis is particularly well suited to answering the questions posed in this thesis. Secondary data analysis is an approach to research that is based on the analysis, or in some cases the reanalysis, of pre-existing data (Bryman 2004; McMillan and Schumacher 2005). Typically, this data, which may be quantitative or qualitative, and may consist of primary or secondary sources, was originally collected for a particular purpose other than the research project under consideration. Secondary data analysis allows this data to be reused, to answer a different set of research questions. A major strength of this methodology is therefore the ability to make use of pre-existing data, eliminating the need for time-consuming and expensive data collection, and allowing for more time and effort to be dedicated to analysis. Eliminating the need to collect data also allows time and resources for the analysis of a greater volume of data, possibly covering a longer period of time, and greatly improving the breadth and reliability of work. Additionally, it enables research projects to make use of data from multiple sources, increasing the depth of findings, or to explore a particular historical era, generating period-specific conclusions. The volume of data available for analysis is likely to be far greater, and potentially of higher quality, than the data that could be collected 49 during the limited time, and with the limited resources, available for most doctoral research. Quality of data is also less of a concern, as most large, pre- existing datasets have already gone through multiple levels of quality checking. Quantitative analysis typically refers to the use of statistical approaches to deriving meaning from numerical data. Strengths of quantitative research include its potential for extracting meaningful and non-obvious information from large pools of data, and the efficacy with which it can be used on large sample sizes. Weaknesses include an inherent assumption that the principles of the scientific method apply to human phenomena, the inability to incorporate qualitative contextual information, and a deceptive sense of accuracy generated by the availability of numerical results (Bryman 2004). Its strength in aggregation, which makes it so valuable in providing an overall measure of a phenomenon, does, however, often also result in a substantial loss of individual detail. Taking a quantitative approach to secondary data analysis provides a research method well-suited to the major empirical questions the project answers, particularly in the context of an extremely limited and almost entirely qualitative pre-existing empirical literature. Measuring the scope of learner mobility in contemporary urban South Africa – the first major empirical task undertaken in this thesis – is essentially a quantitative question, and requires a quantitative approach. While we already have some information about children travelling to particular schools, and about learner mobility within particular, fairly constrained, communities (Sekete, Shilubane et al. 2001; Fiske and Ladd 2004; Msila 2005; Msila 2009), we don‘t currently have a broader understanding of the scale of this mobility. Qualitative approaches have proved informative in exploring some reasons for learner mobility, as well as documenting the behaviours of individuals, but they cannot give us an overview of overall levels of learner mobility in a major urban area. For this, a 50 quantitative approach to the analysis of data drawn from a fairly large sample is required. Answering questions about the scale of mobility also requires the use of data collected at a population level. To date, the large majority of research on learner mobility has explored the question either by focusing on particular schools, or by making use of a non-representative sample, typically drawn from a fairly geographically constrained area (Sekete, Shilubane et al. 2001; Fiske and Ladd 2004; Msila 2005; Msila 2009; Hunter 2010). While these approaches provide valuable data, particularly with regards to the causes and implications of the phenomenon, learner mobility appears to be highly clustered around particular schools, and amongst particular groups of people. This means that any sample that is not drawn to be relatively representative of a fairly sizeable and varied population is unlikely to provide an accurate measure of the overall scope of learner mobility. Unfortunately, collecting data on a relatively representative sample of a substantial population, such as that found in major urban hubs, is an extremely complex and time-consuming process, particularly if data is wanted for more than one point in time. Drawing on a dataset that has already been collected offers a way to gain access to a volume of reasonably representative, high-quality data that could not be otherwise be obtained in the context of a PhD project. A second major empirical question posed by this thesis relates to the patterns and correlates of learner mobility in contemporary urban South Africa, particularly with respect to socio-economic status. In answering this question, using pre-existing data is particularly valuable, as it allows for access to a wider range of variables, often over a wider interval of time, than would be feasible to collect for a single thesis. In particular, using secondary data provides access to data from a range of different time points. It also allows the researcher to tap into data from a range of different sources, and combine these to enable the exploration of dimensions of the phenomenon that might 51 otherwise not be possible. Identifying patterns in learner mobility also requires access to data for a large and reasonably representative sample of individuals. This provides further support for the use of quantitative secondary data analysis for this project. 3.2 Dataset selection Making use of quantitative secondary data analysis requires access to an appropriate data set. Although it would be ideal to draw data from a nationally representative sample, the type of data needed for this study (specifically data on the school at which children are enrolled) has unfortunately not been collected in the national census, or other nationally representative household surveys such as the October Household Survey, the Labour Force Survey or the Community Survey. Fortunately, South Africa has a number of other studies tracking fairly large numbers of people of varying ages, in different parts of the country, for different lengths of time. Making use of one of these studies was therefore the most feasible way of obtaining the necessary data. Deciding which of these various datasets would be most appropriate to use in answering the questions this thesis poses was an important step in developing the project. Key considerations were the extent of data focused on school-aged children, and in particular the availability of residential addresses and school enrolment information, the extent to which this data was available longitudinally, and the extent to which the children included in the study were representative of a fairly well-defined and large population. After substantial consideration, the Birth to Twenty (Bt20) study, based in the Soweto- Johannesburg area of South Africa was selected. Appendix A contains details of each of the other datasets considered for use, and documents the reasons that these sources were decided against in the context of this particular project. In summary, however, there were three main reasons for the selection of Bt20. These are explored in more detail below, but relate firstly to the availability of 52 particular variables, secondly to the availability of data for particularly important time points both in South African history and in the educational lives of the sample members, and thirdly to the fact that the data was made accessible to me. 3.2.1 Birth to Twenty The Birth to Twenty (Bt20) cohort study started in 1989 with pilot studies to test the feasibility of a long-term follow-up study of children‘s health and wellbeing (Yach, Cameron et al. 1991). Women were enrolled in their second and third trimester of pregnancy through public health facilities and interviewed regarding their health and social history and current circumstances. Singleton children (n=3 273) born between April and June 1990 and resident for at least 6 months in the municipal area of Soweto- Johannesburg were enrolled into the birth cohort and have been followed up 16 times between birth and 20 years of age (Richter, Norris et al. 2004; Richter, Norris et al. 2007). During the last 7 years, young people have been seen twice a year, at the Bt20 offices and at home. Attrition over two decades has been comparatively low (30%), mostly occurring during children‘s infancy and early childhood, and approximately 2 300 children and their families currently remain in contact with the study (Norris, Richter et al. 2007). The sample is roughly representative of the demographic parameters of South Africa with equal numbers of male and female participants. Assessments across multiple domains have been made of children, families, households, schools and communities during the course of the study, including growth, development, psychological adjustment, physiological functioning, genetics, school performance, and sexual and reproductive health. The third generation, children of Bt20 children, began to be born in 2004. The Bt20 research programme, including all data collection, has received clearance by the Ethics Committee on Human Subjects at the University of the Witwatersrand (M010556). The Federal-Wide Assurance registration number of the Committee is FWA00000715. 53 Of particular importance to the selection of this dataset for use in this thesis, data on both residential address and on school enrolment was available for each Bt20 participant at multiple time points, providing a uniquely longitudinal record of schooling and residence, and allowing for an exploration of variation in mobility over time. The residential data was particularly promising, as substantial work in cleaning the data had already been conducted for another PhD project (Ginsburg, Norris et al. 2009; Ginsburg, Richter et al. 2010), and GIS coordinates had been collected for most residential addresses at three recent time points. As the Bt20 study was designed as a cohort of children born in the Johannesburg-Soweto metropolitan area, and has achieved fairly limited attrition, this also meant that its use would permit the development of a roughly representative understanding of the extent and nature of learner mobility for a substantial urban area. In addition, in contrast to the Cape Town metropolitan area, the Johannesburg-Soweto area is substantially more similar to the rest of South Africa, and particularly urban South Africa, in terms of the performance of the schooling system, as well as the demographic makeup of the population. Bt20 also had the advantage of providing data on the earlier years of schooling, rather than only focusing on adolescence and later, where much South African research on schooling behaviour tends to be focused. A final important consideration was that Bt20 was able to make all residential and schooling enrolment data available to me for the purposes of this study, on the understanding that no information that could allow for the identification of participants would be made public. Finally, the Principal Investigator of the project, Prof. Linda Richter, and the Project Director, Prof. Shane Norris, both expressed interest in this project, and indicated a willingness to provide intellectual support for the work. Concerns about making use of the data from Bt20 included the limited amount of previous work making use of the more detailed schooling data collected. While data regarding the school at which each participant was enrolled in each 54 year had been collected, this had not previously been used, and was therefore likely to require substantial cleaning. While substantial work on the residential addresses and residential mobility of the cohort had already been conducted (Ginsburg, Norris et al. 2009), the GIS data had also not previously been worked with. A second concern related to the cohort nature of the database, particularly given the historically unique period of time, marking South Africa‘s transition to democracy, during which the participants were born and grew up. In a society as rapidly changing as South Africa during this period, it is possible that conclusions derived on the basis of this particular cohort might not apply to children born and attending school slightly later. A final concern relates to the extent to which the study sample is indeed representative of the youth population of the Johannesburg-Soweto metropolitan area. In particular, as will be detailed in the next section, decisions around data collection, combined with different response rates, has led to variable levels of enrolment, and subsequently attrition, for individuals from different racial and socio- economic backgrounds. Minority racial groups are therefore underrepresented in the study, as are both the most advantaged and the most disadvantaged individuals (Norris, Richter et al. 2007; Ginsburg, Norris et al. 2009; Richter, Panday et al. 2009). However, the study does appear to remain representative for the middle-income African population of the area, which is the group of greatest interest for the questions asked in this thesis. These concerns notwithstanding, Bt20 was the most feasible and suitable dataset for use in this thesis, and was therefore selected. 3.3 Ethical considerations Ethical clearance for this thesis was received from the University of the Witwatersrand. The letter of approval is attached to this thesis as Appendix B. As the project relies only on secondary analysis of existing data, there was no data collection conducted for this project, and there were by extension no ethical issues related to data collection or study instruments for this study. As 55 detailed previously, all data collection for the Bt20 study received clearance by the Ethics Committee on Human Subjects at the University of the Witwatersrand (original Birth to Ten protocol ethics clearance: 24/1/90; extended Birth to Twenty protocol ethics clearance: M01-05-56). Bt20 data is owned by Birth to Twenty, which is located in the Department of Paediatrics in the Faculty of Health at the University of Witwatersrand. Access to the data required for the purposes of this project was provided by Birth to Twenty. All data was made available on the basis of unique identifying numbers attached to each individual, with names and other identifying information removed. The sole exclusion to the removal of identifying information was with regards to residential addresses, which were essential to the project. All residential address data was stored securely at all times, on a password protected computer. Care was taken to ensure that exact residential addresses of individuals were not disclosed in writing this thesis and related work, and that addresses were only ever presented at the level of suburb. Similarly, graphics are presented at a level of detail which ensures that an address cannot be identified. In all other instances, only aggregated data is presented. The Bt20 data was supplemented by data from two additional sources. Data from the South Africa National Census 2001 was used to provide information regarding the demographics and socio-economic conditions of the communities in which Bt20 participants lived and schooled. Census 2001 data was used aggregated at the small-area level4 and higher, and did not contain any information which might identify individuals or households. Data provided by the South African National Department of Education (DOE, renamed the Department of Basic Education in 2009) was also used to supplement Bt20 data. This data consisted both of publicly available data, and data made available specifically for this project. All DOE data was provided at school 4 Detailed explanations of the various geographic levels used, including the small area level, are provided in the section detailing the Census 2001 data used, below. 56 level, and was linked to particular schools. Non-public data was made available on the understanding that data would not be reported in such a way as to link it to an identifiable school. 3.4 Overview of data and variables used In this section, I provide information on the construction of each of the variables used during my analysis, including details of the data sources used in their generation, and any cleaning or modifications to the data that were required. Descriptive statistics for each variable are presented in Chapter 4. As discussed in Chapter 2, variables relating to potential child-level, household and maternal-level and community-level determinants of mobility are included in this thesis. Figure 3.1, below, shows the location of each of the variables considered within the conceptual framework used in this study. Figure 3.1: Location of study variables within the conceptual framework presented in Chapter 2 Community Home location Community poverty index Maternal Maternal marital status Maternal education Child Race Gender School attended Age at first enrolment Grade repetition Phase of schooling 2003 Household & Family Household SES 1990 Household SES 1997 Household SES 2003 Change in SES, 1997 to 2003 57 3.4.1 Child level variables Race All children were recorded as either white, black African, coloured, or Indian, on the basis of information provided by caregivers shortly after birth. These groupings are not meant to denote biological categories, but represent socially meaningful categories in the South African context, as they are the groupings on which Apartheid-era policy was based, and which continue to shape to a significant extent the life experiences and opportunities open to South Africans. This variable has been extensively used in existing analyses, and did not require any cleaning or manipulation. Gender All children were recorded as either male or female on the basis of information provided by caregivers after birth. As with race, this variable has been extensively used, and required no cleaning. School attended The Bt20 education data required substantial manipulation prior to use. Firstly, as data collection waves did not mirror the academic year, it was necessary to restructure much of this data so that school name and grade could be attached to a particular calendar year, rather than a data collection wave. Secondly, data was available from two different types of source: prospective data, collected during each wave, and retrospective data collected for all calendar years to date during study Year 14. Thirdly, to maximise the value of the school name data, it was necessary to match each school‘s name to the correct Education Management Information System (EMIS) number, as it is through the EMIS number that information about a school, such as it GIS coordinates, enrolment and resource levels, can be obtained. The school level variables derived on the basis of EMIS numbers are discussed later, in the section on school variables. 58 This section focuses on the education-related variables obtained from the Bt20 data. The cleaning and reorganization of Bt20 education data followed a three-step process. Firstly, all prospective schooling data was reorganized by calendar year. Secondly, whenever the prospective record was incomplete, retrospective data, where available, was used to fill in these gaps. Thirdly, using information on the school name, location, and grades, schools were matched to the appropriate EMIS number. Due to the variable spellings used for school names, and the existence of several pairs of schools with the same or similar names, this process had to be done manually. All cases in which there were inconsistencies or a lack of clarity around which school the child attended were checked against the original data collected. When it was not possible to identify definitively the particular school attended, the school was coded as missing. School attended variables were generated for two points in time, 1997 and 2003. These two points were selected respectively as the earliest point at which all children could be expected to be enrolled in primary school, and the end of primary schooling. Data for 1997 was drawn from the Year 7 round of interviews, combined with retrospective data from Year 14. Over all, for 1997, schools were identified and matched to EMIS numbers for 1241 of the 1428 study sample5 members. Data for 2003 was drawn from the Year 13 and Year 14 interview rounds, and schools were identified and linked to EMIS numbers for 1311 of the study sample members. Age at first enrolment in school Age at first enrolment was calculated using the child‘s grade in 1997, controlling for repetition. If a child started formal schooling at the earliest 5 Details for the selection of the 1428 children included in the study sample are provided in a subsequent section of this chapter. 59 possible point, they would have been in grade 1 in 1996, and therefore, barring repetition or failure, would be in grade 2 in 1997. These children, as well as a few who appeared to start particularly early and were already in grade 3 by 1997, were classed as early-starters. Children in grade 1 in 1997 (with the exception of those who had repeated a grade), as well as a handful not yet enrolled in school by 1997, were classed as late-starters. Note that most of the children classified as late-starters by this variable do actually start school on time with regards to official policy, which only requires children to start school by the year in which they turn seven. However, in order to obtain any variation on this variable, it was necessary to look at the timing of first enrollment within the bounds specified by policy. Phase of schooling in 2003 Phase of schooling is a binary variable indicating whether a child was still attending a primary school in 2003, or whether he or she had progressed to high school already. Grade repetition Grade repetition is a binary indicator, coded 1 if the child repeated any grades between 1997 and 2003, and 0 if the child did not repeat any grades. 3.4.2 Household and maternal level variables Maternal marital status The maternal marital status variable is based on self-reported maternal data, and refers to 1990, the year of the child‘s birth. Mothers selected from the options married; unmarried but living together, single, and divorced/separated/widowed. For the purpose of this analysis, in which marital status was used as a proxy for household stability, all options other than married were combined into a single unmarried category. 60 Maternal education The maternal education variable is derived from each cohort member‘s mother‘s self-reported highest completed level of education in 1990, at the birth of the child. Mothers selected from the following options: no formal education; up to and including grade 5; grade 6 or 7; grade 8, 9 or 10; grade 11 or 12; and post-school education. When mothers reported post-school education, more detailed information as to the nature of this education was collected. As very few mothers reported no formal education (n=13 in the study sample), this category was merged with the group of mothers completing up to grade 5. As education beyond Grade 5 is used as the cut-off point for determining functional literacy, this group of mothers can be classified as functionally illiterate. Due to the relatively low numbers of mothers who had completed any specific type of post-school education, all forms of post-school education were combined into a single category. 1990, 1997 and 2003 household SES (raw PCA score and quintile) During each wave of data collection, a varying number of different indicators related to socio-economic status were collected. Using each of these indicators independently is not really feasible. Firstly, they tend to be very highly correlated, and secondly, the value of ownership of a particular asset, or access to a particular service, tends to change substantially over time. For this reason, it is more appropriate to combine those indicators appropriate to a particular point in time into a single SES index score, and use this in analyses. Grouping sample members into quintiles on the basis of their scores at each time point provides a straightforward means for comparison over time. Due to a combination of changing societal context over time, and the use of different indicators during each data collection wave, it is challenging to construct an SES variable comparable across different study time-points. The best option available within the Bt20 data was to construct an SES score drawing on asset ownership and housing quality data for three different points 61 in time (Filmer and Pritchett 2001; Rutstein and Johnson 2004; Howe, Hargreaves et al. 2008). Data from study years 0, 7, 12, and 13 were used to construct scores for 1990 (around the time of the child‘s birth), 1997 (around the time of enrolment in primary school), and 2003 (around the time of completion of primary school). See Table 3.1 below for details of the variables used in composing each SES index score. Time point 1990 1997 2003 Variables used in score creation All variables collected during pregnancy or the first two years of the child’s life (1989- 1992): Home type (1=house or flat; 0=anything else) Home ownership (1=owned; 0=anything else) Water type (1=indoor running water; 0=anything else) Water use (1=sole use; 0=shared) Toilet type (1=indoor flush; 0=anything else) Toilet use (1=sole use; 0=shared) Electricity in the home (1=available; 0=not available) TV ownership (1=owned; 0=not owned) Car ownership (1=owned; 0=not owned) Fridge ownership (1=owned; 0=not owned) Washing machine ownership (1=owned; 0=not owned) All variables collectd in study year 7 (1997-1998): Home type (1=house or flat; 0=anything else) Home ownership (1=owned & fully paid; 0=anything else) Water type (1=running indoor; 0=other) Toilet type (1=indoor flush; 0=other) Radio ownership (1=owned; 0=not owned) Car ownership (1=owned; 0=not owned) Washing machine ownership (1=owned; 0=not owned) VCR ownership (1=owned; 0=not owned) Microwave ownership (1=owned; 0=not owned) Variables collected in study years 12 (2002-2003) or 13 (2003-2004): Home type (Year 13; 1=house or flat; 0=anything else) Water type (Year 13; 1=hot and cold indoor running water; 0=anything else) Toilet type (Year 13; 1=indoor flush toilet; 0=anything else) Electricity in the home (Year 12; 1=yes; 0=no) TV ownership (Year 12; 1=yes; 0=no) Radio ownership (Year 12; 1=yes; 0=no) Motor vehicle ownership (Year 12; 1=yes; 0=no) Fridge ownership (Year 12; 1=yes; 0=no) Washing machine ownership (Year 12; 1=yes; 0=no) Telephone ownership (Year 62 12; 1=yes; 0=no) VCR ownership (Year 12; 1=yes; 0=no) Microwave ownership (Year 12; 1=yes; 0=no) MNet ownership (Year 12; 1=yes; 0=no) Satellite TV ownership (Year 12; 1=yes; 0=no) Cellphone ownership (Year 12; 1=yes; 0=no) Table 3.1: Variables used in the creation of SES scores After the relevant variables for each time point had been identified, all non- binary variables were recoded to become binary variables (see Table 3.1 for the final coding scheme used). A process of manual imputation was then conducted for the 1990 and 2003 data, to reduce the number of missing values. Imputation was done on the principle that if one variable predicted the value of another correctly for 75% or more of the cases without missing data, it could be used to impute the other variable where it was missing. As data for 1997 was either uniformly present, or uniformly missing for all variables, it was not possible to impute any values in this year. Once the amount of missing data had been minimized as far as possible, principal components analysis (PCA) was run for each year, to determine the appropriate weighting for each component variable (Vyas and Kumaranayake 2006). The literature is somewhat divided as to whether it is appropriate to use PCA on binary data, as has been done here. While some studies have shown it to perform as well as, or better than, alternatives (Filmer and Pritchett 2001; Howe, Hargreaves et al. 2008), others have found that its performance is suboptimal (Kolenikov and Angeles 2008). Nonetheless, it remains the accepted standard approach used for working with binary SES data (Filmer and 63 Pritchett 2001; Rutstein and Johnson 2004; Vyas and Kumaranayake 2006), and as such, has been used here. Depending on the nature of analysis for which the SES data is used in this thesis, either the raw scores generated by the PCA process are used, or the appropriate sample is divided into quintiles, which are then used. Change in household SES from 1997 to 2003 An additional variables, change in SES over time, was constructed for analyses exploring changes in mobility behaviour over time. This was constructed by taking the sample member‘s SES quintile in 2003, and subtracting the sample member‘s SES quintile in 1997. 3.4.3 Community level variables Home location Address information was collected for all cohort members during each wave of data collection, as this information was critical to maintaining contact. GIS coordinates for home addresses, however, were not captured until Year 13 of the study, when home visits were initiated. During Year 13, 15 and 16 home visits, data collectors stood outside of the participant‘s home, and used a mobile GPS device to record the coordinates at that location. These coordinates were then either manually captured on the home visit instrument, or downloaded into a study database at a later point. Unfortunately, while working with this data, it became evident that between a third and a half of the coordinates captured during each data collection wave were incorrect. These problems were traced to data collectors not resetting the GPS device correctly between uses, and therefore not always collecting the correct coordinates for each location. Unfortunately this meant that the GIS coordinates available could not be used without being checked manually for accuracy. Additionally, due to limited available street-name and number data for the Soweto area, and 64 the vague nature of many addresses provided, it was not possible to generate accurate coordinates for the majority of addresses using software or maps. Due to time constraints, a decision was made to pursue the correct GIS coordinates only for those children who had not moved home between 1996 and 2004, as this meant that only one set of coordinates would be required per child. Details of sample generation, and implications of the decision to limit the study sample on the basis of residential mobility are discussed in the section on sample selection and bias below. Of the initial BTT cohort of 3273, 66% (n=2158) completed a residential history questionnaire in 2005 or 2006. These individuals comprise the cumulative non-attrition cases. Data from the questionnaire was used to generate a longitudinal dataset containing all address and residential movement information for these sample members, from birth through to 2004 (Ginsburg, Norris et al. 2009). This dataset was used to distinguish between those individuals who had and had not changed residence during the period 1996-2004. 1470 individuals reported stable residential addresses during this period, and these formed the basis for the study sample. Very few of these children had moved between 2004 and the end of Year 16 data collection, meaning that 3 different sets of GIS data, corresponding to three different study waves, were available in most cases. The residential GIS coordinates for these remaining children were manually checked for accuracy using Google Earth, and fortunately, for the majority of participants, at least one of the sets of coordinates collected did appear to be correct, and could be used. For the remaining children with traceable addresses, GIS data was generated using Google Earth where possible, or otherwise re-collected through an additional visit to the address. Reliable GIS coordinates for residential addresses could not be obtained for 27. These instances of missing information were largely due to street name changes, house numbering systems, and redevelopment of areas, and were distributed throughout the greater Soweto area. 65 Once residential GIS coordinates had been obtained for sample members, a number of additional community-level variables were generated for each set of coordinates using data from Census 2001 – the most recent national census, and that most relevant to the time period under consideration. This data was used for two purposes. Firstly, Census 2001 geographical area boundary data was used to delineate the boundaries of residential areas, suburbs and municipalities. These boundaries were then used to identify the area in which each child lived. This allowed for the later examination of whether children were living in and schooling in the same areas, or not. Secondly, I also made use of Census 2001 data to provide contextual information, in the form of a poverty index, for the areas in which learners live, and the areas in which they attend school. Census geographical area delineation Census geography was used to define area boundaries as census geography generally – although not always – corresponds fairly closely to local perceptions of area boundaries. They are sensitive to the ways in which socio- economic factors and history, along with geographical features, have shaped perceptions of areas. In addition, the Census 2001 provides 4 different levels of geography, which allow for the exploration of mobility and context at a range of different levels. The smallest level used here is the Small Area Level (SAL). Each SAL typically corresponds to between one and three enumerator areas, and contains approximately 200 households. This level of geography is the lowest level at which census data is released. Given the relatively small size of most SALs, most ‗areas‘, as typically perceived, tend to contain several of them. The next level of geography is Sub-Place name level (SP). This level typically corresponds with residential suburbs, or small but distinct areas of a city. Examples might be Pimville or Diepsloot. The third level, Main-Place name 66 (MP) corresponds roughly to small cities or towns, or large but distinct areas within a large city, for example Soweto. Finally, the largest level of geography used in these analyses is the Municipal level (MN), which represents entire municipalities or districts. Each SAL is entirely contained within a particular SP, each SP within a particular MP, and each MP within a particular MN. The GIS coordinates home address for each sample member was linked to the SAL, SP, MP and MN in which the address falls, using gvSIG software. Similarly, the GIS coordinates for the school attended by each child, in both 1997 and 2003, were also linked to the SAL, SP, MP and MN within which they fall. Once these linkages have been made, it is possible to test whether children live and attend school in the same area or not. It is also possible to use these linkages to obtain information about the area in which the home or school is situated, as is discussed in the next section. Community poverty rating To explore the nature of the areas in which the BT20 families live, small area level data from Census 2001 was used. For each area of geography (SAL, SP, MP and MN), a PCA was conducted using a range of variables related to affluence, and obtained from the Census 2001 SAL dataset. Variables used were the percent of the working age population employed; average household income; the percent of households living in informal dwellings; percent of adults who had no secondary schooling; the percent of the area‘s population who were black Africans; and the percent of households who did not have access to services such as running water, electricity, hygienic toilets, refuse removals and landline telephones. The results of the PCA were used to generate a poverty level score for each area. This score has been used in its raw form, and has also been used to divide sample members into quintiles on the basis of the poverty of the area in which they live or attend school, where appropriate. 67 3.4.4 School variables In addition to the child, family and community-level variables above, which are hypothesised to shape and constrain decision making around school choices, attributes of potential and selected schools are also expected to play a role in shaping decisions. In addition, data on the properties of the school a child attends also provides valuable information on the quality of education that child is likely to receive. As a result, the variables documented above are supplemented with additional data on Gauteng province schools, obtained from the South African National Department of Basic Education (formerly the Department of Education). The bulk of the data comes from the Educational Management Information System (EMIS) 2008, 2009 and master schools lists for Gauteng, and the 2002 and 2003 Annual Schools Survey (ASS). This data includes the GIS coordinates of schools, historical classification of schools, school poverty quintile rating, enrolment data, and data on school fees. Although there are some important concerns about the quality of data coming from the Department of Education, particularly given that most data is self- reported by schools, it is by far the best available source of school information. This self-reported school-level data is supplemented with matric pass rates for secondary schools in 2002, which were obtained from the Department of Basic Education. Although this thesis examines schooling patterns from 1997, the schools data used is for 2002 or later. This is for a few different reasons. Firstly, data available for the post-Apartheid period prior to 2002 is primarily through the School Register of Needs (SRN), conducted in 1996 and 2000. Despite several requests, I was unable to obtain a copy of the school-level data collected in the SRN. Secondly, even had access been possible, issues of compatibility between SRN and ASS data would have limited the utility of earlier data (Yamauchi 2004). Finally, there is reasonable evidence to believe that the variables of primary interest in this thesis are unlikely to have changed substantially for individual schools over the period under consideration. These 68 reasons, along with the data source used, are discussed in the following paragraphs. Data sources are also summarized in Table 3.2 below. Variable Source Year School location (street address & GIS coordinates) Gauteng Schools Master List 2008 (The list includes information on schools operating in 2002, but which subsequently closed) School sector (Public or private) Gauteng Schools Master List 2008 Section 21 Status Gauteng Schools Master List 2008 Phase (Primary, Secondary or Combined) Gauteng Schools Master List 2008 School resource levels: School fees charged Annual Schools Survey 2002 School resource levels: Poverty Quintile rating Gauteng Schools List 2008 Historical Department of Education Gauteng Schools List 2008 School enrolment Annual Schools Survey 2002 Racial composition of the student body Annual Schools Survey 2002 Matric pass rates by school Department of Basic Education 2002 Table 3.2: Sources for school-level variables used School location, school sector, school phase, and Section 21 status School location, school sector (whether the school is public or private), school phase (primary, secondary or combined) and Section 21 status (whether or not the school is allowed to manage its own finances) are all typically fairly stable, and the data for these variables was obtained from the Gauteng schools master list for 2008. Using this data source meant that the variables were available not just for schools operating in 2008, but also for those schools which had closed previously. Although it would have been ideal to obtain these variables, with the exception of school location, from an earlier source, limited access to data 69 meant that this was not possible. Given the stability of school locations, the 2008 data was given preference over earlier sources due to the generally increasing accuracy of GIS data over time. School quintile rating Data on school resource levels proved somewhat complicated to obtain. The resource variable which is clearest and most widely used for South African schools is probably the school quintile rating. The quintile system, which ranks schools in categories from 1-5, depending on their resource levels, was first instituted in 1998 by the Department of Education‘s Norms and Standards for School Funding. At this point, schools were assigned to quintiles on a provincial basis. This was amended in 2006, after which schools were assigned to quintiles at the national level. This meant that many of the poorer school in the wealthier provinces were moved to higher quintiles at this point (Pampallis 2008). In using school quintiles as a measure of resource levels in 2002, it was necessary to decide between the old, provincial quintiles, and the newer, national quintiles. Unfortunately it did not prove possible to obtain the quintile ratings under the older provincial system, and so the ratings used here are those for 2008, and therefore developed under the national system. Although this is not ideal, it should be noted that despite changes in the quintile system, the relative resource levels and of schools with respect to each other, and related to this, their performance, has remained relatively constant (Fiske and Ladd 2004; Fiske and Ladd 2005; Fleisch 2008). That is, those schools which were most advantaged in 2002 are, more or less, the same schools that were most advantaged in 2008. Therefore, although the actual quintiles assigned to schools may have changed somewhat between 2002 and 2008, the rating of schools relative to each other is unlikely to have changed substantially. Using the 2008 poverty quintile ratings for the 1997 time point, when cohort members were entering primary school is even more problematic than its use for the 2003 time point. Between 1996 and 2000, substantial infrastructural 70 investments were made in historically under-resourced schools, particularly in upgrading the basic infrastructure and facilities at schools, for example, sanitation, telecommunications, water and power (Department of Education 2000). However, most of this investment was focussed on rural areas, and therefore was of less relevance with regards to schools in the Gauteng province. More importantly, the redistribution of resources occurred largely between provinces. By contrast, within provinces, even if the overall levels of resources changed, the relative proportions going to schools which historically served different race group remained fairly similar (Fiske and Ladd 2005). Historically advantaged schools continued to receive higher levels of state funding than historically disadvantaged schools, and this discrepancy was further exacerbated by the ability of parents at historically advantaged schools to supplement state funding through higher school fees (Fiske and Ladd 2005). Overall, then, those schools which were most advantaged in 2002 were also those that were most advantaged 1997, while those with the fewest resources in 2002 were those which had always received the fewest resources. School fees Due to the age of the data, as well as various concerns that have been raised about the quintile system more broadly (Kanjee and Chudgar 2009; Kanjee and Chudgar 2009), this thesis also makes use of two additional measures of school resources. Firstly, the school fees charged by schools in 2002, as reported by schools in the ASS 2002, is used as an indicator of the school‘s access to resources. It is widely accepted that schools with higher resource levels charge higher fees (Fiske and Ladd 2004; Pampallis 2008). This data is also valuable in that it dates to the time at which the children in this study were actually attending the schools in question. Historical department of education Secondly, as much of the performance of South African schools today continues to be explained by the educational department under which they fell 71 during the Apartheid era, a variable indicating whether the school was operated by the Department of Education and Training (DET) during the Apartheid era was used. The DET was responsible for the running of urban schools serving black children, and DET schools received far fewer resources than all other urban schools, and typically continue to be under-resourced and poorly performing to the present time (Fiske and Ladd 2004; Fiske and Ladd 2005; Fleisch 2008). This variable was obtained from the 2008 schools master list. Not all currently disadvantaged schools operated under the DET during Apartheid, and any schools which have been opened since the end of Apartheid will obviously not fall under historical DET status. Nonetheless, given the large number of education departments in place during the apartheid era, the binary variable used here to indicate the historical status of the school is the most feasible available option, and contains valuable information about the history of a school. As it is a historical variable, the use of the 2008 data in its composition is not problematic. School enrollment The next group of variables used related to school size and the composition of the student body. For school size, the enrolment reported by the school in the ASS 2002 was used. As this data is self-reported by schools, concerns have been raised that the figures may be inflated, particularly for less well-managed schools. However, this is the best available data, and as such, is used here. With regards to figures around school size, Fiske and Ladd (2005) and The Department of Education (2000) note some fluctuations in enrolment levels, but these do not appear to be very substantial. Importantly, school choice had already been possible for a number of years by 1997, suggesting that any initial surge in changing school enrolments post-Apartheid had probably already largely stabilised. Nonetheless, the data for 2002 was the earliest that could be obtained, and is therefore used as a proxy for 1997. 72 Racial composition of student body The variable used to describe the racial composition of the student body of the school was also derived from ASS 2002 data. For this variable, the reported number of black African students enrolled at a school was divided by the schools total enrolment, to obtain the proportion of the student body who were black African. Again, the use of 2002 data for the 1997 time point raises some concerns, as discussed above. Matric pass rate The final variable examined here is the Matric pass rate – the proportion of a school‘s students who write the national school-leaving examinations (the Matric examinations) and pass – which is used here as a proxy for school performance. This data was obtained for 2002 from the Gauteng Department of Education, and is available for the majority of public secondary schools in Gauteng. Private schools can choose between writing independent matric exams, or writing those that public sector schools write. Results are only available here for those schools which chose to write the public sector exams. It should be noted that well performing private schools typically choose the independent option, and so the results presented here for private schools are likely to be extremely biased. Unfortunately, no measure of the academic performance of primary schools is currently available for South Africa. In order to generate a proxy of the likely performance of primary schools in Gauteng, each primary school was matched to its nearest secondary school using gvSIG software. The primary school was then provided with the matric pass rate of its nearest secondary school6. Obviously this method is highly imperfect7, but given the geographical clustering of school performance in 6 Thank you to my examiners for suggesting that a three-year average of matric pass rate be used in future work, as this will help to counter concerns about the high annual variability in performance of many historically disadvantaged secondary schools. 7 Particular concerns about this measure relate to the fact that within any given context, there will always be a few schools which perform particularly well or particularly poorly. This approach has no way of differentiating these schools from those performing at more expected levels, which means that proxy pass rates generated for some schools will be inaccurate. 73 South Africa, it is at least plausible, and given the available data it is the only real option. School name and EMIS number Once all variables to be used in the analyses had been sourced, the next challenge was to create a list of the schools operating in the Gauteng province during the 1997-2003 period, and combine the variables with these schools appropriately. The best available base list of schools obtained was the 2008 Gauteng master schools list. This includes all registered schools in the province, including those which had previously been registered, and had subsequently closed. This list was therefore used as the base for the list of schools used in analyses. All special schools, as well as non-school institutions (FET colleges, pre-primary schools, exam centres, administrative offices, and so on) were dropped from this list, leaving 2604 institutions, both open and closed. All other variables were then merged with this list. Aggregated school quality variables Due to the strong correlations identified between many of the school related variables, and particularly those relating to school quality, it was necessary to combine these variables into an index for use in model generation. This was done by the use of PCA on all school attribute variables that were consistently found to be significantly related to mobility (school quintile, school fees, school enrolment, percent black learners, school sector, historical DET status, and pass rate). This process was repeated for the school attended by each child in 1997 and 2003, as well as for the nearest grade-appropriate school to the child‘s home in 1997 and 2003. In all cases, the eigenvalues of the first two components of the PCA were both greater than 1, and were therefore both retained. 74 3.5 Sample selection and creation of the analytical database As indicated previously, when conducting secondary data analysis, it is critical to develop a clear understanding of the datasets being used, and the way that the methodological decisions involved in creating the data may impact on data analysis and findings. In this section, I begin by providing an overview of how the study sample was composed, and detail the construction of the study‘s analytical database. A major concern with regards to the study sample is the extent to which it is representative firstly of the Bt20 cohort in general, and secondly of the youth population of urban Johannesburg-Soweto more broadly. These issues are explored extensively in Chapter 4, which includes a review of the way in which the Bt20 study was designed and conducted, and how this has influenced sample composition. It also explores how the study population, participant attrition, and composition of the sub-sample used in this thesis are likely to have impacted on the outcomes of the analysis, and on the extent to which findings are likely to be representative of the population of contemporary Johannesburg-Soweto youth. In the current chapter, however, the focus remains on the way in which the study sample and the analytical databases were constructed. 3.5.1 Sample selection As discussed previously, due to unanticipated challenges in preparing the GIS data for use, the study sample was limited to the 1470 cohort members who did not change address between 1996 and 2004. Subsequently, 28 additional children who were either not attending school in 2003, were enrolled in a special school at any time from 1996 to 2003, were enrolled in a school outside of the Gauteng province, or were known to be boarding at a school within the Gauteng province, were also excluded from the study sample. Children with 75 special educational needs were excluded as they and their families are unlikely to experience the same degree, if any, of choice around where they will be educated, due to their particular educational needs. Children attending boarding schools or extremely distant schools were excluded, as they are not travelling on a daily basis. An additional 14 cases of children who had changed address between 1996 and 2004, or who were spending substantial time a different address from their home, we also identified and removed from the sample. This left a sample of 1428 individuals, which formed the sample on which all subsequent analysis for this thesis was conducted. Members of all race groups were retained in the study sample. However, due the very small numbers of Indian and white participants, no results are presented for these groups. See Figure 3.2, below, for a flow chart illustrating this process of sample composition. 76 Figure 3.2: Flow chart illustrating selection of sub-sample for use in thesis. 3.5.2 Creation of the analytical dataset Selection of time points 1997 was selected as the initial year within the analysis as it was the earliest point for which reliable schooling information was available for the majority of cohort members. It also reflected the first point in time by which all cohort members, then aged 6-7, could safely be expected to be enrolled in formal education. 2003 was selected as the point closest to the end of primary schooling for the majority of the sample. In addition, it was selected as the year Cases removed due to non-standard schooling or residence (n=42) Final study sub- sample (n=1428) No residential change 1996-2004 (n=1470) Cases removed due to residential mobility (n=730) =1115 Non-attrition sample 2006, (full residential information) (n=2158) Cases lost to attrition (n=1115) Initial Bt20 Cohort (n=3273) 77 for which the most reliable address data was available and during which GIS coordinates were first collected, but this became irrelevant once the problems with the GIS coordinates had been discovered. Selection of these two time points, 1997 and 2003, allows this thesis to explore both ends of the primary school experience, as well as to address questions about both changes and consistency in schooling experience over time. One drawback of the use of these time points, however, is that by 2003, roughly a third of the sample had already completed primary school, and are enrolled in secondary schooling. This progression to secondary schooling is strongly related to socio-economic status, age at initial school enrolment, and academic performance. As these variables are expected to relate to learner mobility, and as mobility behaviour is also expected to differ between primary and secondary school children, this makes it particularly challenging to draw definitive conclusions about the cause of changing mobility at the end of primary schooling. This concern is discussed where appropriate in the results chapters. Details of the 1997 dataset Of all the schooling data encountered during this project, that for 1997 was, expectedly, the most problematic, for a few different reasons. The prospective data was not as reliable as in subsequent years, as in 1997 the children were still too young to report their own school names. School names were therefore generally provided by caregivers, who were not always aware of which school the child attended, or knew of the school only by an informal name. Secondly, data capturing for 1997 schooling data was only done in 2009, and was done using a drop down menu filled with school names as of 2009. This induced a number of data capture errors, particularly when several schools had similar names, as the data capturers would simply choose the first name that appeared to match on the drop-down menu. It also introduced problems when schools had closed or changed their names between 1997 and 2009, as the school name would not appear on the drop down list. In these cases, data capturers were 78 also inclined to simply choose the most similar name on the drop down menu – which would be incorrect. For this reason, all cases in which the school name provided for the child was similar to other contemporary or historical primary schools in the Gauteng area were double checked, and corrections made whenever possible. However, it is quite probable that there do remain a small number of incorrect school attributions in the data. The major reasons for concern about the quality of the retrospective data relates to the substantial length of time between 1997, and 2005-2006 when the retrospective schooling data was collected, and the fact that if children had changed schools frequently, they may have struggled to remember accurately which school they attended in 1997, when they were only 6 to 7 years old. As a result, two different home and schooling datasets were created for 1997. The first dataset was created by combining the 1997 address and GIS data with the prospective schooling data for that year. While this schooling data was highly accurate, it was only available for 760 of the sample individuals, just over 50%. Given this extremely high level of missing information, a second dataset was constructed using the less reliable, but more comprehensive retrospective data to fill in as many gaps as possible. This second dataset contained schooling information for 1244 sample members, with EMIS data missing for only 184 individuals. Given the complementary nature of these two datasets, with one being far more complete, but the data in the other being far more reliable, initial analyses were conducted using both datasets. As both datasets provided similar results, the more comprehensive one was used for the majority of analysis presented in this thesis. Details of the 2003 dataset For the same reasons as described above, two home and schooling datasets were also constructed for 2003. The dataset using only the more reliable prospective data contained information for only 760 cases, again just over 50% of the sample. When the retrospective data was integrated, schooling 79 information became available for 1310 sample members, with data missing for only 118 individuals. Given that the retrospective schooling data was collected in 2005-2006, the relatively lower number of cases with missing data is as expected. The accuracy of the retrospective data is also expected to be greater than for 1997, more closely resembling the prospective data collected in 2003. Again, analyses using both sets of data provided similar results, and the more comprehensive data including retrospective information is therefore used. 3.6 Operationalization of learner mobility Given the poorly developed state of theory around the concept of learner mobility, particularly within the South African context, as discussed in Chapters 1 and 2, I make use of data from a range of different sources to explore a number of alternative ways in which learner mobility might be operationalized. I begin by using GIS coordinates to calculate the straight line distance between each participant‘s home and school. I then explore definitions shaped by movement between different areas as defined by census geography. Finally I explore whether a child attends his or her nearest grade-appropriate public school. 3.6.1 A distance-based definition of mobility As discussed previously, a number of distance based definitions of what constitutes learner migration or mobility have been advanced. Typically, these work on the principle of assessing what constitutes excessive travel, on the basis of assumptions around the age of the child, the safety of the area, the availability of safe and affordable public transportation, and possibly other context-dependent concerns. Internationally, the literature suggests a range of maximum distances, ranging up to 10km. In the South African context, the maximum distance that a child should need to travel has tended to be fixed from between 2.5km up to this maximum of 10km. In current official policy, a 80 school‘s catchment area is defined as the area within a 3km radius of the school, suggesting that this is felt to be, at the policy level, the maximum distance a child should travel (Martin 2010). In working with a distance as an indicator of mobility, it is possible to either create a binary variable, using a particular distance as an indicator of whether or not mobility is occurring, or to work with distance as a continuous measure of the extent to which mobility is occurring. The binary, all-or-nothing indicator approach is the one typically implied by existing distance-based definitions of mobility, where mobility either occurs, or does not, on the basis of a specified cut-off point, with no middle ground. While a binary approach to measurement is therefore most useful from a policy assessment point of view, mobility can also be understood as something which always occurs, but at variable levels. As distance travelled has a close relationship to the resources required to be dedicated to that travel, a continuous definition of mobility can be seen as more closely related to reality as experienced by individuals engaged in mobility. As these two different approaches to measurement are complementary, this analysis makes use of both of them. A second concern when working with distance-based definitions of mobility relates to the way in which the distance from home to school should be determined. Straight-line measurements are both methodologically most simple, and are what tends to be used in mobility-related policy and assessment. However, it does have shortcomings, the most obvious being that children do not travel to school in a straight line, but make use of roads, footpaths, and transportation networks. Furthermore, the relationship between straight-line distance, and actual distance, is likely to be quite variable. Calculating actual distance travelled for this sample would have required the collection of substantial historical route data, and was not practicable. A more practical alternative is using GIS software to calculate the shortest feasible rout between two points, drawing on road network information. Unfortunately, road 81 network data for the Johannesburg-Soweto area was prohibitively expensive, which meant that this alternative could not be pursued for this project. As road network and similar data becomes increasingly available, however, this is an avenue that could be valuably explored. For the purposes of this project, however, all distances are straight line distances. These are calculated using the Haversine formula applied to the GIS coordinates of the child‘s home and the child‘s school (Sinnott 1984). The Haversine formula is calculated as follows, where R is the radius of the earth in kms: dlon = lon2 - lon1 dlat = lat2 - lat1 a = sin^2(dlat/2) + cos(lat1) * cos(lat2) * sin^2(dlon/2) c = 2 * arcsin(min(1,sqrt(a))) d = R * c 3.6.2 An area based definition of mobility The second approach to the operationalization of learner mobility draws on tests of whether the learner attends school in the same area, using a range of definitions, in which he or she lives. This is motivated by the concern that in some cases, a child‘s ‗local‘ school may not be the closest school, but the school that is located in the same community in which a child lives. Additionally, in some cases, barriers such as rivers, hills, busy roads or train tracks may mean that a child is cut off from the school that is closest to him or her on the basis of straight-line distance. In these cases, it would also be more natural for a child to attend a school that is slightly further away, but is located in the same geographic community. The various definitions of area which are used in this thesis are those developed for Census 2001, and discussed previously. For each of these levels, 82 SAL, SP, MP and MN, a binary indicator was created for each of 1997 and 2003, coded one if the child lived and attended school in the same area, and zero if the child did not live and attend school in the same area. If the child is mobile at the smallest level of geography, the SAL level, all other tests will also categorize a child as mobile. Similarly, if a child is mobile at the SP level, the MP and MN tests will also categorize him or her as mobile. 3.6.3 Mobility defined by attendance at the nearest school The final operationalization used for mobility is based on whether or not the child is attending his or her nearest grade-appropriate school. The two approaches to measuring mobility discussed so far tend to focus on those learners who are travelling particularly substantial distances. While travelling a substantial distance is an important indicator of the amount of effort and money invested in school choice, and certainly identifies the forms of school choice that are most salient in a gradually integrating post-apartheid South Africa, it does not reveal much about individuals who may only be able to participate in school choice at a relatively local level. One way to measure mobility without losing these individuals is to determine whether a child is attending the age-appropriate school nearest to their home, or whether they are choosing to travel slightly further to attend a different school. This is a particularly important aspect of mobility and school choice to explore in an area such as Soweto, where the density of schools is extremely high, with most children living within easy walking distance of more than one school. Obviously, the figures obtained using this operationalization will be an imprecise reflection of school choice. One particular concern is around those children who are explicitly choosing to attend the school closest to their home, and not simply attending it because it is closest. The extent of this phenomenon is unfortunately not measurable with the existing data, and if it is substantial, would mean that levels of school choice are actually higher than is reflected in the available data. A second concern is around the possibility that there may be 83 children who would like to attend their nearest school, but are unable to do so, perhaps because the closest school is over-enrolled, the child is discouraged by higher school fees at this school, or the child is (illegally) refused admission due to poor academic performance. In these cases, this definition would suggest that the child is engaging in school choice and mobility, when in fact the child is actively being prevented from exercising their choice. Thirdly, it is possible that for reasons of geography, the closest school on the basis of a distance calculation is not actually the closest school for a child on foot. In this instance, it is possible that children who are simply attending the nearest school on the basis of available roads are being misclassified as engaging in school choice and mobility. Nonetheless, this nearest-school analysis should provide a more accurate reflection of the extent of school choice when local school choice is included, than any other definitions explored this far. In order to measure the proportion of children attending their nearest age- appropriate school, a spatial join was conducted in gvSIG to identify, for each child, the relevant school nearest to their home. This was done twice, firstly for all schools, both public and private, and secondly using only public schools. For 1997, only primary, intermediate and combined schools were included in the analysis. For 2003, both the nearest primary phase school and the nearest secondary phase school was calculated. Then, on the basis of the child‘s grade, the most appropriate of these two schools was selected. Once the nearest grade-appropriate school for each child, at each time point, was obtained, this school was then compared to the school actually attended by the child at that time point. A binary indicator was created and coded one when the child did attend his or her nearest grade-appropriate school, and zero when the child did not attend that school. 84 3.7 Analysis Data management was conducted using Microsoft Access. GIS analysis was done with gvSIG, and statistical analysis with Stata Standard Edition 11. The analytical procedures followed to generate each set of findings are described in the relevant analytical chapters. In general, however, as most variables were non-normally distributed, non-parametric analyses were used. In particular, analyses involving only categorical variables were conducted with chi-square tests, unless the numbers in any category dropped below 5, in which case a Fisher exact test was used. Analyses involving both categorical and continuous variables were conducted using Mann-Whitney Wilcoxon rank-sum or Kruskal-Wallis tests, and analysis involving only continuous variables was conducted using Spearman rank correlation. For multivariate analysis, a multiple regression approach was used. Distributions are displayed as kernel density plots, which show a smoothed curve based on the estimated non- parametric probability densities of a variable‘s distribution (Scott 1992; van der Berg, Wood et al. 2002). Chapter 4, the first results chapter, presents a range of descriptive data, covering sample representativeness and bias, and properties of schools attended by sample members in relation to the universe of schools available to them. Chapter 5 documents the overall extent of mobility using each of the three different operationalizations detailed earlier in this chapter. In Chapter 6, the relationship between mobility behaviour and characteristics of learners, their families and households, and their communities is explored. Chapter 7 details the relationship between learner mobility and properties of the schools that children attend, and the schools that are closest to their homes. Chapter 8 explores the extent to which mobility behaviours are subject to change over time. Finally, Chapter 9 uses the findings of all previous chapters to generate preliminary models of learner mobility. Chapter 10 summarizes and discusses the findings, and serves as a conclusion to the thesis. 85 3.8 Conclusion This chapter has outlined the methodological approach taken in this thesis, that of quantitative secondary data analysis, as well as the rationale that lay behind this choice. It has documented the range of datasets that were considered for use in the study, and justified the selection of the Bt20 longitudinal cohort study as the most appropriate. The composition of this cohort, particularly with respect to demographic variables, SES, and residential mobility were discussed. Existing data was used to demonstrate that while the overall Bt20 cohort does under-represent the most advantaged and disadvantaged children in Soweto-Johannesburg, and particularly those in minority racial groups, it remains highly representative of black low and middle-income township residents – the group of primary interest to this thesis. The creation of the sub- sample of the Bt20 cohort used in this thesis was explained, and the tests for sample representativity and bias, which are will be presented in Chapter 4, were outlined. Finally, the three different approaches to the operationalization of learner mobility that are used in this thesis were presented, and approaches to data analysis were described. 86 Chapter 4: Sample descriptive statistics and representativity 4.1 Introduction This chapter begins by providing descriptive statistics for the sample with respect to the variables hypothesised to have a relationship to engagement in learner mobility. It then moves on to ask whether the study sample is representative firstly of the Bt20 cohort as a whole, and secondly of the youth population of Johannesburg-Soweto metropolitan area more broadly. These questions around representativity are answered both through a range of statistical tests, and through a discussion of the process by which the Bt20 cohort and the study sample were created. Finally, to provide some context for the discussions of school choice to follow in subsequent chapters, some basic descriptive statistics for schools in the Gauteng province (within which the study area falls) are presented. 4.2 Sample descriptive statistics This section presents descriptive statistics for key child, household and community variables for which there are theoretical grounds to anticipate a relationship with educational mobility. At the child level, race, gender, age at first school enrolment, school phase in 2003, and grade repetition between 1997 and 2003 are explored. At the household level, maternal education, maternal marital status, and household SES in both 1997 and 2003 are considered. Finally, at the community level, the poverty of the area in which the child lives is documented, for three different levels of geography. The relationship between these variables and mobility behaviour will be tested in Chapter 6. 87 4.2.1 Child level variables Race As noted previously, the study sample is mostly black. While coloured children are reasonably well represented, white and Indian children are under- represented, and their numbers are also extremely small. The exact breakdown of study sample members across race groups is presented in Table 4.1 below. Race Black African White Coloured Indian Number of children (n=1428) 1,145 (80.18%) 41 (2.87%) 192 (13.45%) 50 (3.50%) Table 4.1: Breakdown of study sample members by race Gender The study sample is approximately evenly split between males and females (see Table 4.2 below). Gender Male Female Number of children (n=1428) 711 (49.79%) 717 (50.21%) Table 4.2: Breakdown of study sample members by gender Age at first school enrolment Overall, a very slight majority of sample members enrolled in school for the first time either early or on time for their age, while a slight minority enrolled late (see Table 4.3 below). The extent of late enrolment, at over 47%, is striking. However, as noted in Chapter 3, although the late-starters being their schooling a year later than their peers, the majority of them do not start their schooling outside of the two-year window for enrolment specified by policy. Age at first enrolment Earlier Later Number of children (n=1275) 673 (52.78%) 602 (47.22%) Table 4.3: Breakdown of study sample members by age at first school enrolment 88 Schooling phase in 2003 Table 4.4, below, illustrates that by 2003, just under one third of children had progressed to high school, while just over two thirds remained in primary school. A sample member who started their primary schooling on time, and who had not repeated a grade, would be expected to have reached high school by 2003, whereas those who started late, or who had repeated a grade, would typically not be expected to have reached high school. Schooling phase (03) Primary High Number of children (n=1330) 897 (67.44%) 433 (32.56%) Table 4.4: Breakdown of study sample members by phase of schooling in 2003 Grade repetition As shown in Table 4.5 below, slightly more than one third of children repeated a grade between 1997 and 2003, while the remainder did not. This figure is similar, though slightly higher than that reported for other work on the Bt20 cohort (Fleisch and Schindler 2009). Grade repetition between 1997 and 2003 No repetitions One or more repetitions Number of children (n=1240) 778 (62.74%) 462 (37.26%) Table 4.5: Breakdown of study sample members by whether or not they have repeated at least one grade between 1997 and 2003 4.2.2 Household level variables Maternal education The distribution of maternal educational levels is shown in Table 4.6 below. The largest proportion of mothers have completed some secondary school, while relatively few are have grade 5 education or less, which is equivalent to functional illiteracy. The proportion with post-school education is also low. 89 Maternal Highest Completed Education Level Grade 5 or below Grade 6 or 7 Grade 8, 9 or 10 Grade 11 or 12 Post-school education Number of children (n=1305) 86 (6.59%) 94 (7.20%) 610 (46.74%) 399 (30.57%) 116 (8.89%) Table 4.6: Breakdown of study sample by highest level of maternal education attained at the time at which the study sample member was born Maternal marital status Slightly over one third of mothers were married at the time of the birth of study sample member, while just less than two thirds were unmarried (see Table 4.7 below). Maternal marital status in 1990 Married Unmarried Number of children (n=1418) 506 (35.68%) 912 (64.32%) Table 4.7: Breakdown of study sample members by maternal marital status Household SES: 1997 The grouping of households into quintiles on the basis of SES in 1997 is shown in Table 4.8 below. Due to several clusters of households with similar scores, it was not possible to create completely even quintiles. Household SES quintile 1997 1 (poorest) 2 3 4 5 (wealthiest) Number of children (n=1205) 254 (21.08%) 240 (19.92%) 233 (19.34%) 246 (20.41%) 232 (19.25%) Table 4.8: Breakdown of study sample by household SES in 1997 90 Household SES: 2003 Similarly, the household SES quintiles for 2003 are also not completely even, as evident in Table 4.9 below. Additionally, the small proportion of sample member for whom SES data is available for 2003 should be noted. Household SES quintile 2003 1 (poorest) 2 3 4 5 (wealthiest) Number of children (n=887) 181 (20.41%) 174 (19.62%) 179 (20.18%) 177 (19.95%) 176 (19.84%) Table 4.9: Breakdown of study sample members by household SES in 2003 4.2.3 Community level variables Small Area Level poverty An index of community poverty was calculated for each level of census geography, as described in Chapter 3. Households were broken down into quintiles on the basis of poverty level of the SAL in which they lived. The distribution of households across the quintiles of SAL poverty level is illustrated in Table 4.10 below. SAL poverty quintile 1 (lowest poverty) 2 3 4 5 (highest poverty) Number of children (n=1399) 280 (20.01%) 280 (20.01%) 280 (20.01%) 281 (20.09%) 278 (19.87%) Table 4.10: Breakdown of study sample members by the poverty level of the SAL in which they live Sub Place poverty Table 4.11, below, shows the distribution of households across quintiles based on the poverty level of the SP in which they are located. 91 SP poverty quintile 1 (lowest poverty) 2 3 4 5 (highest poverty) Number of children (n=1399) 284 (20.30%) 287 (20.51%) 288 (20.59%) 273 (19.51%) 267 (19.09%) Table 4.11: Breakdown of study sample members by the poverty level of the SP in which they live Main Place poverty Due to the small number of MPs represented in the data, with most sample members concentrated in just a few MPs, attempts to create poverty quintiles based on this level of geography were unsuccessful. The creation of tertiles was slightly more successful, and is illustrated in Table 4.12 below, although substantial clustering is still evident. MP poverty tertile 1 (lowest poverty) 2 3 (highest poverty) Number of children (n=1399) 491 (35.07%) 776 (55.43%) 133 (9.50%) Table 4.12: Breakdown of study sample members by the poverty level of the MP in which they live 4.3 Relationships between variables Tests were conducted to explore the relationships between each of the variables discussed above within the study sample. All relationships operated in the expected directions, and are documented in Appendix C. 4.4 Study sample representativity Understanding the representativity of the study sample consists of two different elements. Firstly the representativity of the full Bt20 cohort with regards to the broader population of similarly aged children in the 92 Johannesburg-Soweto area needs to be understood. This requires a discussion of the initial sample composition, as well as of subsequent sample attrition. Secondly, the representativity of the study sample, with regards to the full, non-attrition Bt20 cohort must be explored. This section begins by describing Bt20 cohort composition and attrition, and implications for cohort representativity. This is followed by series of tests to determine whether the decision to limit the study sample to only those children who did not change residential address between 1996 and 2004 introduced any additional representativity concerns. 4.4.1 How representative is Birth to Twenty? Cohort composition The Bt20 study enrolled and collected longitudinal data on a birth cohort of 3273 singleton children, born to mothers resident in the Johannesburg-Soweto area between April 28 and June 8, 1990. These children have been followed up 16 times to date, meaning that data is available for them at a range of points throughout their schooling. Along with data regarding home situation, caregivers, and a range of health and psychometric measures, Bt20 data relevant to schooling includes each child‘s school‘s name, grade, repetition, drop out, and academic performance as captured on standardized tests and by the children‘s school reports. Although the quality and depth of the data collected by Bt20 is high, and attrition is generally low, concerns about how representative the Bt20 cohort is of Johannesburg-Soweto children and youth, particularly over time, remain important. For this study, this is particularly relevant when the cohort may differ from the broader population with respect to variables such as SES that are expected to have a substantial influence on schooling choices and learner mobility. The Bt20 cohort was intended to consist of all the singleton children born in the Johannesburg-Soweto area between 23 April and 8 June, 1990. For various 93 reasons, however, not all eligible children were actually enrolled. A subsequent study undertaken to identify these ‗missing‘ children – children eligible to be in the cohort but who had either never been identified by the study, or had been identified but had not enrolled– revealed that non-enrolment was largely for two reasons. Firstly, for practical and resource-related reasons, study recruitment was concentrated in public sector health facilities. This meant that members of the more advantaged groups, who typically used private facilities, were less likely to come into contact with study recruiters, and were therefore less likely to be enrolled (Richter, Norris et al. 2004). Secondly, a number of individuals who had initially been identified by study recruiters either declined to enrol, or agreed to study participation but could not subsequently be traced for full enrolment. Reasons for non-enrolment included mobility, often combined with incorrect or incomplete address data, incorrect or incomplete recording of caregiver‘s names, particularly when multiple names were in use, and participant concerns about invasion of privacy or about participation being overly time-consuming. The majority of eligible children who did not enrol again came from relatively affluent backgrounds compared to the rest of the cohort, although some were also relatively disadvantaged (Richter, Norris et al. 2004; Richter, Panday et al. 2009). As a result of these two sets of factors, the initial Bt20 cohort underrepresented white and Indian children, along with more affluent children more generally, but was largely representative of the predominantly black African population of similarly aged children living in the area in 1990. This under-representation of more advantaged children, particularly white and Indian, may result in an under-representation of those children who live very close to high quality schools, and therefore do not experience any pressure to travel to attend schools further afield. It may also, however, result in an under-representation of those children who are able to travel particularly great distances in order to access the most desirable schools. It is therefore unclear how, in aggregate, the under-representation of more affluent children is likely to influence the 94 outcomes of this study. Given, however, that the population of primary interest to this study is the largely black African lower-middle-class population of Johannesburg-Soweto, and that the cohort was representative in this regard, these concerns are not of substantial significance. The relatively small size of the affluent population of Johannesburg-Soweto, as well as of the white and Indian populations, also means that any impact that their under-representation is likely to have on study outcomes will be fairly minor. A second concern relating to overall Bt20 cohort composition relates to the fact that, with the passage of time, the population of the greater Johannesburg- Soweto area has changed. This means that regardless of how representative the cohort was when it was launched in 1990, over time it is likely to have become less representative of the population of same-aged children actually living in the area. This is, of course, a concern with any cohort study, but while it is not specific to Bt20, Bt20 might be particularly seriously affected due the historical era which the study covers. The post-Apartheid era, during which the Bt20 participants grew up and attended school, has been characterized by substantial changes in the residential patterns of people, including children and youth. In part, this has been a response to the demise of Apartheid‘s strictly enforced segregationist residential rules, in which black South Africans were only permitted to live in urban areas if they were employed there. The perception that people living in urban areas, and particularly in Gauteng, are better off than those in rural areas, particularly with regards to access to services and economic resources, combined with the repeal of Apartheid-era segregationist policies, has triggered a substantial influx of new residents to the Soweto-Johannesburg area during the past 20 years (Richter, Norris et al. 2006). A sub-study of Bt20, the 2002 Children‘s School Survey, collected data on all children born between April 23 and June 8 1990 enrolled at 81% of the primary schools in the greater Johannesburg-Soweto area (Richter, Norris et al. 95 2006; Richter, Panday et al. 2009). With a 92% response rate, detailed demographic data was collected on 5367 children. Almost half of these children had not been born in the greater Johannesburg-Soweto area, indicating a substantial level of in-migration amongst children of this age. Analysis of this data revealed significant differences on a number of key indicators between those children who had been born in the greater Johannesburg-Soweto area and those who had moved there at a later point. Generally, in-migrating children appeared to be living in more adverse circumstances and were at greater risk of poverty than their peers who had been born in the area. They were less likely to live in formal housing, and their parents were less likely to own their dwellings. They had lower levels of ownership of most household assets, and poorer access to basic household services such as running water, electricity, sanitation, and refuse removal. Parents of in-migrating children were more likely to be unemployed, and a higher proportion of those who were employed were in unskilled employment. The in-migration of children appeared to be closely connected to school attendance, with the majority of children migrating into the area doing so before commencing their schooling. In-migrating children were also more likely to have started their schooling late, although no evidence of any impact of this delayed start on academic performance in 2002 was found (Richter, Norris et al. 2006). While we don‘t have access to information about out-migration, it is likely that the effects of this on the population have been captured fairly well by attrition from the cohort, which is discussed in the next section. From the information collected from the Children‘s School Survey, however, it is clear that the children migrating into the Johannesburg-Soweto area do tend to differ in terms of their home environments from those children born in the area, and represented in the Bt20 cohort. Specifically, cohort members are likely to be more advantaged, and therefore may be more likely to have access to the necessary resources to participate in school choice and learner mobility. As such, we may again expect to see higher levels of learner mobility in the 96 cohort than would be found in the overall population of same-aged children in Johannesburg-Soweto. Given, however, that we know very little about how income shapes learner mobility, it is not possible to estimate the scale of this effect. It appears, nonetheless, that our cohort remains largely representative of the lower-middle-income black people who make up the bulk of the population of Johannesburg-Soweto. Finally, as is the case with any cohort study, it cannot be assumed that the findings for children born in 1990 can necessarily be extrapolated to children born at other points in time. For the same reasons as those outlined above, relating to the political and social transformations that South Africa has undergone during the lives of the cohort members, it is also likely that Bt20 is particularly sensitive to these sorts of changes. Nonetheless, given the strengths of the cohort, and the absence of alternative sources of data, it remains the best source from which to derive findings relevant to primary school children in contemporary South Africa. Additionally, it seems likely that the period during which Bt20 cohort members were attending primary school was the period during which new, post-Apartheid patterns of school enrolment were defined and stabilized. As a result, the data presented documents a critical period in the evolution of South Africa‘s post-Apartheid schooling system, and can probably, with some caution, be extrapolated to more recent points in time. Bt20 cohort attrition Changes in the composition of the cohort over time, predominantly due to the non-random attrition of participants, will also affect the extent to which it is representative of the population that it was designed to represent. Attrition in Bt20, while low for a study of this length and magnitude, is known to be related to certain variables such as race and socio-economic status (Richter, Norris et al. 2004; Richter, Norris et al. 2007; Ginsburg, Norris et al. 2009). As these variables are expected to be related to learner mobility and school choice 97 more broadly, it is important to explore the nature of this attrition, along with the ways in which it might influence study findings. As mentioned previously, the Bt20 study covers a particularly eventful era in South African history, during which the Apartheid system of controls and regulations governing where people could live, work and be educated was dismantled, with huge implications for population distribution in South Africa. Along with in-migration, as discussed above, the Gauteng province was also affected by out-migration, particularly amongst women and young children, driven by political tension, and overstretched public services. High levels of circular mobility between urban and rural areas, along with mobility within urban areas, became features of the Gauteng province. Migration out of the study area, and as well as mobility within it, caused attrition of cohort members from the study. Attrition has also been caused by maternal or child death, child abandonment or adoption, and study fatigue (Richter, Norris et al. 2004). Even in times and areas of relative stability, the maintenance of a longitudinal sample is difficult, and the internationally accepted norm for sample attrition is between 10 and 20% per annum. In the early post-Apartheid South African context, the Bt20 study was greatly challenged to find ways to maintain the birth cohort, and minimize attrition (Richter, Norris et al. 2004). Using a combination of approaches at both the community and individual level, Bt20 succeeded in keeping sample attrition at an extremely low level, averaging below 3% per year. At the community level, efforts included cooperation with a Community Advisory Board, the use of local fieldworkers, and strict adherence to ethical guidelines including confidentiality, to build a strong relationship with participant communities, and by extension, trust. The provision of some limited social and health services to cohort members also encouraged them to maintain contact. At the individual level, participants were 98 contacted regularly by post, telephone and in person, and were followed up extensively if contact was lost. While attrition has been greatly limited by the efforts described above, it has nonetheless been non-random in nature. Most notably, attrition has been substantially higher among white participants, as well as participants with higher socio-economic status, exacerbating the existing under-enrolment of these groups (Richter, Norris et al. 2004). By contrast to more advantaged groups, retention amongst more vulnerable members of the sample has been extremely high, with black African mothers and their children being particularly likely to remain in the study. Ginsburg et al. (2009) present a number of mobility-related analyses comparing all cases lost to the study prior to 2005, and those cases remaining in the cohort. The attrition cases had experienced significantly higher frequency of residential movement, with 81.3% of them having moved at least once by 2005, and 13.3% moving within any of the documented intervals. By contrast, amongst children remaining in the cohort, only 55.5% had experienced any residential movement. The attrition group also contained significantly more white participants, those born in private hospitals, and those residing in the inner city or suburbs, as opposed to the townships. Mothers in the attrition group were more likely to have been married at the time of child‘s birth, and to have either no formal education, or to have attained some level of post-school education. As suggested by the maternal education information, attrition children were also more likely to live in particularly highly or poorly resourced households. These finding echo expectations that the most advantaged children, particularly white children and those living in the most affluent areas are likely to be underrepresented, as are the most disadvantaged children. As neither of these two groups are likely to be participating substantially in learner mobility (the disadvantaged due to inability, the advantaged due to living close to good schools), we may again expect that 99 figures on mobility are likely to be somewhat inflated for urban populations as a whole. However, as attrition amongst the predominantly African, township dwelling majority of the cohort has remained low, we can expect results fairly representative of this particular population, which is fortunately the group of most interest for this particular project. 4.4.2 How representative is the study sub-sample? As documented in Chapter 3, unanticipated problems with the residential GIS coordinate data, requiring a substantial and time-consuming cleaning process, meant that it was infeasible to use the full, non-attrition Bt20 cohort for this thesis, as had originally been planned. Of the initial Bt20 cohort of 3273, 66% (n=2158) completed a residential history questionnaire in 2005 or 2006 (Ginsburg, Norris et al. 2009), and forms the cumulative non-attrition cohort as of 2006. Of this group, 1470 individuals reported no changes in residential address during this period. Once children attending special schools or boarding schools, those attending schools outside of the Gauteng province, and those who were not attending school at all were removed from this group, along with a small number of children who were resident at multiple addresses during the study period, this left 1428 individuals. The data on these 1428 individuals forms the basis for all analysis presented in this thesis. While not ideal, the decision to focus on this sub-sample of cohort members was made to maximize the available sample size in light of problems with the residential GIS data which required a lengthy cleaning procedure. However, as residential stability between 1996 and 2004 appears likely to be closely related to SES and other variables which may influence school choice, it is important to measure and document the differences between this non-random sub-sample and the full, non-attrition cohort, and to think about how this is likely to influence study findings. Given that residential mobility levels are greatest amongst the most advantaged and most disadvantaged sectors of the population (Ginsburg, Norris et al. 100 2009), a sub-sample constructed primarily on the basis of mobility behaviours is clearly unlikely to be representative of the full sample from which it is drawn. Ginsburg et al. (2009) provide some valuable data on the differences between Bt20 participants who had experienced some residential mobility prior to 2005, and those who had not. While these figures are valuable as indicators, it should be noted that the sub-sample used in this thesis excludes only those children who experienced mobility between 1996 and 2004, and not those who moved earlier or later. The highest levels of mobility, however, were seen in the earliest years, with the commencement of primary schooling typically having a stabilizing effect on children‘s residence. For this reason, the differences between residentially mobile and non-mobile children presented in Ginsburg et al (2009) are likely to be somewhat more substantial than the differences between those children included and excluded from the sample used in this thesis. Additionally, sample construction seems less likely to have influenced representativeness for black African children with mid-range SES levels, the population of primary interest in this study. In order to better understand the nature of the sub-sample used in this thesis, and in particular the ways in which included cases may differ from the excluded, I compare both groups of cases. Firstly, I compare the members of the study sub-sample (n=1428) to all cohort members not included in the sub- sample (n=1845). This provides an indication of how different this sub-sample is from the full Bt20 cohort, including all those individuals lost to contact, as well as those excluded from this sub-sample for any other reasons. Secondly, I focus on the group of cohort members not lost to attrition (n=2158), and contrast those included in my sub-sample (n=1428) with those excluded from my sub-sample (n=730). This provides an indication of how different my sample is from those members of the full Bt20 non-attrition cohort who were not included in this study. 101 Thesis sub-sample compared to all excluded cohort members For SES, and each available demographic variable, the group of individuals included in the study sub-sample was compared to the group of all cohort members excluded from the sub-sample for any reason. Chi-squared tests were conducted to determine whether the distribution of individuals was significantly different across groups for each of the variables. As evident in Table 4.13, below, the included and excluded cohort members differed to a statistically significant degree on all tested variables, with the exception of gender. Importantly, and as hypothesized, the different distributions support the contention that the study sample appears to under-represent those at either extreme of the socioeconomic scale. Under-representation of the historically more advantaged race groups, those born in private hospitals, and outside of the Soweto-DiepMeadow area all suggest that the most advantaged are likely to be underrepresented, but can provide little information about the most disadvantaged section of the cohort. However, an examination of the SES data, as well as the data on maternal educational level suggests that the most disadvantaged are also underrepresented. This is clearest with regards to the SES variable, which shows that the study sample is biased towards the 3 middle quintiles, while the group of excluded individuals contains a higher proportion of individuals falling into the first (poorest) and fifth (wealthiest) quintiles. As discussed previously, the exclusion of the most advantaged and disadvantaged, who for various reasons are hypothesized to be least likely to engage in learner mobility, may lead to somewhat inflated findings regarding levels of learner mobility. However, given that the study sample does appear to be reasonably representative of the black, township-based middle-class, the group which is of greatest interest in this examination of school choice behaviours, this is not anticipated to be likely to be a major problem. 102 Variable Value Included # (% of included) Excluded # (% of excluded) Chi-squared results Child gender Male 711 (49.79%) 880 (47.70%) χ2(1) =1.412 not significant n=3273 Female 717 (50.21%) 965 (52.30%) Race Black 1145 (80.18%) 1423 (77.13%) χ2(3) =55.307 p<0.001 n=3273 Coloured 192 (13.45%) 191 (10.35%) Indian 50 (3.50%) 65 (3.52%) White 41 (2.87%) 166 (9.00%) Maternal age 18 or younger 162 (11.35%) 180 (9.76%) χ2(2) =20.733 p<0.001 n=3271 19-34 1084 (75.96%) 1511 (81.94%) 34 or older 181 (12.68%) 153 (8.30%) Place of birth Soweto/DiepMeadow 1134 (79.41%) 1295 (70.19%) χ2(3) = 135.345 p<0.001 n=3273 Historically Indian/Coloured area 221 (15.48%) 211 (11.44%) Inner city JHB 5 (0.35%) 64 (3.47%) Suburban JHB 68 (19.83%) 275 (14.91%) Hospital of birth Public 1255 (87.89%) 1576 (85.47%) χ2(1) = 4.038 p<0.05 n=3272 Private 173 (12.11%) 268 (14.53%) Maternal marital status Married 506 (35.68%) 696 (37.97%) χ2(3) = 42.236 p<0.001 n=3251 Cohabiting 53 (3.74%) 160 (8.73%) Separated/Divorced/Widow ed 28 (1.97%) 19 (1.04%) Single 831 (58.60%) 958 (52.26%) Maternal highest educational level No formal education 13 (1.00%) 34 (2.09%) χ2(3) = 23.812 p<0.001 n=2932 Primary schooling 167 (12.80%) 241 (14.81%) Secondary schooling 1009 (77.32%) 1140 (70.07%) Post-school education 116 (8.89%) 212 (13.03%) SES at birth Quintile 1 (most poor) 216 (18.56%) 393 (27.14%) χ2(4) = 36.803 p<0.001 n=2612 Quintile 2 215 (18.47%) 223 (15.40%) Quintile 3 286 (24.57%) 284 (19.61%) Quintile 4 238 (20.45%) 251 (17.33%) Quintile 5 (least poor) 209 (17.96%) 297 (20.51%) Table 4.13: Differences between cohort members included in the study sample, and those excluded from the study sample with regards to all available demographic variables collected at birth Thesis sub-sample compared to other non-attrition cases: historical data This second analysis explores the extent to which the study sub-sample differs from the group of cohort members from which it is drawn; that is, the full non- attrition sample. This captures the way in which those cases excluded because 103 the children moved home between 1996 and 2004 differ from those who did not move during this period. The same 1990 data was used for this set of analyses as in the section above. In particular, the SES estimates and quintile allocations for each case were not re-calculated, but were used as generated in the previous set of analyses, on the basis of the data from the full cohort. The results are presented in Table 4.14, below. Variable Value Included # (% of included) Excluded # (% of excluded) Chi-squared results Child gender Male 711 (49.79%) 342 (46.85%) χ2(1) = 1.672 not significant n=2158 Female 717 (50.21%) 388 (53.15%) Race Black 1145 (80.18%) 601 (82.33%) χ2(3) = 3.442 not significant n=2158 Coloured 192 (13.45%) 86 (11.78%) Indian 50 (3.50%) 18 (2.47%) White 41 (2.87%) 25 (3.42%) Maternal age 18 or younger 162 (11.35%) 92 (12.62%) χ2(2) = 19.940 p<0.001 n=2156 19-34 1084 (75.96%) 590 (80.93%) 34 or older 181 (12.68%) 47 (6.45%) Place of birth Soweto/DiepMeadow 1134 (79.41%) 584 (80.00%) χ2(3) = 9.950 p<0.05 n=2158 Historically Indian/Coloured area 221 (15.48%) 90 (12.33%) Inner city JHB 5 (0.35%) 7 (0.96%) Suburban JHB 68 (4.76%) 49 (6.71%) Hospital of birth Public 1255 (87.89%) 642 (88.07%) χ2(1) = 0.015 not significant n=2157 Private 173 (12.11%) 87 (11.93%) Maternal marital status Married 506 (35.68%) 247 (34.12%) χ2(3) = 3.144 not significant n=2142 Cohabiting 53 (3.74%) 31 (4.28%) Separated/Divorced/ Widowed 28 (1.97%) 8 (1.10%) Single 831 (58.60%) 438 (60.50%) Maternal highest educational level No formal education 13 (1.00%) 4 (0.60%) χ2(3) = 7.886 p<0.05 n=1971 Primary schooling 167 (12.80%) 64 (9.61%) Secondary schooling 1009 (77.32%) 521 (78.23%) Post-school education 116 (8.89%) 77 (11.56%) SES Quintile 1 (most poor) 223 (19.16%) 135 (22.06%) χ2(4) = 6.8454 not significant n=1776 Quintile 2 253 (21.75%) 104 (16.99%) Quintile 3 241 (20.70%) 137 (22.39%) Quintile 4 238 (20.45%) 122 (19.93%) Quintile 5 (least poor) 209 (17.96%) 114 (18.63%) Table 4.14: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to variables collected at birth 104 The results of this second set of analyses suggest that the study sub-sample, selected on the basis of not having changed residence between 1996 and 2004, while differing from the full non-attrition sample in some regards (maternal age, place of birth, and maternal education), is not significantly different in others. Additionally, for those variables which are significantly different, the levels of significance are lower, with only maternal age remaining significant at p<0.001. These results suggest that children with mothers under the age of 35 were significantly more likely to be excluded from the study sub-sample than those with older mothers. Children with mothers with particularly high levels of education were also significantly more likely to be excluded from the sub-sample, as were children born in the typically more affluent suburban areas of Johannesburg. By contrast, children born in the historically Indian and coloured areas were particularly likely to be included in the sub-sample. While these figures do suggest that more advantaged children may be somewhat under-represented in the study sub-sample, compared to in the non- attrition sample, the lack of any statistical significance on this variable suggests that any genuine differences are likely to be fairly minor. The absence of any significant difference on race, hospital of birth, and maternal marital status is also encouraging. It therefore seems reasonable to conclude that while the composition of the thesis sub-sample selection may under-represent both the most advantaged and disadvantaged children, this effect is not as substantial as that anticipated on the basis of Ginsburg et al. (2009), and is certainly less severe than that caused by sample attrition. Thesis sub-sample compared to other non-attrition cases: 1997 and 2003 data A second question with regards to study sample representativity is whether, despite initial similarities, those included in and excluded from the study sample have changed over time in systematically different ways. Ideally, one 105 would also want to ask this question with regards to emergent differences between the study sample and the full cohort, but this is not feasible as, due to attrition, data for later time points is not available for all cohort members. For this reason, this second exploration of potential sample bias is restricted to testing for differences between the study sample and those non-attrition sample members excluded from it. SES data is available for both 1997 and 2003, and this is used to test whether the study sub-sample differs from the non-attrition sample in this regard at each of these time points. Socio-economic status in 1997 Full SES data for 1997 was available for 1758 of the 2158 cases in the non- attrition sample, and this is the data that was used for the following analyses. It is worth noting that cases removed from the study sub-sample due to residential mobility were substantially more likely to be missing SES data (χ2(1)= 23.828; pr<0.001) than those that were retained in the study sub-sample (see Table 4.15 below). While this makes sense, in that children who were mobile were probably harder to locate during any particular round of data collection, the implications of this difference in levels of missing data for the validity of the following analysis are not clear. Included: n (% of included) Excluded: n (% of excluded) Yr7 SES data available 1205 (84.38%) 553 (75.75%) Yr7 SES data missing 223 (15.62%) 177 (24.25%) Table 4.15: Availability of 1997 SES data for members of the non-attrition sample included in and excluded from the study sample As described in Chapter 3, PCA was used to estimate SES scores for each individual using asset ownership data (see Table 3.1). These scores were then used to generate poverty quintiles. Distribution across the quintiles differed significantly between those included in the study sample, and those excluded from the study sample on the basis of residential mobility (see Table 4.16 106 below). Substantially more cases in the very lowest quintile were removed in generating the study sample, while a relatively lower proportion of cases in the other quintiles were removed. This suggests that the study sample includes a somewhat higher proportion of children living in middle-class and affluent families in 1997 than the full non-attrition sample. Reasons why this might be the case are not obvious, but may relate to differences in the timing of mobility for families with different levels of SES, as only children who moved during their primary school years were excluded. Variable Value Included # (%) Excluded # (%) Chi-squared results SES quintile 1 (poorest) 254 (21.08%) 160 (28.93%) χ2(4) = 14.587 p<0.01 n=1758 2 204 (16.93%) 87 (15.73%) 3 261 (21.66%) 97 (17.54%) 4 236 (19.59%) 108 (19.53%) 5 (least poor) 250 (20.75%) 101 (18.26%) Table 4.16: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to SES in 1997 Socio-economic status in 2003 As described in Chapter 3, SES for 2003 was estimated using an assets index collected during study year 12, and housing quality data collected during study year 13 (see Table 3.1). Following manual imputation of missing values, complete data was available for 1296 individuals, or approximately 60% of non-attrition cases. PCA was used to estimate an SES variable for each sample member. These were additionally used to categorize individuals into 5 poverty quintiles. The extent to which data was missing for cases included in the study sample, and for the non-attrition cases excluded, were compared (see Table 4.17 below). Once again, a substantially higher proportion of those excluded from the study sample on the basis of residential mobility were missing SES data (χ2(1) = 7.462; pr<0.01). 107 Included in study sample: n (% of included) Excluded from study sample: n (% of excluded) 2003 SES data available 887 (62.11%) 409 (56.03%) 2003 SES data missing 541 (37.89%) 321 (43.97%) Table 4.17: Availability of 2003 SES data for members of the non-attrition sample included in and excluded from the study sample A chi-squared analysis was conducted to test whether the distribution of cases across the SES quintiles was different for those participants included in the study sample, and those excluded (see Table 4.18 below). The test revealed no significant differences between these distributions. On the basis of these 2003 SES scores then, sample selection does not appear to have created any additional sample bias in favour of children with mid-range or high SES scores. This combines with the analyses of SES scores at other points in time to suggest that while sample selection on the basis of residential mobility may reduce the representation of the most disadvantaged and the most advantaged participants, this effect is relatively minor, and is weakest for the most contemporary data. Variable Value Included # (%) Excluded # (%) Chi-squared results SES quintile 1 (poorest) 181 (20.41%) 79 (19.32%) χ2(4) = 1.1916 not significant n=1296 2 173 (19.50%) 86 (21.03%) 3 180 (20.29%) 80 (19.56%) 4 181 (20.41%) 78 (19.07%) 5 (least poor) 172 (19.39%) 86 (21.03%) Table 4.18: Differences between members of the non-attrition sample included in and excluded from the study sub-sample, with respect to SES in 2002/2003 4.4.3 Sample selection & bias: Conclusion In summary, it seems likely that cohort enrolment, attrition over time, and the non-random selection of the sub-sample used for this thesis are likely to have each played a small role in contributing to a somewhat biased sample. In 108 particular, there seems to be some indication of an over-representation of black children, and t hose from middle-income families. By contrast, there is some under-representation of members of minority racial groups, as well as children whose families are amongst the richest or poorest 20% of the population. Overall, however, the extent of this under-representation does not appear to be extreme, particularly when the length of time for which data is available is considered. However, caution should of course be used in generalizing the findings of this study to the broader population, and particularly to children at extreme ends of the socio-economic continuum. In particular, it is possible that the level of mobility found in this cohort may be slightly higher than that found in the population more broadly, as it seems possible that children from medium-income families may be the most likely to engage in learner mobility. 4.5 Descriptive schools data: all Gauteng schools The final section in this chapter presents a range of descriptive statistics for the schools found in the Gauteng province of South Africa. As detailed in Chapter 3, the data presented here comes from a variety of time points between 2002 and 2008, but covers all registered schools, public and independent, known to have operated in the Gauteng province in the post-Apartheid era. This information is presented to provide an overview of the nature of the educational opportunities available to children growing up in the Johannesburg-Soweto area, which is essential to understanding the findings presented in subsequent chapters. Properties of schools with regards to each of the school level variables considered are described, and bivariate relationships with other school properties are described. Additional data relating solely to that subsample of schools attended by study sample members is presented in Chapter 7, as is information relating different school attributes to mobility behaviours. 109 4.5.1 School types and sectors The final schools dataset used for the analyses in this thesis contains data on 2604 schools, both public and independent, known to have operated in the Gauteng province in the post-Apartheid era. Of these, 1570 (60.29%) are primary schools, covering grades 1-7, and 289 (11.10%) are combined schools, running all the way from grade 1 through to grade 12. There are 656 (40.90%) secondary schools, covering grades 8-12, along with small numbers of intermediate schools (n=73; 4.55%), and finishing schools (n=16; 0.61%). Just below 20% of the schools in the dataset are independent schools. The majority of the combined (n=229, 79.24%) and finishing schools (n=15; 93.75%) are independent. Amongst public schools, 1406 (67.27%) are primary schools, 552 (26.41%) are secondary schools, and only 132 (6.32%) are other school types. This fact, that Gauteng contains a substantially larger number of public primary schools than public high schools will be revisited in subsequent chapters, as it relates to changes in mobility behaviour as children move from primary to high school. Of particular note is the fact that, on average, a child‘s nearest secondary school will be somewhat further away from his or her home than his or her nearest primary school. This means that, all else held constant, a child should be expected to travel somewhat further to school on enrolling at a high school. The smaller number of high schools also means that the range of schools which children are choosing between is more limited, reducing the extent of choice available to children. 4.5.2 School Quintile The quintile rating system8 applies only to public schools. As discussed in Chapter 3, it rates schools from 1 (being the poorest) to 5 (the most affluent), primarily on the basis of the community within which the school is located 8 It should be noted that despite its name, the quintile rating system does not divide either schools or learners evenly into five different groups. Very little information is publicly available as to how exactly the poverty quintile ratings for schools were arrived at, or why the quintiles are so variable in size. Available information is summarized in Chapters 2 and 3. 110 (Kanjee and Chudgar 2009). Within Gauteng province, schools are not very evenly distributed between the different quintiles, and the majority of schools are in quintiles 3 or above, which is in line with Gauteng being a primarily urban, and comparatively affluent province. Secondary schools appear to be somewhat more likely to be in higher quintiles than primary schools. The distribution of Gauteng public schools across the poverty quintiles is shown in Table 4.19 below. Quintile Primary schools (% of public primary schools) High school (% of public high schools) Total (% of all public schools) 1 n=176 (13.02%) n=42 (8.11%) n=218 (11.66%) 2 n=118 (8.73%) n=43 (8.30%) n=161 ( 8.61%) 3 n=433 (32.03%) n=145 (27.99%) n=578 (30.91%) 4 n=355 (26.26%) n=155 (29.92%) n=510 (27.27) 5 n=270 (19.97%) n=133 (25.68%) n=403 (21.55%) Table 4.19: Numbers of schools in each quintile in Gauteng province 4.5.3 Section 21 Status Section 21 status is also only relevant to public schools, and is based on 2008 data. Any public school can apply to operate as a Section 21 school, which places responsibility for the management of school finances at the school level, substantially increasing autonomy. If schools are not Section 21, their finances are operated by their provincial Department of Education, which is typically much less efficient, and can be substantially more expensive. As a result, schools typically pursue Section 21 status whenever they have any managerial capacity, although in some cases applications for Section 21 status are turned down. The large majority of the Gauteng schools on which data is available, 87%, have Section 21 status. High schools appear less likely to have Section 21 status (84%) than primary schools (90%), and a chi square test confirms that this is a significant difference (χ2(1)=13.6702, Pr=0.000). Predictably, there is a strong relationship between quintile rating and Section 21 status, with 111 more affluent schools being significantly more likely to manage their own finances (χ2(4)=35.1538, Pr=0.000). 4.5.4 School enrolment Due to the substantially larger number of public primary schools in Gauteng, it seems likely that they would tend to be smaller than high schools, even though primary schools cover 7 years of schooling, as opposed to 5 years covered by high schools. The available schools data bears this out, with the mean enrolment for primary schools in 2002 being 687, compared with 979 for secondary schools (see Table 4.20 below). Combined schools had the smallest mean size of any school type, probably because only a small proportion of them were public. Independent schools were typically smaller than public schools, with the mean size for a independent primary school at just 301 learners, while the mean for public primary schools was 711 learners. Similarly, independent secondary schools were a mean size of 344, whereas public secondary schools had a mean enrolment of 1067. The mean size overall was 525 learners, rising to 776 for public combined schools, and falling to 448 for independent combined schools. School Type Number in Gauteng Average number of learners Combined 297 620 Intermediate 76 782 Primary 1807 744 Secondary 702 1003 Table 4.20: Average number of learners for different types of schools in Gauteng Although school enrolment does vary significantly by quintile rating, according to a Kruskal-Wallis test (Pr=0.00), the nature of this relationship is not entirely clear. Quintile 2 schools are on average the largest, quintile 1 schools the smallest, and the average enrolments of schools in quintile 3, 4 and 5 are between these two extremes. This pattern holds when all schools are examined and when primary schools only are examined, but becomes less 112 extreme when only secondary schools are examined. Overall, there is no evidence that enrolment rises or falls strictly in line with quintile rating. An explanation for the particularly small size of quintile 1 schools may be their predominantly rural locations. The relationship between school size and Section 21 is clearer, with schools with Section 21 status tending to be substantially larger than those without. This may again relate to many of those schools without Section 21 status being located in rural areas, and therefore having lower enrolments. On the other hand, it may also be the case that those few schools without Section 21 status are a particularly poorly performing subset of schools, and therefore particularly unattractive to learners, leading to lower enrolment. 4.5.5 Percentage of black learners Although the mean proportion of black African learners in Gauteng schools was just over 73%, much in line with the population of the province as a whole, this figure obscures the actual distribution of black learners across the province‘s schools. When the data is broken down, it becomes clear that over 50% of schools have an enrolment that is over 99% black, while a full 10% of schools have fewer than 5% black learners. Schools that are meaningfully integrated, and representative of the racial composition of the province‘s population as a whole, are extremely rare. These figures are in line with those presented by Sujee (2004). Figure 4.1 below illustrates the distribution of the proportion of black learners across all Gauteng province schools. 113 Figure 4.1: Distribution of Gauteng schools by the proportion of their learners who are black Exploring the racial distribution of learners by schooling sector, public or independent, reveals that overall, independent schools have a lower proportion of black students. For public schools, the mean proportion of black students is 77%, while for independent schools it falls to 55%. In addition, a Wilcoxon rank-sum test indicates that the distributions of the proportion of black learners across schools is significantly different (P=0.000) across independent and public schools. Public schools are substantially more likely than independent schools to be 100% black, while independent schools are much more likely than public schools to have very low proportions of black children. Figure 4.2 below illustrates the different ways in which black learners are distributed across public and independent schools. 114 Figure 4.2: Distribution of Gauteng schools by the proportion of their learners who are black These figures highlight that there remains some accuracy to the perception of many independent schools as ‗white‘ institutions, even as there is evidence of the development of a substantial sub-group of independent schools which serve an entirely black student body (Centre for Development and Enterprise 2010). It seems likely that the disaggregation of independent schools into two groups, one entirely black, and one almost entirely white, is likely to fall largely along the same lines as the division of independent schools into two groups on the basis of performance – one excellent, and the other extremely poor (although the Centre for Development and Enterprise report referred to above does contest this hypothesis). Within the public sector, the distribution of black children across schools differs between primary and secondary schools, with primary schools being more likely to be entirely black than secondary schools, while secondary schools are more likely to be almost entirely white than primary schools (see Figure 4.3 below). A Wilcoxon rank-sum test finds that the distributions of the 115 proportion of black learners across primary and secondary schools are statistically significantly different (P=0.0112), and an examination of the differences in distributions reveals that while for primary schools the mean proportion of black learners is 78%, this falls to 71% for secondary schools. The differences in the racial composition of student bodies at primary and secondary schools seem likely to be an artifact of the high number of relatively small primary schools and the smaller number of relatively large secondary schools found in historically black areas. Unfortunately it is not possible to test this here. It is also important to point out that the data reflected here is only for one year (2002), during a period of extremely rapid change in South African society and schools, and that the distributions may well have since shifted. Figure 4.3: Distribution of the proportion of black learners for public primary and secondary schools in Gauteng The relationships between the proportion of black children in a school, and that school‘s quintile rating and Section 21 status are more straightforward than those relationships discussed previously, and are highly statistically significant (Kruskal-Wallis test, Pr=0.000). Almost all schools in quintiles 1-3 are entirely 116 black African. In quintile 4, the mean proportion of black children falls to 78%, and in quintile 5, the most advantaged schools, the mean proportion of black children is only 31%. Equally predictably, amongst schools without Section 21 status, almost all are entirely black. By contrast, amongst those with Section 21 status, the distribution is very similar to the distribution across all public schools as described previously. The evidence around the relationship between proportion of black learners and total school size is mixed. When all schools are considered, there is a very weak positive correlation, which does not reach statistical significance. However, when independent schools are removed from the sample and only public schools are considered, a statistically significant – but not extremely strong – negative correlation emerges. A scatter plot suggests that schools which are predominantly black have a much wider range of sizes than schools which are predominantly white, which tend to be middle-sized. Interestingly, the negative correlation between school size and percent black learners is particularly strong amongst public-sector primary schools – that is, public primary schools with a higher proportion of black learners tend to be smaller than those with a lower proportion of black learners. By contrast, among public secondary schools, there is evidence of a (more weakly) statistically significant positive correlation between school size and percent of learners that are black, suggesting that secondary schools with a higher proportion of black learners are larger than those with a smaller proportion. This tends to suggest that historically disadvantaged primary schools are likely to be particularly small, while historically disadvantaged secondary schools are likely to be particularly large. This will have implications for the average distance from a child‘s home, in a historically disadvantaged area, to his or her nearest schools – the distance to the nearest primary school is likely to be substantially shorter than the distance to the nearest secondary school. In addition, the child is likely to have access to a larger number of local primary schools than secondary schools. 117 4.5.6 School fees Of all the school data considered, the data on school fees is one of the most problematic, and is probably the least reliable. That said, school fees serve as an extremely useful measure of how accessible a school is, and are worth exploring even in the context of imperfect data. Overall, the recorded school fees for 2002 range from R0 to R9900. It seems likely that some schools, especially in the independent sector, were charging higher fees, but for some reason the captured figure was capped at R9900. In the full sample, the mean school fee charged was R1117, however, this obscured an extremely skewed distribution, as revealed by the median school fee of R120. When the schools are separated on the basis of public and independent, it is evident that while a small number of independent schools charged very low or no fees, the majority charged substantial fees, with the median figure being R2500. By contrast, when looking only at public schools, the median falls to R100, and the maximum to R8600. Within the public sector, school fees also vary substantially by phase, with secondary schools tending to be substantially more expensive than primary schools. For public primary schools, fees range from R0 to R6500, with a mean of R683, and a median of R70. For public secondary schools, the range is from R0 to R8600, with a mean of R1302, and a median of R200. Somewhat predictably, school fees also vary significantly by school quintile rating, with more affluent schools charging substantially higher fees, although there is some anomalous data for quintile 1 schools, which probably reflects poor reporting by those schools (Kruskal-Wallis test, Pr=0.0001). Along similar lines, fees are significantly higher (Wilcoxon rank sum test) at those schools with Section 21 status, as opposed to those without. There is a statistically significant positive relationship between school fees, and school size (Spearman correlation, Pr=0.0009). Much of this is probably explained by the higher fees typically charged by secondary schools, which are 118 also larger than primary schools. When only public primary schools are examined, however, the relationship between fees charged and enrolment becomes even stronger. By contrast, within the group of public secondary schools, fees tend to fall as enrolment increases. These divergent patterns are consistent with previous data showing that smaller primary schools typically have more black learners, and are therefore likely to be historically disadvantaged and less affluent, while in the secondary phase it is the larger schools that typically have more black learners. Also in line with this data, direct analysis of the relationship between school fees and the proportion of a school‘s learners that are black reveals an extremely strong relationship (Spearman correlation, Pr=0.0000). As schools become increasingly black, school fees fall substantially. This pattern is consistent across all schools, public and independent, and within the groups of primary and secondary schools as well. The relationship is stronger within the public sector than the independent sector, however, suggesting that a proportion of black learners attending independent schools may well be buying out of the public sector. Of all the relationships described so far, this negative relationship between school fees and black enrolment is by far the strongest. In the South African context where economic disadvantage and race are so strongly conflated, this is not in the least surprising. 4.5.7 Historical racial status of the school Of the Gauteng schools for which Apartheid-era department data is available, just under 60% fell under the DET. Although the majority of these schools remain in the public sphere, there are a small proportion of them – about 10% – that have subsequently become independent schools. When only public schools are considered, a significantly higher (χ2(1)=4.1224, Pr<0.042) proportion of primary schools are historically DET schools than the proportion of secondary schools. However, when independent schools are included, this distinction disappears. It is also noticeable that the majority of public schools 119 which are intermediate or combined are historically DET schools. Together, this suggests that fewer DET schools have become public secondary schools than would be expected, and that this may be because a number have moved into the independent sector, while others remain categorized as combined or intermediate schools. Predictably, historical DET status is strongly linked to quintile (χ2(4)=961.5693, Pr=0.000), with very few Quintile 4 or 5 schools having historically operated under the DET. Surprising, at first glance, is the fact that by far the largest proportion of historically DET schools are found in Quintile 3. However, given that the DET was an urban department, and that schools in the more rural areas of Gauteng were typically run by other departments, this does in fact make sense. Clearly, historical DET status cannot be used purely as a proxy for low resource levels at a school, as few of the quintile 1 and 2 schools were operated by the DET. However, it can probably operate as a useful proxy for poor schools located in urban contexts. Schools historically operated under the DET were substantially less likely to have obtained Section 21 status by 2002, compared to schools operated under other departments (χ2(1)=42.5644, Pr=0.000). Public schools historically operated by the DET typically had lower enrolments in 2002 than other schools (Wilcoxon rank-sum test, Pr=0.0000). When these schools are broken down into primary and secondary schools, however, it become clear that historically DET operated high schools are larger than other high schools (Wilcoxon rank-sum test, Pr=0.0000), while it is only the primary schools that actually tend to be smaller (Wilcoxon rank-sum test, Pr=0.0000). Historical DET status also provided a strong predictor of the 2002 proportion of a school‘s learners who were black (Wilcoxon rank-sum, Pr=0.0000). For ex-DET schools, the mean proportion of black students was 99%, while for 120 others it was 39%. This was not substantially different when further disaggregated by phase. In line with all data presented previously, historical DET status was also a strong predictor of school fees, with ex-DET schools charging substantially lower fees than others (Wilcoxon rank-sum, Pr=0.0000). 4.5.8 Matric pass rate As detailed in Chapter 3, due to the absence of school performance data for primary schools, the matric pass rate of the nearest secondary school was used to approximate performance for primary schools. While highly suboptimal, particularly given recognition that the matric pass rate is often a dubious indicator of school performance in secondary schools, this was the best available option, and as a result is presented here. Figure 4.4 below provides the kernel density plots for the matric pass rates at secondary schools, and the rates extrapolated onto primary schools as discussed in Chapter 3. Overall, both distributions are fairly similar. The one aberration is the much higher proportion of primary schools with pass rates around the 70% level. This is probably related to the trend of a larger number of smaller primary schools in historically black urban areas, where the typical pass rates of high schools are around the 70% level. The similarity of the distributions of matric pass rates applied to primary and secondary schools is also evident when details of the two distributions are examined. Both have means in the low seventies, although the median score in the secondary school distribution is substantially higher (78%) than that for the primary school distribution (72%). Standard deviations and scores at more extreme percentiles are also very similar. 121 Figure 4.4: Kernel density plots of pass rates at primary and secondary schools For various reasons, when matric pass rate is explored by school status, public or independent, the data is not very consistent. At the primary school level, the distinction does not make sense, as the imputed performance bears no relationship to whether the school is independent or public, but only to the performance of the secondary school closest to it. At the secondary level, the data is more meaningful, but, as mentioned in Chapter 3, performance data for the more highly performing secondary schools is largely missing. As a result, although performance varies significantly on the basis of whether a school is public or independent (Wilcoxon rank sum test, Pr=0.0951), with independent schools performing more poorly, this is due to a bias in the data and cannot be taken at face value. Due to concerns about the validity of the data for independent schools, the remaining analyses will be conducted on public schools only, unless otherwise specified. Examining the pass rates of public schools on the basis of their poverty quintile rating reveals, unsurprisingly, a strongly significant difference (Kruskal-Wallis 122 test, Pr=0.0001). Schools in the lowest quintiles (that is, the poorest schools) perform substantially more poorly than those in the higher quintiles, particularly at the extremes. The mean pass rate for quintile 1 schools is 64%, for quintile 2 schools it is 69%, for quintile 3 schools a slightly inconsistent 65%, for quintile 4 schools 72%, and for quintile 5 schools 92%. When primary and secondary schools are examined in isolation, this pattern does not change substantially. A strongly significant relationship was also identified between section 21 status in public schools and matric pass rates, with section 21 schools performing substantially better than those without section 21 status (Wilcoxon rank-sum test, Pr=0.0001). The mean pass rate was 66% for schools without section 21 status, and 74% for those with this status. The difference between schools with and without section 21 status is greater at the secondary school level. There is also a statistically significant relationship between pass rate and the size of a school, with larger schools tending to perform somewhat better (Spearman correlation, Pr=0.0005). However, when schools are broken down into primary and secondary, this relationship changes. At the primary school level, the positive relationship between school size and performance persists, whereas at the secondary level this relationship reverses, with smaller schools out-performing larger schools. This almost certainly relates to the high numbers of comparatively smaller primary schools found in less affluent urban areas, and the tendency for the secondary schools in these same areas to be larger than their counterparts in more advantaged areas. The strongest and most statistically significant relationship identified with pass rate is that with the proportion of a school‘s learners who are black (Spearman correlation, Pr=0.0000). As the proportion of black learners in a school rises, performance falls. The relationship is strongest for public sector secondary 123 schools, and somewhat weaker at the primary school level. The relationship between school fees and pass rates is almost as strong, with performance rising substantially in the schools charging the highest fees (Spearman correlation, Pr=0.0000). Again this relationship is weaker – though still strong – at the primary school level. The final analysis undertaken explored pass rate by whether the school was historically a DET school or not. Predictably, performance between DET and non-DET schools was significantly different, with DET schools substantially underperforming all other schools (Wilcoxon rank-sum test, Pr=0.0000). This was consistent across public and independent schools, as well as primary and secondary schools. 4.5.9 Descriptive schools data: discussion In summary, this descriptive schools data suggests that schools in Gauteng are strongly clustered, with a relatively small group of highly performing, well- resourced and expensive schools, and a much larger group of less well- performing, and more poorly resourced schools. Resource levels, school fees, racial composition of the student body, and school-level academic performance all remain closely related, even in the context of post-Apartheid South Africa. Additionally, these analyses provide evidence that the distribution of primary and high schools with respect to each other is somewhat different in different areas of the Gauteng province. In particular, there are substantially higher numbers of public primary schools than public high schools. In less advantaged urban areas (typically township areas), primary schools tend to be fairly small, and rather densely distributed. In these same areas, high schools tend to be particularly large, and far more sparsely distributed. In more advantaged areas, and amongst independent schools, primary and high schools are far more similar in terms of size, as well as in the numbers of schools available at each phase. 124 4.6 Conclusion This chapter began by providing an overview of the study sample with regards to the key explanatory variables examined in this thesis. It then moved on to address issues related to sample representativity and bias. By comparing the study sub-sample to both the full Bt20 cohort, as well as the 2005/6 non- attrition cohort, it was possible to establish that while the study sub-sample does under-represent children at the most advantaged and disadvantaged extremes of the population, this was largely the result of sample attrition, rather than the a result of the method of sub-sample construction. While caution should therefore be taken in applying the findings of the study to the full urban population of Johannesburg-Soweto, the sub-sample appeared to remain largely representative of the black African majority with mid-range levels of SES, who are the group of primary interest with regards to the questions asked in this thesis. Finally, the third component of the chapter presented descriptive statistics for the schools in the Gauteng province. It provided evidence that schools across the province are far from comparable, varying widely in terms of their student bodies, their fees, their access to resources, and their performance. Historically advantaged schools, typically located in historically advantaged areas, continue to outperform historically disadvantaged schools. All findings were very much in line with other analyses of the South African educational system (Fiske and Ladd 2004; Sujee 2004). In addition, this section provided evidence that in historically disadvantaged urban areas, a large number of relatively small primary schools are found, along with a fairly small number of much larger secondary schools. By comparison, in more advantaged areas, as well as in the independent sector, primary and secondary schools are much closer in size, and more evenly distributed. This has clear implications for the range of schools between which children in different areas are choosing, as well as the distances they are likely to need to travel from home to school. 125 Chapter 5: Measuring the extent of learner mobility in contemporary urban Johannesburg-Soweto 5.1 Introduction This chapter answers the first major empirical question asked in this thesis, by presenting data on the extent of leaner mobility in contemporary Johannesburg-Soweto. As discussed in previous chapters, due to the limited levels of knowledge and theory about learner mobility, particularly in the South African context but in the international literature as well, the best approach to measuring learner mobility is not immediately clear. While the majority of existing studies have looked at travel distance or time (Sekete, Shilubane et al. 2001), there is reason to believe that other approaches to measuring mobility, such as whether home and school are in the same area (Msila 2005; Karlsson 2007; Hunter 2010), or whether children attend their nearest school (Msila 2009), may also be important. Therefore, this chapter uses the three different operationalizations of learner mobility discussed previously to measure learner mobility amongst members of the study sample in both 1997 and 2003. Firstly, straight line distance between home and school is used as an indicator of distance. Secondly, whether or not the child lives and attends school in the same ‗area‘ is used as an indicator of whether a child attends a local school. Thirdly, whether or not a child attends his or her nearest grade-appropriate school is used as an indicator of choice. 126 5.2 Distance-based operationalization of learner mobility In this section, two different approaches to the use of the straight line distance between a child‘s home and school in measuring mobility are presented. The first approach simply looks at the distance between home and school, while the second is to use this distance to create binary indicators coded one if a child is travelling further than a particular distance, and zero if he or she is not. This allows us to answer two different question – firstly, how far children are travelling, and secondly, how many children are actually mobile. 5.2.1 Actual straight-line distance from home to school Comparing datasets As discussed in Chapter 3, two different variables for the school attended were created for each time point. The first variable for each time point was created using purely prospective schooling data, collected at that particular point in time. The second variable was based on the first, but used additional retrospective data collected at a later point to fill in gaps. For both timepoints, the prospective dataset therefore has a much higher number of missing cases, but is likely to have greater accuracy than the retrospective data, which makes use of recollection at a later point. For this reason, initial explorations of distance travelled to school focused on comparing these two different datasets for each point in time, to establish whether or not they provided satisfactorily similar results. The figures obtained using the different datasets are presented in Table 5.1 below. Sample No. of Observations Mean (km) Standard Deviation (km) Minimum (km) Maximum (km) 1997, small sample 746 5.623 10.798 .007 79.867 1997, small sample 742 5.249 9.544 .007 57.766 127 constrained at 60km 1997, large sample 1221 5.901 10.955 .007 105.038 1997, large sample constrained at 60km 1214 5.493 9.524 .007 57.766 2003, small sample 745 5.479 8.725 .045 57.551 2003, large sample 1285 5.625 9.768 .045 105.948 2003, large sample constrained 1281 5.355 8.462 .045 57.551 Table 5.1: Comparison of findings on distances travelled from home to school for different datasets, for 1997 and 2003. It is immediately apparent that although the minimum and mean distances from home to school using the prospective and retrospective variables are extremely similar, there is a substantial difference in the maximum distances, with much higher maximums presented in the retrospective data. For both time points, however, this is due to a small number of cases in the retrospective variable that travelled further than the maximum distance reported in the prospective dataset. Re-examination of the raw data in these cases did not provide any definitive information as to whether these distances were correct, but in most cases it seemed implausible that a child would travel that distance to attend the school identified, suggesting that perhaps an incorrect school name had been provided, or that the child was in fact not actually resident at the reported home address. Given that a few extreme outliers would bias the results of any further analysis, it was decided to constrain the data by recoding as missing any reported distances of over 60km between home and school. This removed 7 cases from the 1997 data, and 4 cases from the 2003 data. Once these outliers had been removed, the figures and distributions for the prospective and retrospective 128 data became substantially more similar, as is evident in Table 5.1 above, and Figure 5.1 below. Figure 5.1: Kernel density plots of the distribution of distances to school for the small and large samples, curtailed at a maximum distance of 60km, for both 1997 and 2003. Examination of the properties of both distributions for 1997, constrained at 60km, reveals that they are extremely similar. The same holds true for both distributions for 2003. Conducting an unmatched t-test on the two different samples for each time point also fails to reject the hypothesis that the means are the same in both cases, further supporting the argument that the distributions are indeed similar. Given the similarity in each year between the small sample and the large sample when the most extreme cases have been removed, all subsequent analysis makes use of these constrained larger samples, unless otherwise specified. 1997 distance from home to school The distance data for 1997 show a very high concentration of learners at the lowest levels of mobility (see Figure 5.2 and Table 5.2 below). 25% of learners are travelling less than half a kilometre to school, and almost half travel less than a full kilometre. As the distance of travel increases above 1km, however, the distribution begins to spread out substantially. The 75th percentile is reached at just below 6km, and the remaining 25% of the sample forms a long, 2003 1997 129 thin tail reaching out to 60km. In practical terms, this means that although almost half of learners attend a school that is extremely close to their home, there are also almost 25% of learners who travel over 6km – a fairly substantial distance for a 7 year old, and one that almost certainly indicates that these children or their families are making use of school choice, and are investing some financial resources into this choice, at the very least in terms of paying for transportation. These children are also likely to be travelling to schools in communities that differ substantially from those in which the live, particularly with regards to community affluence, resource levels, and historical racial designation. Given the shape of the distribution of distances travelled, taking a log transformation provides a useful way to compress the tail, and makes the distribution somewhat more normal. Key data regarding the distributions of the distance and transformed distance are provided in Table 5.2 below, and the kernel density plots are provided in Figure 5.2. The log transformation is particularly interesting in that it pulls together the cases spread out over the tail of the untransformed distribution. The second peak, around 3, is the effect of concentrating all these cases, and demonstrates that despite their low density, they do actually form a significant proportion of the distribution when considered together. Although the log transformation still fails standard tests for normality (Shapiro-Wilk in Stata 11), it is far closer to a normal distribution than the original data. 130 Figure 5.2: Kernel density plot overlaid on histogram illustrating the distribution of distances travelled by sample members in 1997. The log transformation of the distribution is also provided. 1997 25% 50% 75% 95% Mean Distance (km) 0.471km 1.032km 5.825km 24.686km 5.493km Natural log of distance -0.753 0.031 1.762 3.206 0.454 Table 5.2: Distribution of distances and log distances travelled by sample members 2003 distance from home to school The distribution of the 2003 data resembles the distribution of the 1997 data very closely, even though the children are substantially older at this time point (13 years, as opposed to 7 years), with a number already enrolled in high school (see Figure 5.3 and Table 5.3 below). The distribution is however slightly more compressed, indicated by the lower mean even as the values at the percentile levels are generally slightly higher. This may relate to the better quality of the data, which has cleared out spurious cases from the tail of the distribution, or it may relate to actual differences in the behaviour of children or the distribution of the relevant schools. The 2003 data is analysed more closely, controlling for schooling phase (primary or secondary) in Chapter 6, and a detailed comparison of the data for 1997 and 2003 is presented in Chapter 8. For the moment, however, the key points to note are that around 50% of children are attending schools less than 131 1.25km away from their home, but also that over a quarter of 13 year old children are travelling more than 6.5km to get to school on a daily basis. This is very similar to the data for 1997, which is unanticipated as it was hypothesized that mobility would increase substantially as children aged. Instead, it suggests that a fairly similar (although still high) proportion of children and their families are participating and investing resources in school choice in 2003 as was the case in 1997. Figure 5.3: Kernel density plot overlaid on histogram illustrating the distribution of distances travelled by sample members in 2003. The log transformation of the distribution is also provided 2003 25% 50% 75% 95% Mean Distance (km) 0.578km 1.243km 6.879km 23.362km 5.355km Natural log of distance -0.549 0.218 1.928 3.151 0.584 Table 5.3: Distribution of distances and log distances travelled by sample members Again, taking the log transformation of the distances travelled is helpful in compressing the distribution, and revealing the extent to which cases are concentrated in the tail end of the distribution. Once again, although it still fails tests for normal distribution, the transformed distribution is closer to a normal curve. 132 Actual straight line distance to school: Conclusion Overall, calculation of the distances between children‘s homes and schools reveals two key points. Firstly, the average (one-way) distance from home to school, regardless of which year or sample size is examined, is somewhere between 5 and 6km. Secondly, the distances are distributed as an approximately normal curve centred somewhere between 2 and 3km, with an extremely long and narrow tail to the right representing the roughly 25% of children who appear to live particularly far away from their schools. 5.2.2 Binary definitions of mobility While examining the actual distances between children‘s homes and their schools provides more detailed information about how far children are travelling, the use of binary definitions of learner mobility can facilitate the development of policy around learner mobility and school catchment areas, as well as the assessment of the implementation of existing policies. While some information is lost in moving from a continuous measure to a binary definition, analysis and interpretation are also simplified. As discussed in Chapter 3, various cut-off points for the binary definition of learner mobility suggest themselves on the basis of the existing literature and information on the topic, and the analysis presented here makes use of a number of them. 3km is used as this is the maximum distance a learner can travel and still be considered to attend a local school in South African policy (Martin 2010). It is also probably the maximum distance that a young child can be expected to walk to school. 5km and 10km cutoffs are also used, as they are frequently encountered in the local and international literature (Sekete, Shilubane et al. 2001; South African Human Rights Commission 2004). Working with these definitions, and exploring cumulative density plots suggested that various other definitions, particularly around 1, 1.5, 2 and 2.5km would also provide useful information. In all instances, the variable is defined by coding all children travelling up to and including the cut off distance as 0 (not mobile), and all those travelling more than the cut off distance as 1 (mobile). 133 As the data for 1997 and 2003 were again extremely similar, both time points are discussed together here, and data for both are presented in Table 5.4, below, which provides the numbers and percentages of children who are classified as mobile and non-mobile for each of the various binary definitions of mobility considered. The first important outcome of exploring the various binary definitions of mobility as proposed above is that there is remarkably little difference between the proportions of children defined as mobile across the different definitions, once a distance of 2.5km has been exceeded. After this point, the most striking shift occurs in the interval from 3 to 5 km, where the proportion of children classified as mobile falls from 33.53% to 27.59% (1997) and 33.96% to 28.96% (2003) with an increase in travel distance of a full 2km. While the absolute decreases in mobility from the 5km to the 10km definition are greater, this is spread over an interval of 5km. Given the steepness of the distribution curve prior to 2.5km, the relative flatness of the curve at the 2.5km to 3km interval is somewhat surprising, and indicates that this interval may have some significance. Mobility definition 1997: Number (%) mobile 2003: Number mobile (%) Travel more than 1 km 613 (50.49%) 727 (56.75%) Travel more than 1.5 km 505 (41.60%) 574 (44.81%) Travel more than 2km 451 (37.15%) 503 (39.27%) Travel more than 2.5 km 418 (34.43%) 458 (35.75%) Travel more than 3km 407 (33.53%) 435 (33.96%) Travel more than 5km 335 (27.59%) 371 (28.96%) Travel more than 10km 226 (18.62%) 239 (18.66%) Table 5.4: Numbers and percentages of children classified as mobile in 1997 and 2003, for each binary definition of mobility considered. Looking at a plot of the distribution of distances (see Figure 5.4 below) substantiates this indication that something important is happening around the 2-3km interval. The initial, parabolic distribution ends here, and the long flat tail of the distribution seems to begin. Similarly, the slope of the cumulative density function shifts from steep to flat during this interval. That this shift in 134 distributions occurs in the interval between 2 and 3km is fairly compelling for both empirical and theoretical reasons. Empirically, it corresponds roughly to the distance that that a young, school-age child could reasonably be expected to walk to school on a regular basis. Theoretically, the 3km endpoint of this interval corresponds to the South African definition of a local school as being within 3km of a child‘s home. Figure 5.4: Cumulative density plot of distance between home and school, up to 10km, laid over a histogram illustrating the density distribution of distance The second important outcome of these analyses is that they do indicate that for any of the proposed definitions of mobility, particularly those that are guided by the learner mobility literature, a substantial proportion of children are actually travelling substantial distances on a daily basis. In particular, roughly one third of children are travelling more than 3km. This is pretty clear indication that they are not attending local schools, particularly in an area such as Soweto where there is an extremely high density of public schools. It also suggests – at least in an urban area – that at least a third of children are making use of transportation, whether public or independent, to access schooling. This entails a substantial additional level of family investment in the schooling of these children. 2003 1997 135 5.3 Area-based operationalization of learner mobility As described in Chapter 3, the second approach to the operationalization of learner mobility draws on various levels of geographic areas as defined by Census 2001. The smallest area is the Small Area Level (SAL), followed by the Sub-Place (SP), the Main Place (MP), and finally the largest area, the Municipality (MN). Preliminary investigations of the mobility at the SAL level revealed that, due to the small size of SALs, very few children (less than 7%) attended school in the same SAL in which they lived. This makes sense, as very few urban schools are small enough to serve a community of only 200 households, suggesting that SALs are likely to be sharing schools, and as a result, data for mobility at the SAL level is not presented here. Once again, due to the strong similarity between the data for 1997 and 2003, findings for both time points are presented together. The numbers and proportions of children who are mobile for each of SP, MP and MN are presented in Table 5.5 below, for both 1997 and 2003. The SP level of analysis shows that just over 40% of children attended school in the same SP as they lived in in 1997, and just below 37% in 2003. Given that SP geography is roughly equivalent to residential suburbs, this suggests that around 40% of children are attending a local school within their suburb, while the other 60% are travelling to schools outside of their suburb. At the MP level, which corresponds to major areas of the city (for example Soweto, Meadowlands, Johannesburg, and so on), the proportion of children attending school within the MP where they live rises to over 70% for both 1997 and 2003. Interestingly, the proportion of children travelling across MP boundaries is very similar to the proportion travelling over 5km. Finally, at the MN level, very few children are travelling to a different MN area for school. Given the size of MN areas, and the fact that each MN includes a wide range of schools in terms of performance and cost, this is unsurprising. 136 1997: number (%) not mobile (i.e. school and home in same area) 2003: number (%) not mobile (i.e. school and home in same area) SP 494 (40.63%) 473 (36.90%) MP 882 (72.53%) 901 (70.28%) MN 1157 (95.15%) 1,236 (96.41%) Table 5.5: Number and percent of children who live and attend school in the same SP, MP and MN areas, in 1997 and 2003 The figure for mobility at the MP level is particularly significant, because the boundaries at the MP level of geography correspond most closely to the historical boundaries between areas designated for different race groups. This is critical, because the historical racial designation of a school remains one of the strongest predictors of school performance in contemporary South Africa, and is likely to be one of the major determinants of school choice. Additionally, historical racial group is also a strong predictor of the cost of attending a school. For this reason, those children crossing MP boundaries can be roughly equated to the group that are choosing to attend schools that were historically restricted to white, Indian or coloured children. Those children crossing SP, but not MP, boundaries, can by contrast be roughly equated to those children exercising some degree of school choice but without travelling to areas that were historically designated for other racial groups. This group of just under 30% of children should be roughly equivalent to those who are exercising school choice within historically disadvantaged areas, without leaving those areas. Determining the correlations between the different possible measures of mobility reveals that there is a very strong overlap between the MP definition, and the distance based definition using travel greater than 5km (see Tables 5.6 and 5.7 below). This substantiates the notion that both of these measures are identifying roughly the same group of children, those travelling fairly substantial distances to attend historically more advantaged schools, and that these measures are therefore likely to be of particular significance. 137 1997 Travel >2.5km Travel >3km Travel >5km Travel>10km SAL 0.1963 0.1924 0.1672 0.1296 SP 0.5897 0.5847 0.5114 0.3962 MP 0.7650 0.7700 0.8036 0.7131 MN 0.3063 0.3043 0.3334 0.4241 Table 5.6: Correlation coefficients between distance-based and area-based measures of mobility for 1997 2003 Travel >2.5km Travel >3km Travel >5km Travel>10km SAL 0.1503 0.1445 0.1287 0.0965 SP 0.5674 0.5486 0.4885 0.3664 MP 0.7850 0.8010 0.8212 0.6980 MN 0.2558 0.2482 0.2614 0.3440 Table 5.7: Correlation coefficients between distance-based and area-based measures of mobility for 2003 5.4 Nearest school based operationalization of learner mobility The final approach to measuring learner mobility involves determining whether or not children are enrolled at their nearest grade-appropriate school. As discussed in Chapter 3, while this is not a perfect indicator of engagement in school choice, the proportion of children not attending their nearest school is expected to provide a fair approximation of the proportion of children engaging in choice. Again, due to substantial similarities over time, the data for 1997 and 2003 is presented together. The first key finding using this approach to measuring mobility is that less than 20% of children are actually attending the grade-appropriate school nearest to their homes in both 1997 and 2003 (see Table 5.8 below). This figure is surprisingly low, and suggests that over 80% of children are travelling further than strictly necessary in order to attend school. One possible reason that children might not be attending their nearest grade-appropriate school could be that the school in question is an independent (private) school. Due to this, two 138 sets of figures are presented, one including only public schools, and one including independent schools as well. As is clear from the data in Table 5.8, this makes very little difference to the results. However, when the 2003 data is disaggregated by schooling phase – that is, when children enrolled in primary school are separated from those enrolled in secondary school – an interesting pattern is revealed. Despite hypotheses that mobility should be higher amongst high school children, a substantially higher proportion of these children are attending their nearest school (just under 22%). The overall proportion of children attending the nearest school remains the same because the proportion of primary school children attending the nearest primary school actually falls fairly markedly to just over 15%. While the higher proportion of high school children attending the closest school may just be due to a smaller number of available high schools (see Chapter 4), the lower proportion of primary school children attending their closest school at age 13 is more intriguing. One potential explanation is that children who are attending schools further afield perform more poorly, making them more likely to still be in primary school at age 13. An alternative, and somewhat more plausible explanation may be that when children fail a grade, their parents are more likely to try sending them to different schools, which may be further from their homes. These hypotheses, and others, will be explored in the subsequent chapters. Number (%) of learners attending the school closest to their home Mean distance to nearest school Maximum distance to nearest school 1997 public schools only 219 (17.92%) 0.417km 3.142km 1997 public and independent schools 217 (17.76%) 0.398 km 2.767km 2003, public schools only 235 (18.58%) 0.489km 4.230km 2003, public schools only; 141 (16.49%) 0.428km 3.142km 139 primary school learners only 2003, public schools only; high school learners only 94 (22.33%) 0.616km 4.230km 2003, public and independent schools 222 (17.40%) 0.466km 4.230km 2003, public and independent schools; primary school learners only 131 (15.32%) 0.410km 2.767km 2003, public and independent schools; high school learners only 91 (21.62%) 0.583km 4.230km Table 5.8: Number and percentage of learners attending the school closest to their home in 1997 and 2003, and the mean and maximum distances to the schools nearest to sample members’ homes The data on the distance from children‘s homes to their nearest schools provides an additional interesting finding: the mean distance a child needs to travel to attend their nearest primary phase school is just approximately 400m, and less than 5% of children need to travel over 1km. When contrasted to the actual distances children are travelling – previous calculations indicated over 50% of children travelling over 1km – this highlights the extent to which travel, even of moderate levels, appears to be due to children attending schools further from home than is strictly necessary. In 2003, however, not all children are still within easy walking distance of a grade-appropriate school. Although 95% of children in 2003 have to travel less than 1.15km to reach their nearest school, there are a small number of children who have to travel over 3km. This is probably primarily due to the fact, discussed in Chapter 4, that there are substantially fewer high schools in the Johannesburg-Soweto area, due to their typically having a somewhat larger 140 size. Disaggregating the children by schooling phase supports this hypothesis, as the data generated for the primary school children remains similar to that generated in 1997. Nonetheless, even when children are in high school, the average distance travelled remains substantially greater than the distance a child would need to travel to access his or her nearest school. 5.5 Conclusion: This chapter has explored three different approaches to defining and measuring learner mobility, and provided data about the extent of learner mobility in Johannesburg-Soweto on the basis of each of these definitions. Each definition is likely to prove particularly valuable for certain purposes, and in certain contexts. Using a distance-based measure provides both a binary and a continuous measure of mobility. The distance-based binary measure is of the form that is typically used in school choice related policy, and is therefore particularly valuable in assessments of the appropriateness or applicability of policy. The continuous measure of distance is particularly useful in exploring the actual extent of mobility and what it entails for particular learners in terms of the investments they are required to make, both financially and in terms of time. In addition, it allows for the examination of the distribution of the distances travelled by the entire sample, and, as it is the measure that has been most commonly used in the existing literature, it also allows for comparison with previous findings. The definition of mobility based on census geography is particularly useful in that it makes use of generally accepted geographical areas to explore the extent to which mobility is occurring within and between these areas. This is helpful in identifying whether learners are travelling between areas historically designated for different race groups, and thereby significantly enhancing the quality of education they are likely to receive. Additionally, it is, and thus identifying those learners who are likely to be making the most substantial 141 economic investments in their education. Finally, the definition based on whether or not the learner is attending the age-appropriate school closest to his or her home is useful in highlighting the extent to which even learners with relatively low levels of mobility may be engaging in more travel than strictly necessary or anticipated, and may also be engaging in school choice, particularly within the historically disadvantaged areas. This chapter has made two key contributions to the literature. Firstly, in providing three different approaches to the conceptualization and measurement of learner mobility, it has significantly enhanced the methodological tools available to the study of this practice. Secondly, it has, for the first time, provided population-based data on the extent of learner mobility in contemporary urban South Africa. In so doing, it has identified preliminary evidence to suggest that there may in fact be two patterns of school choice and mobility in operation in Johannesburg-Soweto. Firstly, there is a group of approximately 25% of the sample who are engaged in substantial travel from home to school on a daily basis, and who seem likely to be making significant investments in this mobility. Secondly, and somewhat less expectedly, there is also evidence that a large proportion of children who are not travelling substantial distances to school are still engaging in mobility and school choice. Even though they are attending schools relatively close to home, they are not attending the nearest grade-appropriate school to their home, and are often travelling to schools that are not located in the same residential areas as their homes. These patterns, and their importance to understanding the implications of learner mobility to educational access and equality, are explored in greater detail in the subsequent chapters. 142 Chapter 6: Individual, family and community characteristics of mobile learners 6.1 Introduction The previous chapter illustrated the extent of learner mobility in post- Apartheid Johannesburg-Soweto. Even using the most stringent definitions of mobility, the numbers of children engaged in mobility are substantial. Having developed an understanding of the extent to which learner mobility is taking place amongst school-age children in Johannesburg-Soweto, along with a clearer idea of exactly what this mobility entails, two further questions arise. The first relates to which children in particular are most likely to be engaging in learner mobility. The second relates to the characteristics of schools these children are choosing to attend, and the types of schools they are travelling to avoid. This chapter addresses the first of these questions, while the second will be addressed in Chapter 7. A child‘s educational mobility is expected to be closely related to the level of investment the child‘s family makes in his or her education. For this reason, it is useful to explore variation in both a family‘s access to resources to invest in education, and in variables which might be associated with a family‘s propensity to invest in education. This chapter explores the relationship to mobility of a range of variables at the levels of the individual child, the child‘s household, and his or her community. The child characteristics considered are race, gender, age at first enrolment in school, grade repetition and schooling phase. Family characteristics examined are maternal education, maternal marital status, and household SES, in both 1997 and 2003. At the community level, SAL, SP and MP poverty levels are examined. 143 Child level characteristics 6.2 Race Given the strong relationship between race, access to resources, and area of residence in South Africa, it is anticipated that race is related to educational mobility behaviours (Fiske and Ladd 2004; Fiske and Ladd 2005). While white children are most likely to have access to the resources required to engage in educational mobility, they are also least likely to need to engage in it to access good schools, given that most highly performing schools are located in historically white areas. Black children, by contrast, are likely to have the greatest incentives to engage in mobility, typically living in the areas with poorest schools, but are simultaneously least likely to have access to the necessary resources. Indian and coloured children are likely to fall somewhere in between the black and white children in terms of both incentives and ability to engage in mobility. As the numbers of white and Indian children present in the study sub-sample (28 and 25, respectively) are extremely small, findings for these groups are unlikely to be broadly representative, and are therefore not presented here. Discussion will be limited to the behaviour of black and coloured children. 6.2.1 1997 Straight-line distance Examining the distance between home and school on the basis of race reveals a strong relationship with race. As is evident in Table 6.1, black children tend to travel substantially further to school than coloured children (Wilcoxon rank- sum, Pr=0.0000). A kernel density plot (see Figure 6.1 below), illustrates just how different the distances from home to school are for black and coloured children. The kernel density plot for the coloured children is far more concentrated at very low levels of travel for coloured children than for their black peers. 144 Race Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Black African 1002 5.773km 9.589km 0.492km 1.138km 6.904km Coloured 159 3.935km 9.533km 0.360km 0.561km 1.834km Table 6.1: 1997 Distance between home and school on the basis of race Figure 6.1: Kernel density plot of distance to school in 1997, on the basis of race Census geography An area-based approach to measuring mobility finds similar patterns (see Table 6.2 below). At all levels of geography other than MN, coloured children are substantially more likely to live and attend school in the same area than black children. Black (n=1003) Coloured(n=160) χ2 School and home in same SAL 48 (4.79% 29 (18.31%) χ2(1) = 39.7146, Pr=0.000 School and home in 373 105 χ2(1) = 46.0932, 145 same SP (37.19%) (65.63%) Pr=0.000 School and home in same MP 696 (69.39%) 141 (88.13%) χ2(1) = 24.0038, Pr=0.000 School and home in same MN 957 (95.41%) 150 (93.75%) χ2(1) = 0.8334, N.S. Table 6.2: 1997 mobility at different levels of census geography, by race Nearest school Mobility analyses exploring whether or not children attended their nearest grade-appropriate schools again provided similar results, with coloured children being substantially more likely to attend their nearest school than black children (see Table 6.3). The analysis was conducted firstly using only public schools, and secondly using both public and independent schools, and in both cases similar figures were obtained. The finding that black children are the least likely to attend their nearest school makes sense given that they are likely to live in the poorest areas (as described in Appendix 3), and their nearest school is therefore more likely to be particularly poorly performing. Black (n=1009) Coloured (n=155) χ2 Child attends nearest school (public or independent) 146 (14.47%) 60 (38.71%) χ2(1) = 54.2007, Pr=0.000 Child attends nearest school (public only) 143 (14.17%) 62 (38.75%) χ2(1) = 57.6862, Pr=0.000 Table 6.3: Children attending their nearest grade-appropriate school, by race, for public schools only, and for all schools 6.2.2 2003 Straight-line distance As evident in Table 6.4, the distances from home to school in 2003 are slightly different from those in 1997, with an increase in the difference between the mean distances travelled by black and coloured children (Wilcoxon rank-sum, Pr=0.0000). In Figure 6.2, the extent to which coloured children are more likely to live very close to their school than black children is highly evident. 146 Race Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Black African 1065 5.834km 8.617km 0.635km 1.380km 8.090km Coloured 163 2.625km 7.121km 0.402km 0.668km 1.315km Table 6.4: 2003 Distance between home and school on the basis of race Figure 6.2: Kernel density plot of distance to school in 1997, on the basis of race Census Geography The 2003 area-based analysis (see Table 6.5 below) provides very similar results to the 1997 analysis. Once again, coloured children are much more like to attend school in the same SAL, SP and MP areas in which they live than black children. Black (n=1066) Coloured (n=163) χ2 School and home in same SAL 28 (2.63%) 16 (9.82%) χ2(1) = 72.4870 Pr=0.000 School and home in same SP 350 (32.83%) 110 (67.48) χ2(1) = 72.4870 Pr=0.000 147 School and home in same MP 702 (65.95%) 150 (92.02%) χ2(1) = 45.5357 Pr=0.000 School and home in same MN 1026 (96.25%) 159 (97.55%) χ2(1) = 0.6904 N.S. Table 6.5: 2003 Mobility across different levels of census geography, by race Nearest school Coloured children remain significantly more likely to attend their nearest grade-appropriate school than black children in 2003 (see Table 6.6 below). Black (n=1009) Coloured (n=155) χ2 Child attends nearest school (public or independent) 154 (14.50%) 61 (39.35%) χ2(1)= 57.4420 Pr=0.000 Child attends nearest school (public only) 161 (15.16%) 66 (40.99%) χ2(1)= 61.7256 Pr=0.000 Table 6.6: 2003 Children attending their nearest grade-appropriate school, by race, both for public schools only, and for all schools 6.2.3 Race and mobility discussion There is strong evidence that both in 1997 and 2003, race is closely related to mobility behaviour, regardless of the way in which mobility is measured. Overall, black children appear to be substantially more engaged in all forms of learner mobility than coloured children. There are a range of possible explanations for this, including that coloured children live in areas with better schools, that the coloured community is more cohesive and prefer to keep their children at local schools, or that coloured families are less likely to want to make substantial investments in their children‘s education for various reasons. 6.3 Gender It is possible that families approach the education of girls and boys differently. Certainly, in contemporary South Africa, girls are known to remain in formal 148 education longer, and also tend to outperform boys (Unterhalter 2005; Fleisch and Schindler 2009). During the Apartheid era, both policy and practice favoured different approaches to education on the basis of gender (Fiske and Ladd 2004), and some legacy of this might be expected to persist, particularly around the levels of investment in the education of children of different genders. To determine whether mobility, and by extension, educational investment, differs on the basis of gender, male and female populations were compared, using different definitions of mobility. 6.3.1 1997 Straight-line distance Examining the distribution of distance by gender does suggest girls, on average, travel slightly further than boys (see Table 6.7 and Figure 6.3 below). Closer examination of the data, however, seems to suggest that this difference, particularly evident in the means, may be caused primarily by a cluster of girls travelling fairly substantial distances, particularly between 20 and 60km, pulling the overall mean for girls (along with the standard deviation and percentile breaks) upwards. A Wilcoxon rank-sum (Mann-Whitney) test fails to find any significant difference in the distribution of distance from home to school on the basis of gender. Gender Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Boys 592 4.976 km 8.782 km 0.466 km 0.941 km 4.997 km Girls 622 5.985 km 10.163 km 0.475 km 1.086 km 6.928 km Table 6.7: 1997 distance from home to school, by gender 149 Figure 6.3: Kernel density plot of 1997 distance from home to school, by gender If distance from home to school is grouped into categories, a chi-square test does, however, reveal a significant difference between boys and girls (see Table 6.8 below). Overall, while distributions are fairly similar, boys are somewhat more likely to be travelling extremely short distances, and girls are somewhat more likely to be travelling distances over 20km. Up to 1km 1km- 2.5km 2.5km- 5km 5km- 10km 10km- 20km Over 20km χ2(5) =12.156, Pr=0.033 Boys (n=592) 306 (51.69%) 88 (14.86%) 50 (8.45%) 47 (7.94%) 66 (11.15%) 35 (5.91%) Girls (n=622) 295 (47.43%) 107 (17.20%) 33 (5.31%) 62 (9.97%) 68 (10.93%) 57 (9.16%) Table 6.8: Gender breakdown of 1997 categories of distance from home to school Census geography Girls were significantly more likely to attend a school in the same SAL in which they lived than boys (see Table 6.9 below). At all other levels of 150 geography, however, there was no evidence that girls and boys behaved differently. Boys (n=592) Girls (n=622) χ2 School and home in same SAL 33 (5.56%) 50 (8.03%) χ2(1) =2.893 Pr = 0.089 School and home in same SP 242 (40.81%) 252 (40.45%) χ2(1) =0.0163 Not significant School and home in same MP 439 (74.03%) 443 (71.11%) χ2(1) =1.303 Not significant School and home in same MN 570 (96.12%) 587 (94.22%) χ2(1) =2.376 Not significant Table 6.9: 1997 mobility across different levels of census geography, by race Nearest School Chi-squared tests provided no indication for any gender differences in the likelihood of children attending their nearest school, regardless of whether independent schools were included in the analysis or not. 6.3.2 2003 Straight-line distance Overall, patterns of mobility by gender in 2003 remained largely consistent with those identified in 1997, although the mean distance travelled to school by girls did decrease slightly (see Table 6.10 ). Nonetheless, girls still continue to travel, on average, almost a kilometre further than boys. Interestingly, the standard deviation on the distances travelled by girls has fallen quite substantially, approaching fairly closely the standard deviation on the distances travelled by boys. For both genders, the percentile distances have increased slightly, with the effect more noticeable for girls, particularly from the 75th percentile up. In sum, this suggests that distribution of travel distances for girls may have spread out slightly towards the tail end (representing greater distances), with proportionally fewer girls continuing to travel particularly short distances, although this effect is not large enough to be evident on the 151 kernel density plot of the distributions of distance travelled by gender (see Figure 6.4 below). A Wilcoxon rank-sum test indicates that girls travel further than boys in 2003 (Pr= 0.0489). Gender Number of observations Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Boys 631 4.953469 8.224416 0.522km 1.175km 5.524km Girls 650 5.745164 8.67542 0.619km 1.303km 7.446km Table 6.10: 1997 distance from home to school, by gender Figure 6.4: Kernel density plot of 2003 distance from home to school, by gender In 2003, the chi-square analysis of distance grouped into categories no longer reveals any significant difference between boys and girls, although a higher percentage of girls continue to travel particularly great distances (see Table 6.11 below). 152 Up to 1km 1km- 2.5km 2.5km- 5km 5km- 10km 10km- 20km Over 20km χ2(5) =8.428; Not significant Boys (n=631) 290 (45.96%) 123 (19.49%) 47 (7.45%) 65 (10.30%) 66 (10.46%) 40 (6.34%) Girls (n=650) 264 (40.62%) 146 (22.46%) 40 (6.15%) 67 (10.31%) 71 (10.92%) 62 (9.54%) Table 6.11: Gender breakdown of 2003 categories of distance from home to school Census geography Examining mobility by census area in 2003 reveals that the gender difference previously evident at the SAL has disappeared (see Table 6.12). A gender difference at the level of SP level has emerged, although in the opposite direction, with boys being more likely to attend school in the same SP where they live. There is no evidence of any gender difference at the MP or MN levels. Boys (n=631) Girls (n=651) χ2 School and home in same SAL 24 (3.80%) 26 (3.99%) χ2(1) =0.031 Not significant School and home in same SP 247 (39.14%) 226 (34.72%) χ2(1) =2.699 Pr =.100 School and home in same MP 452 (71.63%) 449 (68.97%) χ2(1) =1.087 Not significant School and home in same MN 609 (96.51%) 627 (96.31%) χ2(1) =0.037 Not significant Table 6.12: 2003 mobility across different levels of census geography, by gender Nearest school As with 1997, there is no evidence in that children of either gender were more likely to attend their nearest school in 2003, regardless of whether independent schools are included in the analysis or not. 153 6.3.3 Gender and mobility discussion There was some evidence that girls travelled, on average, further than boys in both 1997 and 2003, but this was extremely sensitive to the way in which mobility was measured. If girls are travelling further, one possible explanation is that parents value the education of girl children more highly, and therefore tend to invest more in their education. An alternative explanation may be that girl children are more likely to travel with parents, and attend school close to a parents‘ work place, perhaps due to safety concerns. Overall, however, the data presented here does not provide conclusive evidence for any substantial difference in mobility on the basis of gender. 6.4 Age at first school enrolment In South Africa, children have a legal window of two years during which to start their schooling. It is possible that the point during this window at which children begin their formal schooling relates to the level of interest or commitment that parents feel towards their child‘s schooling, with more committed parents enrolling children earlier. By contrast, it may also relate to a parent‘s ability to fulfil care-giving responsibilities, in which case parents with fewer resources may be more likely to pursue the earliest possible enrolment of their children to reduce their care-giving burden. It may also relate to different enrolment and application policies applied in different schools, with more selective schools preferring to enrol older and more independent children. It is therefore conceivable that the distance a child travels to school is connected to their age at first school enrolment, although the expected direction of this relationship is not evident. 154 6.4.1 1997 Straight-line distance Table 6.13 and Figure 6.5 show that children who start school at a later age travel significantly further than those who start at an earlier age (Wilcoxon rank-sum test; Pr= 0.0004). Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Earlier starters 646 4.704km 9.216km 0.449km 0.880km 4.569km Later starters 557 6.482km 9.866km 0.498km 1.221km 9.044km Table 6.13: 1997 distance from home to school, by age at first enrollment Figure 6.5: Kernel density plot of 1997 distance from home to school, by age at first school enrolment Census geography Although there was no relationship between starting school late and travelling between SALs or MNs for schooling, children who started school early were 155 significantly more likely to school within their residential SP (χ2(1)=10.8425; Pr=0.001), as well as in their residential MP (χ2(1)=9.8651; Pr=0.002). Nearest school There was no significant relationship between age at first enrolment and whether or not a child attended their closest grade-appropriate school, regardless of whether independent schools are included or excluded. 6.4.2 2003 Straight-line distance In 2003, although late starters still travel further on average than early starters, this difference is no longer statistically significant (Wilcoxon rank-sum test). Additionally, although the distance for the 75th percentile of late starters is still substantially higher than for early starters, at the 25th and 50th percentile, the early starters are actually travelling further. The distribution for late starters is therefore wider, but slightly flatter, than that for early starters (see Table 6.14 and Figure 6.6 below). One potential explanation for this change over time is that by 2003, more of the children who started school early are enrolled in high school, and that this requires them to travel somewhat further. This hypothesis is explored in the next section of this chapter, on the relationship between schooling phase in 2003 and mobility behaviours. Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Earlier starters 644 5.122km 8.143km 0.615km 1.288km 6.187km Later starters 563 5.750km 8.867km 0.567km 1.181km 8.077km Table 6.14: 2003 distance from home to school, by age at first enrolment 156 Figure 6.6: Kernel density plot of 2003 distance from home to school by age at first school enrolment Census geography analysis By 2003, there is no longer any evidence for differential mobility at any level of census geography on the basis of whether a child started school early or late. Nearest school There is a weakly significant relationship between whether a child starts school late, and whether he or she attends his or her nearest grade-appropriate school in 2003, but only when both public and independent schools are considered (χ2(1)= 2.9904, Pr = 0.084), with children starting late being less likely to be attending their nearest school in 2003. When only public schools are considered, there is no significant relationship. 6.4.3 Age at first enrolment and mobility discussion In 1997, there is a relationship between mobility and whether a child starts school earlier or later, with children starting later being more likely to travel 157 further. By 2003, this relationship has however largely disappeared. This may relate to the fact that a proportion of those who started school early have begun attending secondary schools, while those who started late are almost all still in primary schools. The relationship between mobility and schooling phase is explored further in the next section of this chapter. There is no clear and obvious explanation for why, in 1997, children who started school late are likely to have a greater distance between their homes and their schools. It may be the case that parents who plan to send their children to schools further afield, necessitating independent travel, as well as in many cases the ability to adapt to a different cultural environment, are waiting until children are slightly older before enrolling them in school. It may also relate to enrolment practices at more advantaged schools, were children are in some instances required to take entrance tests or undergo interviews. A third potential explanation is that less affluent parents, who are not able to send children to schools far from home, may also not be able to afford pre-school or child care for their children, and therefore prefer to send them to school as early as possible. More affluent parents, by contrast, may be less pressed to enrol children in primary school, preferring to ensure that children are genuinely school-ready. 6.5 School phase in 2003 As discussed in Chapter 4, there is evidence that those children who have reached secondary school in 2003 differ systematically from those that have not on the basis of race, gender, age at first enrolment, grade repetition, maternal education, and 1997 SES. All of these variables have been hypothesized to have some relationship to mobility, and as a result, children who have reached high school in 2003 may be exhibiting different mobility behaviours to other children simply because of this. On the other hand, high 158 school status may in itself have implications for mobility behaviour for a range of reasons. As described in Chapter 4, the Gauteng province contains a fairly small number of large high schools, and a much larger number of much smaller primary schools. Due to this, all else being constant, we would expect to see high school children travelling slightly further to school on average, and we would also expect a higher proportion of high school children to attend their nearest school. The descriptive mobility data presented in Chapter 5 suggests that this is indeed the case. An additional reason to anticipate changes in mobility behaviour between primary and secondary schooling is that the costs and benefits of mobility may change, in turn changing preferences around the selection of schools. For example, children of high school age can travel greater distances, on their own, more safely than younger children, and may also be able to walk further, lowering the cost of attending a more distant school. A factor in favour of stability in mobility behaviours between the primary and secondary schooling years, however, is any element of path dependence. For example, secondary schools may give preference to children from local primary schools, and those primary schools might likewise encourage children to enrol in local high schools. The difficulty of moving between schools in different areas may be much higher for older children. Preferences may also be shaped through different criteria at the secondary school level. For example, the academic performance of a secondary school is more immediately salient than the performance of a primary school, largely due to the availability of some information about the matric exam pass rate. The greater availability of evidence with regards to school academic performance may affect the importance attributed to school academic performance in school selection. At the high school level, children may also be 159 far more actively involved in the selection of their school, and may well be driven by different priorities than those used by their parents in selecting a primary school. By contrast, however, secondary schooling is typically more expensive than primary schooling, and this may influence parents to maintain or even increase their role in school selection. Finally, secondary schooling may simply be attributed greater value than primary schooling, changing the level of investment which families and children are willing to make in education. It is not immediately evident, however, in which direction these potential changes in school preferences should influence mobility when aggregated. Differences between the mobility behaviours in 2003 of children who have reached high school, and those who haven‘t, could be attributed either to their being different with regards to variables associated with mobility, or alternatively simply to the fact that they have reached high school. It is also possible that any difference is due to a combination of these factors. Due to the nature of the sample used in this study, in which only a relatively small, non- random group of children has reached high school by 2003, the relative contributions of individual and household variables on mobility cannot be separated out from any independent effect of high school status. Really untangling the extent to which schooling phase shapes mobility independent of socio-economic and other individual and family-level variables could be approached either through the use of data for years beyond 2003 for the current sample, or through the use of a broader, or differently structured sample. However, the available data does provide one way of obtaining some insight into this issue, by looking at whether the two groups of children had similar mobility behaviours in 1997 or not. If they behaved similarly in 1997, it seems likely that more of the difference in 2003 can be attributed to schooling phase. By contrast, if behaviour was already different in 1997, this suggests a more 160 important role for individual, family and community variables associated with mobility. This data is presented below. 6.5.1 1997 Straight-line distance Children who were still in primary school in 2003, and those who had reached secondary school in 2003, had very similar distances from home to school in 1997, as evident in Table 6.15 and Figure 6.7. A Wilcoxon rank-sum test confirmed that distance from home to school in 1997 is not statistically significantly different for the two groups. Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Primary school in 2003 780 (65.60%) 5.729km 9.350km 0.469km 1.132km 6.739km High school in 2003 409 (34.40%) 5.285km 10.073km 0.466km 0.910km 5.414km Table 6.15: 1997 distance from home to school, by schooling phase in 2003 161 Figure 6.7: Kernel density plot of 1997 distance from home to school by 2003 schooling phase Census Geography Children who have reached high school by 2003 are slightly more likely to attend a school in 1997 that is in the same SAL as their home (χ2(1)=4.3214, Pr=0.038), and the same SP as their home (χ2(1)=7.6029, Pr=0.006). There is, however, no evidence of any differences in mobility on the basis of 2003 schooling phase at either the MP or MN levels. Nearest School There is no evidence that schooling phase in 2003 is associated with a child‘s likelihood of attending his or her nearest grade-appropriate school in 1997, regardless of whether independent schools are included in the analysis or not. 162 6.5.2 2003 Straight-line distance As evident in Table 6.16 and Figure 6.8 below, children attending high school travel significantly further to school than those still in primary school in 2003 (Wilcoxon rank-sum test; Pr=0.0263). Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance High school in 2003 418 5.791km 9.216km 0.712km 1.421km 7.367km Primary school in 2003 853 5.164km 8.085km 0.533km 1.175km 6.475km Table 6.16: 2003 distance from home to school, by progression to high school by 2003 Figure 6.8: Kernel density plot of 2003 distance from home to school by phase of education in 2003 163 Given the fairly strong relationships between gender, age at first enrollment, and schooling phase in 2003 (see Appendix 3 for details), it is worth exploring the combined interactions of these variables with distance from home to school. Table 6.17 and Figure 6.9, below, illustrate that distance from home to school is different for each group when broken down by both gender and schooling phase. In all cases, boys travel less far than girls, with primary school boys travelling the shortest distances of all. High school girls are travelling further than any other group, including primary school girls. Both girls and boys at high school level are less likely to be travelling short distances to school, and it is only at the particularly high distances that the distributions actually differ on the basis of gender. At shorter distances (as is evident in Table 6.18), the distributions for high school girls and boys are fairly similar, as are the distributions for primary school girls and boys. A Wilcoxon rank-sum test indicates a weakly significant positive relationship between high school status and distance travelled for girls (Pr= 0.0689), but not for boys. There is, however, no evidence that girls travel significantly further than boys at either the primary or high school level. Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Primary school boys 459 4.778km 7.620km 0.509km 1.144km 5.816km High school boys 166 5.564km 9.805km 0.655km 1.260km 5.524km Primary school girls 394 5.614km 8.583km 0.567km 1.183km 7.425km High school girls 252 5.941km 8.823km 0.758km 1.481km 8.211km Table 6.17: 2003 distance from home to school, by gender and phase of education in 2003 164 Figure 6.9: Kernel density plot of 2003 distance from home to school by gender and phase of education in 2003 Census geography There is no evidence for any significant relationship between mobility at any level of census geography in 2003, and whether a child is in primary or high school. Nearest school Children who are in high school in 2003 are significantly more likely to be attending their nearest grade-appropriate school, whether independent schools are included (χ2(1)= 7.9497, Pr = 0.005) or excluded (χ 2 (1)= 6.3958, Pr = 0.011). 6.5.3 School phase in 2003 and mobility discussion The results presented in this section have suggested that those children who had reached secondary school by 2003, and those who had not, behaved fairly similarly with respect to educational mobility in 1997. In 2003, by contrast, those children who have reached secondary school do travel further than those 165 still in primary school. This makes sense as high schools tend to be larger and therefore less densely distributed, meaning that the nearest high school will on average be slightly further from a child‘s home than the nearest primary school. School distribution is also likely to explain why children in high school are more likely to be attending their nearest school – there are simply fewer options, particularly when a child faces financial constraints. The fact that these differences only emerge in 2003, once these children have reached high school, also supports the hypothesis that the changes in behaviour are linked more to high school status itself, than to the individual, family and community attributes of the learners in question. While it would be ideal to explore these patterns further, the relatively small number of children in high school by 2003, and the non-random constitution of this group, raises problems. These questions could, however, be usefully explored in future work making use of data from subsequent years, or a broader sample. The data presented in this section also suggest that gender- based differences in mobility may increase at the secondary school level. Again, however, additional data will be required before these relationships can be more conclusively tested. 6.6 Grade repetition Once enrolled in formal schooling, children have different experiences with grade progression. While many children do pass smoothly through the grades, a fairly substantial number are forced to repeat a particular grade, or even a number of grades (Fleisch and Schindler 2009). This section explores whether grade repetition between 1997 and 2003 has any relationship to distance travelled to attend school. This relationship is explored both in 1997, prior to repetition, and in 2003, after repetition. 166 Grade repetition may be thought of as an indicator of a child‘s inherent academic capabilities. In this case, it is possible that it might influence parental decisions on investment in schooling in both 1997 and 2003. Parents might choose to invest less in an academically less gifted child, or they might choose to invest more in the hopes of ensuring that child‘s success. To the extent that grade repetition reflects inherent academic capacity, the direction in which it should be expected to impact mobility is not clear. However, grade repetition may also simply reflect the quality of the school which a child attends (Lam, Ardington et al. 2008). In this case, grade repetition would be expected to be negatively associated with mobility. 6.6.1 1997 Straight-line distance Children who repeated a grade between 1997 and 2003 are significantly more likely to have shorter distances between home and school than those who did not repeat any grades (see Table 6.18 and Figure 6.10 below), as confirmed by a Wilcoxon rank-sum test (Pr=0.0000). Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Repeaters 440 4.082km 7.997km 0.398km 0.823km 3.751km Non- repeaters 730 6.516km 10.364km 0.506km 1.195km 8.152km Table 6.18: 1997 distance from home to school, by grade repetition status 167 Figure 6.10: Kernel density plot of 1997 distance from home to school by grade repetition Census geography Children who repeated a grade at least once between 1997 and 2003 were less likely to attend schools outside of the SP (χ2(1)= 5.1148; Pr = 0.024) and MP (χ2(1)= 12.6441; Pr = 0.000) in which they lived. There was no relationship between repetition and mobility at the SAL and MN levels. Nearest school There is no significant relationship between grade repetition and whether or not a child attends their closest grade-appropriate school in 1997, regardless of whether independent schools are included or excluded. 6.6.2 2003 Straight-line distance The relationship between grade repetition and distance travelled in 2003 is similar to that with distance in 1997. Again, as illustrated in Table 6.19 and 168 Figure 6.11 below, children who have repeated grades have a shorter distance from home to school (Wilcoxon rank-sum test; Pr=0.0000). Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Repeaters 439 4.249km 7.236km 0.464km 1.013km 4.640km Non- repeaters 750 6.145km 9.121km 0.664km 1.447km 8.583km Table 6.19: 2003 distance from home to school, by grade repetition status Figure 6.11: Kernel density plot of 2003 distance from home to school by grade repetition Census geography In 2003, children who have repeated grades are again significantly more likely to attend a school in their residential SP (χ2(1)= 8.3371, Pr = 0.004) or MP (χ2(1)= 8.3578, Pr = 0.004). There remains no evidence for any relationship at the SAL or MN levels. 169 Nearest school There is no evidence for any significant relationship between grade repetition, and whether or not a child attends his or her nearest school, public or independent, in 2003. 6.6.3 Grade repetition and mobility discussion Although there is a strong and consistent relationship between repetition and mobility, the available data says nothing about the causal direction of this relationship – does mobility shape repetition, or the reverse? Because most sample members live in relatively disadvantaged areas, mobility is usually associated with attendance at more advantaged schools, which tend to have lower levels of repetition. By extension, attending a local, less-advantaged school is likely to be associated with higher levels of repetition. For this reason, it seems more likely that mobility predicts repetition, than the reverse. A similar explanation seems plausible for the relationship between repetition and mobility at SP and MP levels of census geography. Of course, repetition is not determined purely by the school a child attends, and there is likely to also be an interaction effect operating, with children whose family circumstances favour better academic performance also being more likely to travel. The next set of analyses presented will explore the relationship between family attributes often associated with academic performance, and mobility. Household level characteristics 6.7 Maternal education Maternal education is anticipated to have a positive relationship with mobility, as more educated mothers are expected to place a higher premium on educational investment, and to have access to more resources to invest in their children‘s education. 170 6.7.1 1997 Straight-line distance As expected, mean distance travelled to school increases with maternal education level (see Table 6.20 and Figure 6.12 below). The mean figures for children with mothers with lower levels of education, up to grade 7, are skewed upwards by a few children who are travelling extremely substantial distances. This effect is evident in the high standard deviations and the very low distances at the 25th, 50th and 75th percentile. Maternal Education level Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Up to Grade 5 64 4.591km 13.075km 0.353km 0.708km 1.456km Grade 6 or 7 78 5.572km 11.411km 0.468km 1.121km 3.304km Grade 8, 9 or 10 530 4.207km 8.353km 0.434km 0.81km 3.646km Grade 11 or 12 349 6.828km 9.400km 0.608km 1.491km 9.769km Post- school education 92 7.224km 9.392km 0.642km 2.148km 11.879km Table 6.20: 1997 distance from home to school, by maternal education level Figure 6.12, below, provides a kernel density plot of the distances travelled by children to school, grouped by maternal education. The graph illustrates very clearly the variable nature of mobility by educational level, with children with functionally illiterate mothers being most likely to travel very short distances. As maternal education increases, however, a smaller proportion of children can be seen to travel very short distances, and the distributions gradually spread out, becoming slightly more normal. There is, however, substantial overlap between those children whose mothers have either grade 6 or 7, and those whose mothers have either grade 8, 9 or 10. A Kruskal-Wallis test indicates 171 that distance from home to school varies significantly on the basis of maternal education (Pr=0.0001). Figure 6.12: Kernel density plot of 1997 distance from home to school by maternal education level. Census geography Using chi-squared tests, whether or not a child attended school in the same MP as their home was the only geographical measure significantly linked to the distribution of maternal education for all levels of maternal education (see Table 6.21 below). When the children were divided on the basis of whether their mothers had completed primary education or not, significantly different levels of mobility at the SAL, SP and MP levels were identified. 172 Grade 5 and lower vs. Grade 6 or higher (functionally illiterate vs. functionally literate) Grade 7 and lower vs. Grade 8 or higher (primary school vs. higher than primary school education) Grade 10 and lower vs. Grade 11 or higher Grade 12 and lower vs. any post-school education (no post- school education vs. any post- school education) School in same SAL as home χ2(1)= 1.902 Not significant χ2(1)= 4.202 Pr = 0.040 χ2(1)= 0.815 Not significant χ2(1)= 0.153 Not significant School in same SP as home χ2(1)= 2.578 Not significant χ2(1)= 15.201 Pr = 0.000 χ2(1)= 2.1208 Not significant χ2(1)= 1.734 Not significant School in same MP as home χ2(1)= 6.409 Pr = 0.011 χ2(1)= 41.05 Pr = 0.000 χ2(1)= 9.363 Pr = 0.002 χ2(1)= 8.216 Pr = 0.004 School in same MN as home χ2(1)= 0.299 Not significant χ2(1)= 0.081 Not significant χ2(1)= 3.13 Pr = 0.077 χ2(1)= 0.693 Not significant Table 6.21: 1997 mobility across different levels of census geography, by maternal education level Nearest school A chi-square test indicated that children whose mothers were functionally illiterate were significantly more likely to attend their nearest public school than children whose mothers had higher levels of education (see Table 6.22 below). However, this relationship did become less linear at the highest levels of maternal education. When independent schools were included in the analysis, figures were largely similar, although somewhat more statistically significant (χ2(4) = 12.707, Pr=0.013). 173 Maternal education Up to grade 5 Grade 6 or 7 Grade 8, 9 or 10 Grade 11 or 12 Post- school education χ2 test results Child attends nearest public school (n=198) 19 (28.79%) 13 (16.67%) 103 (19.36%) 48 (13.64%) 15 (16.13%) χ2(4)= 10.7994 Pr = 0.029 Table 6.22: Children attending their 1997 nearest grade-appropriate public school, by maternal education 6.7.2 2003 Straight-line distance As was the case in 1997, distance from home to school in 2003 is also closely connected to maternal education (Kruskal-Wallis test, Pr=0.0001). As evident in Table 6.23 below, the gap between children whose mothers have completed up to Grade 10, and those who have completed Grade 11 or higher appears to have grown, while the distributions for children whose mothers have completed Grade 11 or 12, and those whose mothers have some post-school education, have become more similar. This is even more evident in Figure 6.13 below, in which the distributions of distance travelled are shown on the basis of maternal education. Maternal Education level Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Up to Grade 5 73 3.417km 8.443km 0.397km 0.778km 1.532km Grade 6 or 7 83 3.021km 5.826km 0.398km 0.788km 2.332km Grade 8, 9 or 10 560 3.855km 7.372km 0.506km 1.002km 2.692km Grade 11 or 12 364 7.792km 9.403km 0.835km 2.870km 12.175km Post-school education 97 7.811km 9.189km 0.938km 4.410km 11.858km Table 6.23: 2003 distance from home to school, by maternal education level 174 Figure 6.13: Kernel density plot of 2003 distance from home to school by maternal education level Census geography Again, as in 1997, chi-squared tests indicated that children with more educated mothers were more likely to attend a school outside of the area in which they lived for all levels of census geography other than MN (see Table 6.24 below). The 2003 data generally seems to indicate a stronger relationship between maternal education level, and whether or not children live and school within the same geographic area than was evident in 1997. Grade 5 and lower vs. Grade 6 or higher (functionally illiterate vs. functionally literate) Grade 7 and lower vs. Grade 8 or higher (primary school vs. higher than primary school education) Grade 10 and lower vs. Grade 11 or higher Grade 12 and lower vs. any post-school education (no post- school education vs. any post- school education) School in χ2(1)= 7.417 χ 2 (1)= 5.215 χ 2 (1)= 3.579 χ 2 (1)= 1.765 175 same SAL as home Pr = 0.006 Pr = 0.022 Pr = 0.059 Not significant School in same SP as home χ2(1)= 3.925 Pr = 0.048 χ2(1)= 52.025 Pr = 0.000 χ2(1)= 12.829 Pr = 0.000 χ2(1)= 9.414 Pr = 0.002 School in same MP as home χ2(1)= 4.135 Pr = 0.042 χ2(1)= 78.232 Pr = 0.000 χ2(1)= 17.673 Pr = 0.000 χ2(1)= 10.817 Pr = 0.001 School in same MN as home χ2(1)= 0.215 Not significant χ2(1)= 2.229 Not significant χ2(1)= 0.686 Not significant χ2(1)= 0.592 Not significant Table 6.24: 2003 mobility across different levels of census geography, by maternal education level Nearest school The distribution of levels of maternal education is significantly different between children attending their closest grade-appropriate school, public or independent, and those travelling further afield (Table 6.25). Children whose mothers have completed schooling up to grade 7 appear to be substantially more likely to attend their nearest public school than children whose mothers have higher levels of education. As with all previous analyses of maternal education, this relationship again appears to be stronger in 2003 than 1997. Mother’s education level Up to grade 5 Grade 6 or 7 Grade 8, 9 or 10 Grade 11 or 12 Post- school education χ2 test results Child attends nearest public school (n=213) 20 (27.03%) 26 (31.71%) 111 (19.89%) 43 (11.85%) 13 (13.54%) χ2(4)= 26.285 Pr=0.000 Table 6.25: Children attending their 2003 nearest grade-appropriate school, by maternal education 6.7.3 Maternal education and mobility discussion In 1997, at the lowest levels of maternal education, there is a very high level of variability around the distance travelled. This effect seems largely to have 176 disappeared by 2003, and may relate to poor quality data around these children‘s education and residence, resulting in inaccurate mobility data, or alternatively, lack of residential stability resulting in substantial travel. Overall, however, there is evidence that as maternal education increases, so too does educational mobility. The effect of maternal education on mobility also appears to increase over time, strengthening as children reach the end of primary school and start to enrol in secondary school. This relationship between mobility and maternal education is not strictly linear, and children whose mothers have attained varying levels of education between grade 6 and 10 all appear to behave quite similarly rather than as distinct groups. Additionally, the relationship between distance and maternal education appears to stop holding at the very highest levels of maternal education. One possible explanation for this is that children with the most highly educated mothers are more likely to live in affluent areas, and by extension, close to good schools. An interesting feature of the maternal education analyses is the fairly strong inverse relationship between maternal education, particularly at intermediate levels, and the likelihood that children will attend their nearest grade- appropriate school. This raises questions around the relationship between maternal education and the decision to engage in school choice, even if the resources available allow only for choice between fairly local schools. Of course, with both this relationship between maternal education and local school choice, and with the broader relationship between maternal education and distance travelled, it is critical to explore the role of SES. As SES is highly correlated with maternal education, it is important to attempt to separate the roles of resource availability and maternal education in shaping schooling decisions. This is explored both in the section on household SES later in this chapter, and in Chapter 9. 177 6.8 Maternal Marital status The next potential determinant of learner mobility to be examined is maternal marital status at the time of the child‘s birth. This provides an indicator of the home environment into which a child is born, with married mothers typically being associated with a more stable, socio-economically advantaged home environment than non-married mothers. It is therefore plausible to expect that the children of married mothers may be more likely to travel further to attend school. 6.8.1 1997 Straight-line distance As shown in Table 6.26 and Figure 6.14 below, children of married mothers have, on average, a slightly greater distance from home to school than the children of unmarried mothers (Wilcoxon rank-sum test; Pr= 0.0090). Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Married mothers 410 6.284km 10.102km 0.510km 1.280km 8.102km Unmarried mothers 797 5.125km 9.228km 0.448km 0.934km 5.298km Table 6.26: 1997 distance from home to school, by maternal marital status 178 Figure 6.14: Kernel density plot of 1997 distance from home to school by maternal marital status Census geography There was no evidence of a significant relationship between maternal marital status at birth, and a child‘s mobility at any level of census geography in 1997. Nearest school There was no evidence of a significant relationship between maternal marital status at birth and whether a child attended his or her nearest grade-appropriate school in 1997. 6.8.2 2003 Straight-line distance Although it is evident from Table 6.27 and Figure 6.15, below, that in 2003 children with married mothers continued to travel slightly further than children of unmarried mothers, this difference is no longer statistically significant (Wilcoxon rank-sum test). 179 Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance Married mothers 426 5.511km 8.235km 0.639km 1.264km 7.446km Unmarried mothers 848 5.292km 8.603km 0.554km 1.238km 6.327km Table 6.27: 2003 distance from home to school, by maternal marital status Figure 6.15: Kernel density plot of 2003 distance from home to school, by maternal marital status Census geography There was no evidence for a significant relationship between maternal marital status at birth and a child‘s mobility at any level of census geography in 2003. 180 Nearest school There was no evidence for a significant relationship between maternal marital status at birth and whether a child attends his or her nearest grade-appropriate school in 2003. 6.8.3 Maternal marital status and mobility discussion Although there does seem to be a weak relationship between distance from home to school and maternal marital status in 1997, this effect seems to have disappeared by 2003. There is no evidence that maternal marital status is related to any other measures of mobility. This may suggest that to the extent that maternal marital status does influence schooling choices, this effect is strongest when the children are young, and have fairly limited independence. It may also simply reflect the stronger relationship between marital status at birth and 1997 SES, as opposed to 2003 SES. Indeed, the extent to which any effect of maternal marital status on schooling choices is operating through the relationship between marital status and SES also merits further investigation. This is covered in Chapter 9. 6.9 Household SES Household SES is likely to play a core role in shaping school choice. Access to resources determines how much a family can afford to spend on travel to school, as well as how much they can afford to contribute to school fees and related expenses. It may also be highly correlated with determinants of the value the family places on education, such as parental education levels. Generally, it is anticipated that as SES increases, so too will, on average, the distance travelled to school. 181 6.9.1 1997 Straight-line distance Examining the means and distributions of distance from home to school on the basis of 1997 SES (see Table 6.28 and Figure 6.16 below) confirms the hypothesis that children from wealthier families do tend to travel further to school. Interestingly, the pattern for quintile 1 is quite distinct, while those for quintiles 2 and 3 are quite similar, as are those for quintiles 4 and 5. There is also some non-linearity in the relationship between SES and distance, particularly in quintiles 2 and 3. SES Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (most poor) 232 3.896km 10.559km 0.370km 0.694km 1.380km 2 217 4.789km 8.988km 0.449km 0.842km 4.230km 3 217 4.460km 8.226km 0.465km 0.946km 3.995km 4 225 6.256km 9.430km 0.483km 1.445km 8.654km 5 (least poor) 187 6.993km 8.865km 0.611km 2.416km 11.549km Table 6.28: 1997 distance from home to school, by 1997 household SES quintile 182 Figure 6.16: Kernel density plot of 1997 distance from home to school, by 1997 household SES quintile Census Geography As is evident in Table 6.29 below, more children become mobile at both the SP and MP levels of census geography, as SES increases. At the SAL level, the proportion of children who are mobile is extremely high for all SES groups, and the chi-squared test does not provide evidence that mobility varies on the basis of SES. At the MN level so few children are mobile that there is no difference in behaviour on the basis of SES. Due to the very small numbers of children involved the MN level is not shown in the table below. 183 Quintile 1 (most poor) 2 3 4 5 (least poor) χ2 test results School in same SAL as home 23 (9.87%) 11 (5.07%) 16 (7.37%) 17 (7.56%) 13 (6.95%) χ2(4)= 3.8529 N.S. School in same SP as home 126 (54.08%) 89 (41.01%) 94 (43.32%) 79 (35.11%) 64 (34.22%) χ2(4)= 23.2285 Pr=0.000 School in same MP as home 200 (85.84%) 164 (75.58%) 167 (76.96%) 143 (63.56%) 122 (65.24%) χ2(4)= 38.2041 Pr=0.000 Table 6.29: 1997 mobility across different levels of census geography, by 1997 household SES Nearest school Table 6.30, below, illustrates the proportions of children from each quintile who attend their nearest school. Although a higher proportion of children from quintiles 1 and 5 do appear to be attending their nearest school in both analyses, this is not statistically significant. The U-shaped nature of the relationship may explain the absence of significant result. Quintile 1 (most poor) 2 3 4 5 (least poor) χ2 test results Attends nearest public school 51 (21.89%) 31 (14.22%) 37 (16.97%) 40 (17.70%) 38 (20.32%) χ2(4)=5.2706 N.S. Attends nearest public or independent school 51 (21.89%) 33 (15.21%) 37 (16.97%) 38 (16.81%) 38 (20.65%) χ2(4)=4.6710 N.S. Table 6.30: Children attending their 1997 nearest grade-appropriate school, by 1997 household SES 6.9.2 2003 Straight-line distance The relationship between distance from home to school and household SES in 2003 remains very similar to that for 1997 (see Table 6.31 and Figure 6.17 184 below). However, the distances travelled have increased, particularly for children in the higher quintiles. This may in part relate to the higher likelihood that more advantaged children have reached high school by 2003. A Kruskal- Wallis test confirms that distance from home to school varies significantly on the basis of 2003 SES (Pr=0.0001). SES Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (most poor) 170 1.422km 2.480km 0.420km 0.749km 1.395km 2 169 4.717km 9.098km 0.560km 1.019km 4.267km 3 172 5.106km 8.476km 0.457km 1.185km 5.145km 4 170 7.664km 9.548km 0.637km 2.443km 12.466km 5 (least poor) 162 9.501km 9.433km 1.315km 6.392km 14.994km Table 6.31: 2003 distance from home to school, by 2003 household SES quintile Figure 6.17: Kernel density plot of 2003 distance from home to school, by 2003 household SES quintile 185 Census Geography In 2003 there is a fairly straightforward relationship between SES and mobility at the SP and MP levels (see Table 6.32 below), with children from higher income families more likely to be mobile. At the SAL level, there is again no relationship between mobility and SES. At the MN level, by contrast, in 2003 there is a significant relationship (χ2(4)=7.8098, Pr=0.099), with more wealthy children being more likely to travel between MNs for schooling. However, the actual numbers of children travelling between MNs remains very small (n=31), and this level is therefore not included in the table below. Quintile 1 Quintile 2 Quintile 3 Quintile 4 Quintile 5 χ2 test results School in same SAL as home 5 (2.94%) 4 (2.37%) 5 (2.91%) 9 (5.29%) 4 (2.47%) χ2(4)=3.1465 N.S School in same SP as home 95 (55.88%) 63 (37.28%) 61 (35.47%) 46 (27.06%) 33 (20.37%) χ2(4)=52.6549 Pr=0.000 School in same MP as home 154 (90.59%) 128 (75.74%) 123 (71.51%) 95 (55.88%) 75 (46.30%) χ2(4)=92.3332 Pr=0.000 Table 6.32: 2003 mobility across different levels of census geography, by 2003 household SES Nearest school In 2003 there is evidence that the poorest children are significantly more likely to attend the school nearest to their home (see Table 6.33 below). 186 Quintile 1 Quintile 2 Quintile 3 Quintile 4 Quintile 5 χ2 test results Attends nearest public school 41 (24.26%) 26 (15.38%) 28 (16.28%) 27 (15.88%) 18 (11.04%) χ2(4)=11.0515 Pr=0.026 Attends nearest public or independent school 40 (23.67%) 25 (14.79%) 26 (15.20%) 25 (14.71%) 17 (10.49%) χ2(4)=11.6124 Pr=0.020 Table 6.33: Children attending their 2003 nearest grade-appropriate school, by 2003 household SES 6.9.3 Household SES and mobility discussion In both 1997 and 2003, SES is a strong predictor of the distance between a child‘s home and school. The relationship becomes slightly more direct in 2003, when children are older, and at least for wealthier children appears to become stronger. There is also a fairly strong relationship between SES and mobility at both the SP and MP levels, although not at the SAL or the MN levels. In 2003, poorer children are more likely to attend their nearest grade- appropriate schools than their wealthier peers. As noted in Chapter 4, SES is strongly correlated with maternal education, and the similarities in the relationships between maternal education and distance and SES and distance are therefore unsurprising. One of the most interesting features of this set of analyses is the absence of a significant relationship between SES and whether or not a child attends their closest school in 1997. Given the significant nature of the relationship between maternal education and enrolment at the closest school, this is unexpected. It suggests one way in which the relationship between mobility and maternal education may differ from the relationship between mobility and SES. One potential explanation for these results is that more educated mothers are more likely to engage in school choice. To the extent that they have access to additional resources (which is reflected in the SES data), they may choose to access more advantaged schools 187 outside of their immediate area. This is evident in the relationship between SES and distance travelled. However, to the extent that these mothers do not have access to additional resources, and have lower SES, they may be constrained to choose from schools closer to home. However, they do continue to exercise choice, which is demonstrated by the lower levels at which their children attend the nearest schools, even when they aren‘t able to travel far. Whether this is indeed the explanation, and if so, why this effect disappears by 2003, is not clear, and is explored further in Chapter 9. Community level characteristics 6.10 Residential area poverty The area in which a child and his or her family live is also likely to influence school choice, and particularly whether travel over a substantial distance is a seen as a potentially beneficial option. An area‘s affluence is likely to be related to, but not identical with, the affluence of any particular household within this area. A wealthier family is more likely to be able to afford to live in an affluent area, but within well-off areas, for a range of reasons such as historical accident, an employer providing somewhere to live, or the rapid emergence of an informal settlement, a number of comparatively and absolutely disadvantaged families can typically be found. Similarly, some wealthy families can typically be found in even the most disadvantaged communities, choosing to live there for various reasons which might be historical, social or even economic (lower expenditure on rent, purchasing a home, or rates and property taxes may allow for greater expenditure on consumer goods or investments such as education). An area‘s relative affluence is expected to play a role in the educational choices available to families, and the costs of those choices. This is particularly 188 true in South Africa, where the affluence of areas is closely connected to their historical racial designation, which simultaneously also shaped the quality of the schooling available in those areas. Simply put, historically white areas are typically more affluent than other areas, and also offer higher quality educational opportunities locally. A child living in a particularly affluent community is therefore not likely to need to travel substantial distances to access a good school. By contrast, a child living in a poor and historically disadvantaged community is likely to need to travel a great distance. We can therefore expect that a family‘s SES, which shapes ability to travel, and the nature of the area in which they live, which shapes the extent to which travel is beneficial, will operate together in determining levels of learner mobility. As a result, the more affluent children living in less affluent areas are those expected to travel the greatest distances. The relationship between the poverty level of the area in which a child lives, and the distance that they are likely to travel to school seems likely to be considerably more complex than any of the other relationships explored thus far. The nature of the relationship seems likely to depend strongly on the level of geography which is being considered – SAL, SP or MP. For this reason, the results will be explored level by level. 6.10.1 1997 Small Area Level (SAL) Straight-line distance In the two lowest poverty quintiles of SAL (that is, the wealthiest areas), children typically travel fairly substantial distances to school (see Table 6.34 and Figure 6.18 below). Children in quintile 3 travel substantially shorter distances than children in any other quintile. The distances travelled by children in quintiles 4 and 5 (the poorest areas), are between these two extremes. Given this non-linear relationship between SAL poverty and 189 distance travelled, it is understandable that the correlation between SAL area poverty and distance from home to school is fairly low, although negative overall (-0.0591 for raw scores). Overall, children in wealthier SALs with lower poverty levels travel somewhat further than others, as confirmed by both a Kruskal-Wallis test (Pr=0.0001). However, this relationship is clearly not linear, and particularly worth notice is that the children living in the very poorest SALs are actually typically travelling further than those in only moderately poor SALs. Area Poverty Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 213 6.568km 9.206km 0.596km 1.912km 9.348km 2 248 6.957km 10.526km 0.492km 1.345km 9.264km 3 256 3.790km 7.433km 0.422km 0.758km 2.076km 4 259 4.451km 8.453km 0.398km 0.824km 4.285km 5 (poorest) 237 5.966km 11.334km 0.523km 1.177km 5.925km Table 6.34: 1997 distance from home to school, by SAL poverty quintile 190 Figure 6.18: Kernel density plot of 1997 distance from home to school, by SAL poverty quintile Census Geography There is no evidence for a relationship between the poverty level of the SAL in which a child lives, and the child‘s likelihood of a attending a school in that same SAL. Children living in the very poorest and most affluent SAL quintile are however more likely to be travelling to a school outside their home SP than children in intermediate SAL quintiles. Children in SAL poverty quintile 3 are most likely to attend a school in the same MP in which they live (χ2(4)=14.2914, Pr = 0.006). Finally, children living in the poorest SAL quintile are most likely to travel to a school outside of the MN in which their home is (χ2(4)=8.0681, Pr = 0.089). However, given the extremely low levels of MN mobility, it is not clear that these figures are particularly meaningful. Nearest school There is an almost linear relationship between the poverty of the SAL and whether children attend their nearest school, public or independent. Children in 191 the most affluent SALs are most likely to attend their nearest school, while children in the poorest SALs are least likely to attend their nearest schools (χ2(4)= 17.7483, Pr=0.001 for public schools only; χ 2 (4)= 13.3720, Pr=0.01 for public and independent schools). Sub Place (SP) Straight-line distance Children in quintile 2 and quintile 5 (highest poverty) SP areas have the greatest distance from home to school, with children in quintiles 3 and 4 travelling particularly short distances (see Table 6.35 and Figure 6.19 below). Accordingly, the correlation between SP area poverty and distance travelled is weaker than that for SAL, although it remains negative (-0.0263 for raw scores). Although children in wealthier SPs are continuing, on average, to travel slightly further than children in poorer SPs, this relationship is somewhat weaker than was the case for the SALs (Kruskal-Wallis test, Pr=0.0001). Area Poverty Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 218 5.762km 8.820km 0.495km 1.524km 6.698km 2 247 7.274km 10.490km 0.496km 1.500km 11.237km 3 267 3.504km 6.166km 0.424km 0.843km 3.915km 4 248 3.694km 6.860km 0.413km 0.743km 2.142km 5 (poorest) 233 7.542km 13.246km 0.633km 1.404km 7.608km Table 6.35: 1997 distance from home to school, by SP poverty quintile 192 Figure 6.19: Kernel density plot of 1997 distance from home to school, by SP poverty quintile Census Geography Children living in poorer SP areas are more likely to be travelling to a school outside of the SAL in which they live than their peers in more advantaged areas (χ2(4)=11.1595, Pr = 0.025). Children living in poorer SP areas are also more likely to be travelling to a school outside of the SP in which they live (χ2(4)=28.1873, Pr = 0.000), but this relationship is not strictly linear. Children in quintile 5 (poorest areas), and quintile 2 seem to be far more likely than any other groups to be mobile at the SP level. Children in SP quintile 5 and 2 are also substantially more likely to be mobile at the MP level than their peers (χ2(4)=25.9282, Pr = 0.000). A similar relationship also holds with MN mobility (χ2(4)= 17.9573, Pr = 0.001), although only a very small number of children are travelling at this level. Overall, although the relationships are not strictly linear, children living in the very poorest SPs are more mobile at each level of census geography than their peers in more affluent areas. 193 Nearest school analysis Children living in wealthier SP areas are more likely to attend their nearest school than children living in SP areas with higher poverty (Kruskal-Wallis test; Pr=0.0001 regardless of whether independent schools are included). Main Place As discussed in Chapter 3, due to extreme clustering only 3 poverty quantiles are used at the MP level, and even these are very uneven, meaning that the findings presented in the section should be treated with some caution. Straight-line distance Children living in the poorest MP areas live the furthest distance away from their schools (see Table 6.36 and Figure 6.20 below; Kruskal-Wallis test, Pr=0.0155). Although this is a different result from that found at the SAL and SP level, it is not clear whether it is a feature of the larger area size considered, or a function of grouping children into three quantiles as opposed to five quintiles. MP Area Poverty Quantile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 394 5.349km 9.299km 0.447km 0.909km 5.303km 2 705 5.453km 9.580km 0.455km 1.040km 5.816km 3 (poorest) 115 6.233km 9.983km 0.699km 1.554km 7.608km Table 6.36: 1997 distance from home to school, by MP poverty quantile 194 Figure 6.20: Kernel density plot of 1997 distance from home to school, by MP poverty quantile Census geography Children living in the most affluent MP areas are most likely to attend school in the SAL in which they live (χ2(2)=16.2439, Pr = 0.000). These children are also more likely to attend school in the SP in which they live (χ2(2)= 24.4440, Pr = 0.000), and also in the MP in which they live (χ2(2)= 16.6547, Pr = 0.000). At the MN level, however, children in MP poverty quintile 2 are substantially more likely to travel between MNs than their peers living in more affluent or poorer MPs (χ2(2)= 27.0329, Pr = 0.000). Nearest School analysis Chi-square tests indicate that children living in more advantaged MPs are significantly more likely to be attending their nearest schools (χ2(2)=34.0264, Pr = 0.000 for public schools only; χ2(2)= 28.5264, Pr = 0.000 for public and independent schools). 195 6.10.2 2003 Small Area Level (SAL) Straight-line distance In 2003, it is the quintile of children living in the poorest SALs that travel the shortest distances to school, although children in SAL poverty quintile 3 only travel slightly further (see Table 6.37 and Figure 6.21 below). The change in the relative travel of children living in the poorest SALs may relate to the opening of new schools, these children being less likely to have reached high school, or more accurate data for these children. The correlation between SAL poverty and distance travelled remains weakly negative (-0.0721 for raw scores), and a Kruskal-Wallis test indicates that distance to school varies significantly with SAL poverty level (Pr= 0.0053). Area Poverty Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 223 6.006km 8.949km 0.664km 1.720km 7.410km 2 252 6.439km 9.058km 0.534km 1.629km 10.246km 3 270 4.910km 8.693km 0.464km 0.873km 5.387km 4 271 5.268km 7.604km 0.631km 1.214km 7.796km 5 (poorest) 264 4.304km 7.946km 0.669km 1.247km 3.276km Table 6.37: 2003 distance from home to school, by SAL poverty quintile 196 Figure 6.21: Kernel density plot of 2003 distance from home to school, by SAL poverty quintile Census geography Children living in more affluent SAL areas are more likely to attend a school in the same SAL as their home than children living in poorer SAL areas (χ2(4)= 11.1184, Pr = 0.025). There is no significant relationship between SAL poverty and mobility at the SP, MP or MN levels, however. Nearest school Children living in more affluent SALs are significantly more likely to attend their nearest school (χ2(4)= 16.2545, Pr = 0.003 for public schools only, and χ2(4)= 13.7166, Pr = 0.008 for public and independent schools). Sub Place (SP) Straight-line distance As evident in Table 6.38 and Figure 6.22, children living in poorer SP areas travel shorter distances to school than their peers living in more affluent SP 197 areas (Kruskal-Wallis test; Pr=0.0133). A weakly negative correlation between SP poverty and distance travelled remains in place (-0.0455 for raw scores). Area Poverty Quintile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 223 5.314km 8.712km 0.535km 1.263km 5.391km 2 264 6.345km 8.666km 0.635km 1.742km 9.920km 3 283 5.636km 9.370km 0.522km 1.020km 7.425km 4 258 4.393km 6.912km 0.572km 1.010km 4.667km 5 (poorest) 252 5.011km 8.340km 0.678km 1.421km 5.916km Table 6.38: 2003 distance from home to school, by SP poverty quintile Figure 6.22: Kernel density plot of 2003 distance from home to school, by SP poverty quintile Census geography Again, children living in more affluent SP areas are more likely to attend school within the same SAL in which they live than their peers in poorer SP areas (χ2(4)= 8.0401, Pr = 0.090). A similar pattern is also found for mobility at 198 the SP (χ2(4)= 22.6349, Pr = 0.000), MP (χ 2 (4)= 25.5792, Pr = 0.000) and MN (χ2(4)= 8.4396, Pr = 0.077) levels. Nearest school Children living in the most affluent SPs are again more likely to attend their nearest school than those in poorer SPs (χ2(4)= 21.7449, Pr = 0.000 for public schools only; χ2(4)=19.7176 Pr = 0.001 for public and independent schools). Main Place (MP) Straight-line distance At the MP level, the 1997 finding that the children living in the poorest MPs tend to travel further than children in more affluent MPs is replicated (see Table 6.39 and Figure 6.23 below). Children in the wealthiest MPs travel the least far, children in mid-range MPs travel somewhat further, and the children in the highest-poverty MPs travel furthest of all (Kruskal-Wallis test, Pr=0.053). There is a positive correlation between MP poverty score and distance travelled (0.0824), indicating that as MP poverty increases, so too does distance travelled. The fact that the 1997 finding is replicated here suggests that MP area poverty might have a different relationship with mobility than poverty at smaller area levels. MP Area Poverty Quantile Number of children Mean distance to school Standard deviation 25th percentile distance 50th percentile distance 75th percentile distance 1 (wealthiest) 413 4.809km 8.737km 0.500km 1.106km 4.300km 2 743 5.707km 8.504km 0.635km 1.293km 7.377km 3 (poorest) 125 5.070km 7.145km 0.653km 1.572km 7.583km Table 6.39: 2003 distance from home to school, by MP poverty quantile 199 Figure 6.23: Kernel density plot of 2003 distance from home to school, by MP poverty quantile Census geography Children living in more affluent MP areas are more likely than their peers in poorer MP areas to attend a school in the same SAL (χ2(2)= 13.3806, Pr = 0.001), SP (χ2(2)= 20.7043, Pr = 0.000), MP (χ 2 (2)= 27.7652, Pr = 0.000) and MN (χ2(2)= 13.5678, Pr = 0.001) as their homes in 2003. Nearest school Children living in more affluent MPs are significantly more likely to attend their nearest school in 2003 than their peers living in less affluent MP areas (χ2(2)= 39.5873 Pr = 0.000 for public schools only; χ 2 (2)= 37.8819, Pr = 0.000 for public and independent schools). 6.10.3 Discussion of residential area poverty and mobility Two main streams of findings emerge from all the above analyses of residential area poverty. Firstly, there is a fairly complex relationship between 200 the area a child lives in, and his or her education-related mobility The exact nature of this relationship is not clear from the analyses documented above, although the overall trend seems to be that on average children living in relatively advantaged areas, as well as those in the most disadvantaged areas, tend to travel furthest. Secondly, however, there is a strong and straightforward relationship between the area a child lives in, and the likelihood that he or she will attend the nearest grade appropriate school, whether public or independent. The patterns evident in the first set of findings seem likely to relate to complex interactions between neighbourhood poverty and a range of household and individual characteristics. Overall, though, with the exception of some of the nearest-school analyses, children living in wealthier areas do seem to be more mobile than children living in poorer areas. As these children are, on average, more affluent than the children from poorer areas, it makes sense that their parents or families are more likely to have the resources (economic and social), to engage in mobility. However, children living in poorer areas, with poorer local schools, almost certainly have far stronger incentives to engage in mobility. The patterns of mobility revealed by the analyses presented here seem most likely to represent this interaction between incentives and capacities for mobility. The second set of findings is simpler to explain, and almost certainly relates to the higher quality of schools in more affluent areas, regardless of the geographic level at which these areas are defined. Children in more affluent areas therefore almost certainly have less of an incentive to choose to attend a school other than the one closest to their home. Two alternative explanations should, however, also be considered. Firstly, in affluent areas, which are generally more spread out, a child is more likely to be faced with having only one school within easy walking distance of his or her home, and may, therefore, be constrained to attend this particular school in a way that children 201 in less affluent areas with a denser distribution of schools are not constrained. However, given that there is a fairly strong relationship between household SES and the affluence of the area in which a child lives, it seems likely that this sort of constraint should not affect many children living in more advantaged areas. Secondly, and particularly in 1997 when the children are younger, is the fact that in affluent areas most schools are English or Afrikaans medium, while in less affluent areas, they may be operating in any one of several African languages that are heavily represented in the Johannesburg- Soweto area. While parents may consider education in English or Afrikaans even if these are not the home language, they may not be willing to accept education in an African language other than the child‘s home language. This may, to some extent, be governing decision making around which school children will attend, and whether the nearest school to the home is an option. The finding that children in less affluent areas are less likely to attend their nearest school – even if they still don‘t travel very far – is of particular importance. It appears to indicate that children and families living in poor areas still exercise school choice as far as they are able to, given the resource constraints that they face. 6.11 Conclusion This chapter has explored the relationship between a range of individual, household and community level variables, and learner mobility in South Africa. At the individual level, race, gender, age at first enrolment, and grade repetition were examined. Clear evidence was presented that race is strongly linked to mobility behaviour, and particularly that coloured children are less likely to engage in mobility than black children. Although girls travel slightly further than boys, there was not sufficient evidence to demonstrate a consistent relationship between gender and mobility. 202 Interestingly, and contrary to expectations, children who started school in 1997, as opposed to 1996, tended to be substantially more mobile in 1997. While the reasons for this are unclear, the most plausible relates to whether or not parents can afford preschool or other child care if a child is not attending school. This effect had largely worn off by 2003, perhaps in part due to the effect of late starters being more likely to have reached secondary school in 2003. There was a strong relationship between mobility and repetition, with repetition being much less likely for children travelling further. This is almost certainly a reflection of household SES, which was higher with greater travel, and the fact that for most children, travelling further is associated with attending a higher quality school. At the household level, maternal education, maternal marital status, and household SES were explored. A strong, positive relationship between mobility and maternal education was identified. In particular, children with mothers who had an education up to and including grade 10 had a very different distribution of mobility from those children with mother who had completed grade 11 or higher. Although much of the relationship between maternal education and learner mobility is likely to be mediated by SES, there was also some evidence for an independent effect of maternal education on school choice, particularly in 1997. In particular, children with more educated mothers were substantially less likely to attend their local schools, suggesting that more educated mothers were more actively engaged in school choice. There was, however, very limited evidence for a relationship between maternal marital status and mobility behaviour. As anticipated, a particularly strong relationship between mobility and socio- economic status was identified. Although this relationship was not strictly linear, children from more affluent households generally tended to live further away from their schools. This relationship appears to become stronger over time, with SES being more closely related to mobility in 2003. 203 At the community level, measures of residential area poverty were explored at three different levels of census geography. The relationship between these measures and mobility behaviour is relatively complex, which is probably due to the interaction of area and household SES. While children living in poorer areas are likely to have a greater incentive to travel to school, they are less likely to have the resources to do so. By contrast, children in more affluent areas are less likely to need to travel to attend a school of their preference, but are much more likely to have the ability to do so. Overall, however, the children living in relatively advantaged areas, and those living in the most disadvantaged areas appear to be most likely to travel substantial distances, and to travel outside of their residential area to attend school. A more clear relationship exists between area poverty and whether or not a child attends their nearest grade-appropriate school – children living in more affluent areas are substantially more likely to attend the school nearest to their home. The data presented in this chapter substantiates the notion that there are two different forms of school choice operating in the South African educational market. Firstly, families choose to send children to relatively distant but historically advantaged schools whenever possible. This is evident in the substantially greater distances typically travelled by the more affluent members of the sample, and in the extent to which the relationships between mobility and its correlates seem likely to be shaped by SES. By contrast, however, there is also evidence that less-advantaged children and their families are also engaged in choice, even if this does not typically involve substantial travel to historically advantaged schools. The prevalence of choice at a more local level is evident in the data on which children attend their nearest school. Unlike the distance travelled to school, this appears to be inversely associated to family wealth. By contrast, maternal education and area poverty play a more significant role in shaping this form of choice. This 204 suggests that when sending children to historically advantaged schools is not an option, engaged parents, and particularly those in less affluent areas, attempt to provide their children with the best possible education by choosing from amongst local schools. In the next chapter, Chapter 7, the relationship between mobility and school attributes will be explored. This will be followed in Chapter 8 by an investigation of the trajectories of individual learners with respect to mobility over time. This will provide insight as to whether mobility in 1997 can be used to predict mobility in 2003, as well as to potential determinants and correlates of changes in mobility patterns over time. Chapter 9 will then combine the data presented in the current chapter with the findings of Chapters 7 and 8 to develop a preliminary, partial model of the determinants of various forms of learner mobility. 205 Chapter 7: School characteristics associated with mobile learners 7.1 Introduction Along with the individual, household and community level variables discussed so far, a child‘s mobility is also likely to be related to the attributes of the schools available to the child, both locally and further afield. School attributes, such as resource levels, educational quality, cost, and racial composition are all likely to play various roles in attracting or deterring potential learners. It therefore makes sense to explore key variables of both the schools local to a child, and the schools a child attends, in attempting to understand the mobility of any particular child. This chapter consists of three sections. The first section explores the attributes of those schools attended by sample members from 1997-2003, along with the attributes of those schools closest to the homes of sample members. Data is presented first in unweighted form, and secondly, weighted by the number of children enrolled in each school. The second section explores the distribution of sample members across the schools attended, and provides data on which children attended schools with which properties. Finally, the third section explores the relationships between patterns of travel, and the schools that children attended. 206 7.2 Schools attended by study sample members, and grade-appropriate schools closest to study sample members’ homes 7.2.1 Unweighted data Obviously, the schools attended by the members of the sample used in this study are only a subset of all of the schools in the Gauteng province described in Chapter 4. They are not likely to be an extremely representative subset either, due to the urban location of sample members, and the under- representation of the extremely poor and the extremely wealthy, as discussed in Chapter 4. For these reasons, it is useful to generate some descriptive data specific to the group of schools attended by sample members. For similar reasons, descriptive data around the subset of schools which comprise the nearest grade-appropriate school for sample members will also be generated. It must be stressed that the data provided in this section is not weighted for the number of cohort members enrolled at each school. Rather, all schools attended by cohort members are weighted equally here – whether they are attended by just one cohort member, or many. Data weighted by the number of sample members enrolled at each school will be explored in the subsequent section. Overall, in 1997, the 1428 members of the study were attending a total of 365 different registered schools. However, as data is missing for a few individuals, and a small number of children also attended unregistered schools, this figure is probably slightly low. In 2003, this had risen to 465 schools. The figure is higher because both primary schools (310) and secondary schools (155) are included at this point in time. 378 different primary schools are identified as being the nearest primary school to members of the study sample, while 212 secondary schools are identified as being the nearest secondary school to members of the study sample. These figures are consistent with the larger number of relatively smaller primary schools found in the Gauteng province, 207 and the smaller number of relatively larger secondary schools. From these figures, it is also evident that the number of schools actually attended is, for both phases, smaller than the number of schools that would be attended if all children simply attended their nearest school. School sector 10.54% of the schools attended by sample members during the study period were independent schools. By contrast, roughly 20% of the registered schools in Gauteng were independent. The schools nearest to the homes of sample members represent far more closely the distribution of independent schools found in the full list of registered schools. Roughly 10% of the primary schools closest to a sample member‘s home were independent, as were 17% of secondary schools. Quintile Very few of the primary and secondary schools closest to sample members‘ homes fall into quintiles 1 (poorest), 2 and 5 (most advantaged). As evident in Table 7.1, the majority of the schools closest to sample members‘ homes are in either quintile 3 or quintile 4 – in both cases a higher proportion than expected given the proportion of quintile 3 and 4 schools found in Gauteng province as a whole. The proportion of schools actually attended by cohort members that fall into quintiles 1-3 is lower than would be expected on the basis of the schools nearest to sample members‘ homes. By contrast, the proportion of attended schools falling into quintiles 4 and 5 is somewhat higher than would be expected. 208 School Quintile Proportion of Gauteng schools Proportion of primary schools closest to sample members’ homes Proportion of secondary schools closest to sample members’ homes Proportion of schools attended by sample members in 1997 Proportion of schools attended by sample members in 2003 1 11.66% 3.08% 2.37% 2.85% 1.46% 2 8.61% 8.31% 11.83% 5.38% 5.84% 3 30.91% 44.62% 35.50% 37.66% 36.50% 4 27.27% 35.08% 37.28% 38.61% 41.36% 5 21.55% 8.92% 13.02% 15.51% 14.84% Table 7.1: Quintiles of schools attended by and nearest to sample members’ homes Section 21 status Just over 86% of public schools in Gauteng province have Section 21 status, while just under 89% of the schools closest to sample members‘ homes have this status. By contrast, around 95% of schools actually attended by sample members having Section 21 status. Enrolment In line with the data presented for all Gauteng, the primary schools closest to the homes of sample members are, on average, substantially smaller (mean: 626; median: 558) than the secondary schools closest to their homes (mean: 971; median: 1049). By contrast, the mean size of the schools attended 1997 is 663, while for primary schools attended in 2003 it rises to 673 – in both cases higher than might be expected on the basis of the schools nearest to sample members‘ homes. The mean size of secondary schools attended by sample members is 958, which is lower than would be expected on the basis of the data on the secondary schools nearest to sample members‘ homes. 209 Percentage of black learners The average proportion of black learners across all Gauteng schools is 73%. The average for the primary schools closest to sample members‘ homes is 83%, and for secondary schools it is 76%. In both cases, the median proportion of black students is almost or exactly 100%, indicating that around or over half of the schools found closest to participants‘ homes are entirely black. In 1997, the average percentage of black learners at the schools attended by sample members was 79%. For primary schools attended in 2003, this figure rises to 81%. By contrast, the corresponding figure for the secondary schools attended in 2003 is 72%. In all cases, these figures are lower than would be expected on the basis of the schools closest to sample member homes. School fees The school fees charged by both the primary and secondary schools closest to sample members‘ homes are substantially lower than those figures for all Gauteng area schools, with the primary schools charging an average of R504, and the secondary schools and average of R790. However, the fees at the schools which sample members actually attended are substantially higher – although with extremely large standard deviations. The primary schools attended in 1997 charging an average of R910 (minimum R0, maximum R9510, median R80). Primary schools attended in 2003 charged an average of R769 (minimum R0, maximum R7000, median R100). Secondary schools attended in 2003 charged an average of R1470 (minimum R0, maximum R9650, median R400). Historical racial status of school 70% of the primary schools closest to participant homes were historically DET schools, as were 63% of secondary schools. The schools actually attended by sample members, however, are substantially less likely to be historically DET schools. In 1997, 56% of schools attended were DET schools, and in 2003, this 210 figure was lower still at 54%. These enrolment patterns suggest that children are tending to avoid DET schools. Matric pass rate Given that the pass rate data attributed to primary schools was imputed on the basis of the performance of the nearest secondary school, it makes sense that the figures for the groups of primary and secondary schools closest to sample members‘ homes will be almost identical. The mean pass rate for these secondary schools is 70%, while the mean imputed pass rate for these primary schools is 68%. The figures for the schools actually attended are quite similar, with the mean pass rate for schools attended in 1997 at 70%. Primary schools attended in 2003 also have a mean pass rate of 70%, while for secondary schools it is 72%. Discussion of unweighted data The data presented thus far suggests that, as would be expected, the schools closest to study sample members‘ homes tend to be slightly more disadvantaged than the average schools in the Gauteng province. The schools actually attended, however, are somewhat more advantaged than this. In the following section, the schools data weighted for attendance is explored to determine whether this pattern still holds. 7.2.2 Weighted data In this section, the focus shifts from using schools to using sample members as the unit of analysis. It will describe the school environments experienced by different proportions of sample members by presenting school data weighted by the number of sample members attending a particular school. Although the 1428 sample members attended a total of 365 different schools in 1997, and 465 different schools in 2003, they were by no means evenly 211 distributed across these schools, as illustrated in Table 7.2 below. In 1997, 132 schools had only one sample member attending, while 11 different schools had 10 or more sample members attending. The single most-attended school had 20 sample members enrolled. The largest proportion of sample members, 14.02%, attended schools with 6 sample members enrolled. By contrast, relatively even proportions of the sample live closest to each of the school in the set of schools closest to any sample member‘s home. In 2003, with the sample members divided between both primary and secondary schools, the distribution of enrolment shifted towards smaller numbers of children at a range of different schools. This trend is particularly evident amongst children attending secondary schools in 2003, with only 7.67% of secondary school children attending schools with ten or more sample members. By contrast, almost 70% of the sample lives closest to one of 57 secondary schools which are also closest to at least 9 other sample members. This probably relates to the typically larger size, and sparser distribution, of secondary schools. This data suggests that as learners move into secondary schooling we should expect to see larger numbers of children enrolled at each of a smaller number of schools. It is interesting to note that the available data, however, appears to reflect an opposite trend. However, this may simply be due to the relatively small numbers of children who have reached secondary school in 2003, and data for subsequent years is needed before any firm conclusions can be drawn. Sample members per school Nearest primary Nearest secondary 1997 attended 2003 attended, primary 2003 attended, secondary % at schools with 1-4 sample members 38.59% (272 schools) 13.24% (119 schools) 40.21% (264 schools) 56.03% (280 schools) 44.19% (110 schools) % at schools with 5-9 sample members 34.37% (76 schools) 17.22% (35 schools) 48.19% (90 schools) 38.31% (65 schools) 48.13% (40 schools) 212 % at schools with over 10 sample members 27.03% (28 schools) 69.54% (57 schools) 11.6% (11 schools) 5.16% (6 schools) 7.67% (5 schools) Table 7.2: Schools attended by sample members (note: schools classified as combined are included in both columns for 2003) Sector The proportion of children living nearest to and attending independent schools is lower than suggested by the unweighted data presented earlier. Overall, in 1997, 7.09% of sample members were attending independent schools, as illustrated in Table 7.3 below. A slightly higher proportion of children had independent schools as their nearest primary (7.60%) and secondary schools (8.23%). The relatively low proportion of sample members attending independent schools may relate to a number of different factors, including the typically smaller size of independent schools, the poor quality of some independent schools in disadvantaged areas, and the relatively low representation in this study sample of the extremely affluent children most likely to attend the more highly performing independent schools. Nearest primary Nearest secondary Attended school 1997 Attended school 03 – primary Attended school 03 – secondary % of sample at independent schools 7.60% 8.23% 7.09% 7.00% 7.67% Table 7.3: Proportion of sample members closest to and attending independent schools Quintiles The proportion of children attending schools in quintiles 1, 2 and 5 is lower than would be expected on the basis of the proportions of schools of each quintile in Gauteng, while the proportion of children in quintile 3 and 4 schools, by contrast, is higher than expected. When contrasting quintile ratings of schools attended with nearest schools (see Table 7.4 below), it is again clear 213 that the proportion of children attending schools in quintiles 1 and 2 is lower than would be expected on the basis of the schools nearest to sample members‘ homes. This time, however, the proportion attending quintile 3 schools is also lower, while the proportions attending schools in quintiles 4 and 5 are higher. School quintile Nearest primary school Nearest secondary school Sample members by quintile, 1997 Sample members by quintile, primary school only 2003 Sample members by quintile, secondary school only 2003 1 3.35% 1.93% 1.43% 1.37% 0.00% 2 6.69% 4.59% 3.84% 3.98% 4.03% 3 50.76% 53.51% 46.47% 42.61% 38.79% 4 35.46% 35.86% 39.68% 42.61% 48.36% 5 3.75% 4.11% 8.58% 9.44% 8.82% Table 7.4: Distribution of sample members across schools by school quintile rating Section 21 Status In both 1997 and 2003, only slightly over 4% of learners were attending schools without Section 21 status (see Table 7.5 below). These figures are lower than the proportion of learners whose nearest primary school did not have Section 21 status (6.82%), and substantially lower than the proportion whose nearest secondary school did not have that status (17.42%). Nearest primary Nearest secondary Attended school 1997 Attended school 03 – primary Attended school 03 – secondary % of sample at schools without Section 21 status 6.82% 17.42% 4.34% 4.53% 4.20% Table 7.5: Proportion of schools without Section 21 status nearest to and attended by sample members Total school enrolment As larger schools enrol more children, it is expected that a higher proportion of our sample members will be attending larger schools. As a result, the school size experienced our average learner should be higher than the size of the 214 average school available to him or her. This is indeed reflected in the data. The mean primary school size experienced by sample members in 1997 was 675, and 720 in 2003, both of which are higher than the average size (663) of attended primary schools. The mean school size experienced by sample members attending secondary school in 2003 is 1060 learners, compared to a mean attended secondary school size of 971. A similar argument applies to the schools nearest to children‘s homes, and this is also reflected in the data. The average sample member‘s closest primary school has just over 647 children, compared to an average of 626 learners for all the primary schools nearest to sample members‘ homes. Similarly, the average sample member‘s nearest secondary school enrols 1016 children, compared to an average school size of 971. As is evident in Table 7.6 below, the mean school size experienced by a sample member in 1997 is a little larger than would be expected on the basis of the mean size of the closest primary school. The figure for those children still in primary school in 2003 is even higher. The mean size of secondary schools attended in 2003 is also a little higher than would be expected on the basis of the mean size of the secondary schools closest to the children‘s homes. Nearest primary school size Nearest secondary school size Attended school 1997 Attended school 03 – primary Attended school 03 – secondary Mean school size 647.17 1016.13 674.75 719.72 1060.35 Median school size 563 1090 641 694 1131.5 Table 7.6: Size of schools nearest to and attended by sample members Proportion Black For the average child in the sample, both the nearest primary and secondary schools were 86% black, while for the median child both schools were 100% black, as illustrated in Table 7.7 below. These figures are higher than would be 215 predicted from the data for all Gauteng schools, as well as from the figures for those schools that sample members actually attended. These figures do, however, make sense, as the majority of sample members lived in majority black areas, and majority black schools were more likely to have multiple sample members enrolled, particularly at the secondary school level. The figures for the schools actually attended in 1997 and in 2003 are substantially lower than those for the nearest schools, suggesting that on average, children are attending schools with a lower proportion of black learners than those nearest to their homes. This difference is particularly marked for those children who have reached secondary school by 2003. However, as this is a non-representative sub-section of the sample, it is not possible to determine whether this larger difference is attributable to their being in secondary school, or to their being more advantaged or academically more promising than their peers still in primary school. Nearest primary Nearest secondary Attended school 1997 Attended school 03 – primary Attended school 03 – secondary Mean % black learners 86.29 85.78 81.98 81.07 77.25 Median % black learners 100 100 100 100 99.75 Table 7.7: Proportion of black learners at schools nearest to and attended by sample members School Fees Table 7.8 below illustrates the fees charged at the primary and secondary schools nearest to, and attended by, the average sample member, for both 1997 and 2003. The fees charged at the schools attended by sample members are higher than would be predicted on the basis of the fees charged by the schools nearest to their homes. This divergence is particularly notable at the secondary school level, although again it is not clear whether this is due to the nature of 216 those children in secondary school in 2003, or whether it simply relates to the fact of their attending secondary schools. Nearest primary Nearest secondary Attended school 1997 Attended school 2003 – primary Attended school 2003 – secondary Mean school fees R255 R358 R564 R654 R812 Median school fees R50 R100 R60 R100 R150 Table 7.8: Fees charged by schools nearest to and attended by sample members Historical racial status of school Table 7.9, below, illustrates that a smaller proportion of sample members attended ex-DET schools than would be predicted on the basis of the schools closest to their homes. The proportion attending DET schools falls between 1997 and 2003, and is lowest for those children attending secondary schools in 2003. Nearest primary Nearest secondary Attended school 1997 Attended school 2003 – primary Attended school 2003 – secondary % of sample at ex-DET schools 76.72% 76.23% 63.01 58.39% 54.50% Table 7.9: Proportion of schools nearest to and attended by sample members that were historically under the DET Matric Pass Rate Once again, the average sample member attends a school with better pass rates than would be predicted on the basis of the school closest to their home (see Table 7.10 below). Also, similarly to other variables explored, this difference becomes more marked at the secondary school level, although, as indicated previously, the reasons for this increase cannot be determined from the available data. 217 Nearest primary Nearest secondary Attended school 1997 Attended school 2003 – primary Attended school 03 – secondary Mean pass rate 64% 64% 66% 67% 70% Median pass rate 63% 63% 66% 66% 69% Table 7.10: Pass rates of schools nearest to and attended by sample members. Discussion of weighted data A few key themes emerge from the data presented here. Firstly, it is clear that across all variables, on average, a sample member attends a school that is more advantaged than would be predicted on the basis of the school nearest to his or her home. Secondly, across all variables, this difference is greater at the secondary school level, although the reasons for this are not clear. Thirdly, those children still attending primary school in 2003 are typically attending schools that more advantaged than those that were attended in 1997. These findings suggest that to the extent that school choice is being used by sample members, it is being used to enhance the educational opportunities available to sample members. 7.3 Which children attend which schools? This section explores the distribution of sample members across schools with different properties on the basis of child, household and community attributes. Tests are conducted using data for both 1997 and 2003. 7.3.1 Child level variables Race Table 7.11, below, shows that schools attended by children of different races vary significantly on the basis of the majority of the school attribute variables examined, with black children typically more likely to be attending lower quality schools. 218 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector n.s. Pr=0.008 (Black children more likely to attend public schools) n.s. Pr=0.013 (Black children more likely to attend public schools) Quintile Pr=0.0000 (Black children more likely to attend lower quintile schools) Pr=0.0000 (Black children more likely to attend lower quintile schools) Pr=0.0000 (Black children more likely to attend lower quintile schools) Pr=0.0000 (Black children more likely to attend lower quintile schools) Section 21 Pr=0.026 (Black children less likely to attend Section 21 schools) n.s. Pr=0.070 (Black children less likely to attend Section 21 schools) Pr=0.004 (Black children more likely to attend Section 21 schools) School size Pr=0.0000 (Black children are more likely to be enrolled in smaller schools) Pr=0.0000 (Black children are more likely to be enrolled in smaller schools) Pr=0.0000 (Black children are more likely to be enrolled in smaller schools) Pr=0.0000 (Black children are more likely to be enrolled in smaller schools) Proportion black Pr=0.0000 (Black children likely to attend schools with a higher proportion of black learners) Pr=0.0000 (Black children likely to attend schools with a higher proportion of black learners) Pr=0.0000 (Black children likely to attend schools with a higher proportion of black learners) Pr=0.0000 (Black children likely to attend schools with a higher proportion of black learners) School fees Pr=0.0000 (Black children more likely to attend schools with lower fees) Pr=0.0000 (Black children more likely to attend schools with lower fees) Pr=0.0000 (Black children more likely to attend schools with lower fees) Pr=0.0000 (Black children more likely to attend schools with lower fees) Historical DET status Pr=0.000 (Black children were more likely to attend a historically DET school) Pr=0.000 (Black children were more likely to attend a historically DET school) Pr=0.000 (Black children were more likely to attend a historically DET school) Pr=0.000 (Black children were more likely to attend a historically DET school) Matric Pass Pr=0.0299 n.s. n.s. n.s. 219 rate (Black children more likely to attend schools with higher pass rates) Table 7.11: Relationship between child race (black and coloured children only) and properties of the school he or she attends Gender As illustrated in Table 7.12 below, only two school properties are significantly related to child gender in 1997: boys are more likely to attend schools with a higher proportion of black learners, as well as schools that are former DET schools. In 2003, a number of significant relationships are evident, but these appear to relate largely to the higher proportion of girls attending secondary school by 2003, and all disappear when school phase is controlled for. The one exception to this pattern is school sector, with girls being significantly more likely to attend independent secondary schools. Overall, there is weak evidence that during the early years of schooling, girls may be more likely to attend more integrated schools than boys, and less likely to attend former DET schools. These relationships do not survive in 2003, when school phase is controlled for. Girls are, however, substantially more likely to have reached secondary school in 2003, which may be obscuring relationships between gender and enrolment patterns at the secondary school level . 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector n.s. n.s. n.s. Pr = 0.030 (Girls more likely to attend independent schools) Quintile n.s. Pr=0.0468 (Girls more likely to attend n.s. n.s. 220 higher quintile schools) Section 21 n.s. n.s. n.s. n.s. School size n.s. Pr=0.0173 (Girls more likely to attend larger schools) n.s. n.s. Proportion black Pr=0.0677 (Girls more likely to attend schools with lower proportion of black learners) Pr=0.0562 (Girls more likely to attend schools with lower proportion of black learners) n.s. n.s. School fees n.s. Pr=0.0293 (Girls more likely to attend schools with higher fees) n.s. n.s. Historical DET status Pr=0.054 (Girls less likely to attend former DET schools) Pr=0.060 (Girls less likely to attend former DET schools) n.s. n.s. Matric Pass rate n.s. n.s. n.s. n.s. Table 7.12: Relationship between child gender and properties of the school he or she attends Age at enrolment The 1997 analysis shows strong evidence for a relationship between the age of school enrolment and the type of school attended, with children starting school at a later age attending more advantaged schools (see Table 7.13 below). This is in line with the finding in Chapter 6 that children who start school later are more likely to live further away from their schools. Given the extremely strong relationship between age at school enrolment and the child‘s phase of schooling in 2003, results for 2003 should be treated with some caution, although they are largely in line with the 1997 results. Overall, however, this data suggests that children who start school at a later age tend to access more highly performing schools. 221 Age at first school enrolment 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.001 (children starting school late less likely to attend public schools) n.s. Pr= 0.067 (children starting school late less likely to attend public schools) Pr=0.000 (children starting school late less likely to attend public schools) Quintile Pr=0.000 (children starting school late more likely to attend quintile 5 schools) Pr=0.011 (children starting school late more likely to attend quintile 5 schools) Pr=0.077 (children starting school late more likely to attend quintile 5 schools) n.s. Section 21 n.s. n.s. n.s. Pr=0.008 (children starting school late less likely to attend Section 21 schools) School size n.s. Pr=0.0000 (children starting school late likely to attend smaller schools) n.s. Pr=0.0621(childr en starting school late likely to attend smaller schools) Proportion black Pr=0.0019 (children starting school late likely to attend schools with a lower proportion of black learners) n.s. Pr=0.0779 (children starting school late likely to attend schools with a lower proportion of black learners) n.s. School fees Pr= 0.0000 (children starting school late likely to Pr=0.0979 (children starting school late likely to attend schools Pr=0.0151 (children starting school late likely to attend schools n.s. 222 attend schools with higher fees) with higher fees) with higher fees) Historical DET status P=0.000 (children starting school late less likely to attend former DET schools) Pr=0.058 (children starting school late less likely to attend former DET schools) n.s. n.s. Matric Pass rate Pr= 0.0007 (children starting school late more likely to attend schools with higher pass rates) n.s. n.s. n.s. Table 7.13: Relationship between child age at first school enrolment and school properties Repetitions The analyses related to this grade repetition should be interpreted somewhat differently to the other variables presented, as repetition seems to be shaped more by the properties of the school a child attends than by the attributes of the child him or herself. In addition, while repetition precedes 2003 schooling choices, it occurs only after 1997 schooling choices have already been made, meaning that it cannot be influencing school choice for that year (although a child‘s academic capability might be influencing choice). However, it is quite likely that school choice in 1997 influences repetition in subsequent years. In addition, grade repetition strongly influences whether or not a child has reached secondary school by 2003. This means that finding for 2003, particularly for the secondary school level, should be treated with some caution. 223 Although the causality behind the relationships documented in Table 7.14, below, is not clear, there are highly significant relationships between grade repetition and most indicators of school quality, other than Section 21 status. All relationships operate in the expected direction, with children who have repeated a grade tending to attend less advantaged and more poorly performing schools than those who have never repeated a grade. The relationship between repetition and school size is likely to be due to the tendency for primary schools in poorer areas to be somewhat smaller than those in more affluent areas, as mentioned previously. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.009 (Grade repeaters more likely to attend public schools) Pr=0.013 (Grade repeaters more likely to attend public schools) Pr=0.012 (Grade repeaters more likely to attend public schools) n.s. Quintile Pr=0.000 (Grade repeaters more likely to attend schools in quintiles 1, 2 or 3) Pr=0.000 (Grade repeaters more likely to attend schools in quintiles 1, 2 or 3) Pr=0.000 (Grade repeaters more likely to attend schools in quintiles 1, 2 or 3) n.s. Section 21 n.s. n.s. n.s. n.s. School size Pr= 0.0000 (Grade repeaters more likely to attend smaller schools) Pr= 0.0000 (Grade repeaters more likely to attend smaller schools) Pr= 0.0284 (Grade repeaters more likely to attend smaller schools) n.s. Proportion black Pr= 0.000 (Grade repeaters more likely to attend a school with a higher proportion of black learners) Pr= 0.000 (Grade repeaters more likely to attend a school with a higher proportion of black learners) Pr= 0.000 (Grade repeaters more likely to attend a school with a higher proportion of black learners) Pr= 0.0208 (Grade repeaters more likely to attend a school with a higher proportion of black learners) 224 School fees Pr= 0.000 (Grade repeaters more likely to attend a school with lower fees) Pr= 0.000 (Grade repeaters more likely to attend a school with lower fees) Pr= 0.000 (Grade repeaters more likely to attend a school with lower fees) n.s. Historical DET status Pr= 0.000 (Grade repeaters more likely to attend a former DET school) Pr= 0.000 (Grade repeaters more likely to attend a former DET school) Pr= 0.000 (Grade repeaters more likely to attend a former DET school) Pr= 0.014 (Grade repeaters more likely to attend a former DET school) Matric Pass rate Pr=0.0149 (Grade repeaters more likely to attend a school with a higher pass rate) Pr=0.0010 (Grade repeaters more likely to attend a school with a higher pass rate) Pr=0.0091 (Grade repeaters more likely to attend a school with a higher pass rate) n.s. Table 7.14: Relationship between grade repetitions between 1997 and 2003, and school properties Child level variables associated with school enrolment patterns: Discussion The data presented here suggests that race remains closely connected to the attributes of the school a child attends, with Black children particularly likely to attend historically disadvantaged schools. The data does not provide strong evidence for any consistent relationship between gender and schooling enrolment patterns, with the exception that girls are substantially more likely than boys to have reached secondary schooling by 2003. It does, however, indicate that children enrolling in school for the first time at a later age are significantly more likely to attend more advantaged schools than children who enrol earlier. Finally, there is also evidence that children who have repeated a grade attend less advantaged schools than children who have never repeated a grade, although with the available data it is not possible to disentangle the 225 causal roles of school quality and a learner‘s inherent academic capabilities in this relationship. 7.3.2 Family & household variables: Maternal education Table 7.15, below, provides evidence for highly significant relationships between maternal education and almost all of the school attributes examined. In all cases where a relationship between a school property and expected school quality exists, children with more educated mothers are more likely to attend higher quality schools. The one exception to this is Section 21 status, which is unrelated to maternal education. Additionally, in 1997 there is no relationship between maternal education and the imputed pass rate of primary schools, but this probably relates to the imputation process used. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr= 0.000 (children of more educated mothers less likely to attend public schools) Pr= 0.000 (children of more educated mothers less likely to attend public schools) Pr= 0.000 (children of more educated mothers less likely to attend public schools) n.s. Quintile Pr=0.0001 (children of more educated mothers likely to attend higher quintile schools) Pr=0.0001 (children of more educated mothers likely to attend higher quintile schools) Pr=0.0001 (children of more educated mothers likely to attend higher quintile schools) Pr=0.0002 (children of more educated mothers likely to attend higher quintile schools) Section 21 n.s. n.s. n.s. n.s. School size n.s. Pr= 0.0490 (children of more educated mothers likely to attend n.s. Pr= 0.0490 (children of more educated mothers likely to attend 226 larger schools) smaller schools) Proportion black Pr=0.0001 (children of more educated mothers likely to attend school with lower proportion of black learners) Pr=0.0001 (children of more educated mothers likely to attend school with lower proportion of black learners) Pr=0.0001 (children of more educated mothers likely to attend school with lower proportion of black learners) Pr=0.0320 (children of more educated mothers likely to attend school with lower proportion of black learners) School fees Pr= 0.0001 (children of more educated mothers likely to attend school with higher fees) Pr= 0.0001 (children of more educated mothers likely to attend school with higher fees) Pr= 0.0001 (children of more educated mothers likely to attend school with higher fees) Pr= 0.0001 (children of more educated mothers likely to attend school with higher fees) Historical DET status Pr= 0.000 (children of more educated mothers less likely to attend a former DET school) Pr= 0.000 (children of more educated mothers less likely to attend a former DET school) Pr= 0.000 (children of more educated mothers less likely to attend a former DET school) Pr= 0.033 (children of more educated mothers less likely to attend a former DET school) Matric Pass rate n.s. Pr=0.0002 (children of more educated mothers likely to attend a school with a higher pass rate) Pr=0.0098 (children of more educated mothers likely to attend a school with a higher pass rate) Pr=0.0015 (children of more educated mothers likely to attend a school with a higher pass rate) Table 7.15: Relationship between maternal education and school properties Maternal marital status As reflected in Table 7.16 below, maternal marital status is significantly related to most school attributes, with the children of married mothers typically being more likely to attend more advantaged schools. 227 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr= 0.001 (children of married mothers less likely to attend a public school) Pr= 0.005 (children of married mothers less likely to attend a public school) Pr= 0.028 (children of married mothers less likely to attend a public school) Pr= 0.094 (children of married mothers less likely to attend a public school) Quintile Pr=0.019 (children of married mothers more likely to attend a quintile 4 or 5 school) Pr=0.000 (children of married mothers more likely to attend a quintile 4 or 5 school) Pr=0.000 (children of married mothers more likely to attend a quintile 4 or 5 school) n.s. Section 21 n.s. n.s. n.s. Pr=0.048 (children of married mothers less likely to attend a Section 21 school) School size Pr=0.0009 (children of married mothers likely to attend larger schools) Pr=0.0179 (children of married mothers likely to attend larger schools) Pr=0.0041 (children of married mothers likely to attend larger schools) n.s. Proportion black Pr=0.0000 (children of married mothers likely to attend schools with a lower proportion of black learners) Pr=0.0000 (children of married mothers likely to attend schools with a lower proportion of black learners) Pr=0.0000 (children of married mothers likely to attend schools with a lower proportion of black learners) Pr=0.0000 (children of married mothers likely to attend schools with a lower proportion of black learners) School fees Pr=0.0000 (children of married mothers likely to attend schools with higher fees) Pr=0.0000 (children of married mothers likely to attend schools with higher fees) Pr=0.0000 (children of married mothers likely to attend schools with higher fees) n.s. 228 Historical DET status Pr=0.0000 (children of married mothers less likely to attend former DET schools) Pr=0.0000 (children of married mothers less likely to attend former DET schools) Pr=0.0000 (children of married mothers less likely to attend former DET schools) Pr=0.0000 (children of married mothers less likely to attend former DET schools) Matric Pass rate Pr=0.0191 (children of married mothers likely to attend schools with higher pass rates) Pr=0.0033 (children of married mothers likely to attend schools with higher pass rates) Pr=0.0315 (children of married mothers likely to attend schools with higher pass rates) Pr=0.0690 (children of married mothers likely to attend schools with higher pass rates) Table 7.16: Relationship between maternal marital status and school properties Household SES: 1997 All relationships were explored using both the derived SES quintiles, as well as the raw SES scores. As the results were largely identical, only the results of the tests based on the quintiles are documented in Table 7.17 below. These results indicate the existence of strongly significant relationships between 1997 household SES, and all indicators of school quality, in both 1997 and 2003, with the exception of Section 21 status. The highly significant nature of all of these relationships is expected given existing evidence for a close relationship between SES and educational opportunity in urban South Africa. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.003 (Higher SES children less likely to attend public schools) Quintile Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) 229 Section 21 n.s. n.s. n.s. n.s. School size Pr=0.0001 (Higher SES children likely to attend larger schools) Pr=0.0026 (Higher SES children likely to attend larger schools) Pr=0.0001 (Higher SES children likely to attend larger schools) n.s. Proportion black Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) School fees Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Historical DET status Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Matric Pass rate Pr=0.0001 (Higher SES children likely to attend a school with higher pass rate) Pr=0.0001 (Higher SES children likely to attend a school with higher pass rate) Pr=0.0190 (Higher SES children likely to attend a school with higher pass rate) Pr=0.0001 (Higher SES children likely to attend a school with higher pass rate) Table 7.17: Relationship between 1997 household SES and school attributes in both 1997 and 2003 Household SES: 2003 Again, tests were conducted using both 2003 SES quintiles and raw scores, and as the results were largely identical, only the results of the tests conducted using the quintiles are presented in Table 7.18 below. These results were extremely similar to those for 1997 SES, which is anticipated as the two SES scores are strongly related. Once again, the only school attribute which was not related to SES was a child‘s school had Section 21 status. 230 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.000 (Higher SES children less likely to attend public schools) Pr=0.015 (Higher SES children less likely to attend public schools) Quintile Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Pr=0.0001 (Higher SES children likely to attend higher quintile schools) Section 21 n.s. n.s. n.s. n.s. School size Pr=0.0001 (Higher SES children likely to attend larger schools) n.s. Pr=0.0148 (Higher SES children likely to attend larger schools) n.s. Proportion black Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Higher SES children likely to attend schools with a lower proportion of black learners) School fees Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Pr=0.0001 (Higher SES children likely to attend a school with higher fees) Historical DET status Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Pr=0.000 (Higher SES children less likely to attend a former DET school) Matric Pass rate Pr=0.0044 (Higher SES children likely to attend a school with higher pass Pr=0.0001 (Higher SES children likely to attend a school with higher pass Pr=0.0015 (Higher SES children likely to attend a school with higher pass Pr=0.0002 (Higher SES children likely to attend a school with higher pass 231 rate) rate) rate) rate) Table 7.18: Relationship between 2003 household SES and school attributes in 1997 and 2003 Household level variables associated with school enrolment patterns: Discussion Significant relationships were identified between each of the household level variables examined, and all school attributes with the exception of Section 21 status. In all cases, relationships operated in the expected direction. Children of more educated mothers and married mothers were more likely to be attending more advantaged schools, as were children living in more advantaged households in either 1997 or 2003. This provides strong evidence that more advantaged children tend to have access to better educational opportunities in contemporary urban South African than their less advantaged peers. 7.3.3 Community level variables: Residential Area Poverty Small Area Level Again, as tests on residential area poverty quintiles and raw scores produced largely identical results, only the results of analyses conducted using the quintiles are presented in Table 7.19 below. Overall the results are fairly clear, with children living in poorer areas more likely to attend less advantaged schools. SAL Poverty 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.000 (Children living in poorer SALs more likely to attend a public school) Pr=0.000 (Children living in poorer SALs more likely to attend a public school) Pr=0.000 (Children living in poorer SALs more likely to attend a public school) n.s. Quintile Pr=0.0001 Pr=0.0001 Pr=0.0001 Pr=0.0001 232 (Children living in poorer SALs likely to attend lower quintile schools) (Children living in poorer SALs likely to attend lower quintile schools) (Children living in poorer SALs likely to attend lower quintile schools) (Children living in poorer SALs likely to attend lower quintile schools) Section 21 n.s. n.s. n.s. Pr=0.092 (Children living in richer SALs less likely to attend Section 21 schools) School size Pr=0.0001 (Children living in richer SALs likely to attend larger schools) Pr=0.0001 (Children living in richer SALs likely to attend larger schools) Pr=0.0001 (Children living in richer SALs likely to attend larger schools) Pr=0.0232 (Children living in richer SALs likely to attend larger schools) Proportion black Pr=0.0001 (Children living in poorer SALs likely to attend schools with a higher proportion of black learners) Pr=0.0001 (Children living in poorer SALs likely to attend schools with a higher proportion of black learners) Pr=0.0001 (Children living in poorer SALs likely to attend schools with a higher proportion of black learners) Pr=0.0001 (Children living in poorer SALs likely to attend schools with a higher proportion of black learners) School fees Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower fees) Historical DET status Pr=0.000 (Children living in poorer SALs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SALs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SALs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SALs more likely to attend former DET schools) Matric Pass rate Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower pass rates) Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower pass rates) Pr=0.0001 (Children living in poorer SALs likely to attend schools with lower pass rates) Pr=0.0012 (Children living in poorer SALs likely to attend schools with lower pass rates) Table 7.19: Relationship between residential SAL poverty and school attributes in 1997 and 2003 233 Sub-place level The results for the analyses based on SP poverty levels (see Table 7.20 below) were similar to those conducted for the SAL poverty levels, with children in poor areas typically attending less advantaged schools. However, for matric pass rate, the relationship shifted from a linear relationship in which children living in more affluent areas attended schools with higher pass rates, to a non- linear relationship in which children in both the wealthiest and poorest areas attended schools with higher pass rates than children in moderate-poverty areas. Potential explanations for this shift are not clear. SP Poverty 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.000 (Children living in poorer SPs more likely to attend a public school) Pr=0.000 (Children living in poorer SPs more likely to attend a public school) Pr=0.000 (Children living in poorer SPs more likely to attend a public school) n.s. Quintile Pr=0.0001 (Children living in poorer SPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer SPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer SPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer SPs likely to attend lower quintile schools) Section 21 Pr=0.003 (Children living mid-range SPs least likely to attend Section 21 schools) Pr=0.000 (Children living mid-range SPs least likely to attend Section 21 schools) Pr=0.007 (Children living mid-range SPs least likely to attend Section 21 schools) Pr=0.000 (Children living in richer SPs least likely to attend Section 21 schools) School size Pr=0.0001 (Children living in richer SPs likely to attend larger schools) Pr=0.0001 (Children living in richer SPs likely to attend larger schools) Pr=0.0001 (Children living in richer SPs likely to attend larger schools) Pr=0.0068 (Children living in richer SPs likely to attend larger schools) Proportion black Pr=0.0001 (Children living in poorer SPs likely to attend schools with a higher proportion of Pr=0.0001 (Children living in poorer SPs likely to attend schools with a higher proportion of Pr=0.0001 (Children living in poorer SPs likely to attend schools with a higher proportion of Pr=0.0001 (Children living in poorer SPs likely to attend schools with a higher proportion of 234 black learners) black learners) black learners) black learners) School fees Pr=0.0001 (Children living in poorer SPs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SPs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SPs likely to attend schools with lower fees) Pr=0.0001 (Children living in poorer SPs likely to attend schools with lower fees) Historical DET status Pr=0.000 (Children living in poorer SPs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SPs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SPs more likely to attend former DET schools) Pr=0.000 (Children living in poorer SPs more likely to attend former DET schools) Matric Pass rate Pr=0.0001 (Children living in mid-range SPs likely to attend schools with lower pass rates) Pr=0.0001 (Children living in mid-range SPs likely to attend schools with lower pass rates) Pr=0.0001 (Children living in mid-range SPs likely to attend schools with lower pass rates) Pr=0.0001 (Children living in mid-range SPs likely to attend schools with lower pass rates) Table 7.20: Relationship between SP poverty and school properties Main Place level As evident in Table 7.21, the relationships between residential MP poverty levels and school attributes remains fairly consistent with those identified at the SAL and SP levels, although the statistical significance of some of these relationships has decreased. Overall, children living in wealthier MPs are more likely to attend more advantaged schools than their peers in poorer MPs. Once again, however, children living in the mid-range MPs likely to attend the schools with the poorest pass rates – although this relationship disappears when only those children who have reached secondary school by 2003 are examined. The reasons for the persistence of this pattern remain unclear. 235 MP Poverty 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Sector Pr=0.061 (Children living in wealthier MPs less likely to attend a public school) Pr=0.062 (Children living in wealthier MPs less likely to attend a public school) Pr=0.010 (Children living in wealthier MPs less likely to attend a public school) n.s. Quintile Pr=0.0001 (Children living in poorer MPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer MPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer MPs likely to attend lower quintile schools) Pr=0.0001 (Children living in poorer MPs likely to attend lower quintile schools) Section 21 Pr=0.002 (Children living in poorer MPs less likely to attend Section 21 schools) n.s. Pr=0.001 (Children living in poorer MPs less likely to attend Section 21 schools) Pr=0.022 (Children living in poorer MPs less likely to attend Section 21 schools) School size Pr=0.0001 (Children living in wealthier MPs likely to attend larger schools) Pr=0.0001 (Children living in wealthier MPs likely to attend larger schools) Pr=0.0002 (Children living in wealthier MPs likely to attend larger schools) n.s. Proportion black Pr=0.0001 (Children living in wealthier MPs likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with a lower proportion of black learners) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with a lower proportion of black learners) School fees Pr=0.0001 (Children living in wealthier MPs likely to attend schools with higher fees) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with higher fees) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with higher fees) Pr=0.0001 (Children living in wealthier MPs likely to attend schools with higher fees) Historical DET status Pr=0.000 (Children living in wealthier SPs less likely to attend Pr=0.000 (Children living in wealthier SPs less likely to attend Pr=0.000 (Children living in wealthier SPs less likely to attend Pr=0.000 (Children living in wealthier SPs less likely to attend 236 former DET schools) former DET schools) former DET schools) former DET schools) Matric Pass rate Pr=0.0001 (Children living in mid-range MPs likely to attend schools with lower pass rates) Pr=0.0028 (Children living in mid-range MPs likely to attend schools with lower pass rates) Pr=0.0008 (Children living in mid-range MPs likely to attend schools with lower pass rates) n.s. Table 7.21: Relationship between MP poverty and school properties Community level variables associated with school enrolment patterns: Discussion At the community level, the relationships between residential SAL poverty and school attributes were clearest, with children living in poorer SALs more likely to attend less advantaged schools. At the SP and MP levels, this pattern held overall, although strange results emerged with respect to the school‘s Section 21 status and matric pass rates. Explanations for these unexpected results are not clear. 7.3.4 Child, household and community level variables associated with school enrolment patterns: Discussion The analyses presented in this section present clear evidence for a relationship between child, household and community attributes associated with advantage, and school-level variables associated with more highly performing schools. Socio-economic status and maternal education appear to be particularly strongly related to attending higher quality schools, while living in high poverty areas is, predictably, inversely associated with attending high quality schools. At the level of the individual child, there is no evidence for a relationship between gender and school quality or other school properties, once schooling phase has been controlled for. However, children who start school late for their age are much more highly represented among those attending more advantaged and highly performing schools. The analyses around Section 21 status given unexpected and often contradictory results, suggesting that 237 there may be reason for concern around the accuracy of this variable. Overall, these results highlight the extent to which access to high quality education in contemporary urban South Africa is determined by a child‘s home circumstances, and the area in which he or she lives. 7.4 Relationships between school attributes and mobility behaviours The final section of this chapter tests for and explores relationships between various school attributes, and the mobility of learners enrolled in those schools. Children attending higher quality schools are expected to be more strongly engaged in mobility. 7.4.1 School sector Table 7.22, below, illustrates the relationships between the sector (public or independent) of the school a child attends, and his or her mobility behaviour. There is strong evidence that children attending independent schools are significantly more mobile than those attending public schools. This holds for all definitions of mobility. School sector 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.000 (Children at independent schools travel significantly further to school than children at public schools) Pr=0.000 (Children at independent schools travel significantly further to school than children at public schools) Pr=0.000 (Children at independent schools travel significantly further to school than children at public schools) Pr=0.000 (Children at independent schools travel significantly further to school than children at public schools) Movement between areas SAL: Pr=0.052 (Children at public schools SAL: n.s. SAL: n.s. SAL: n.s. 238 are more likely to attend school in their home SAL) SP: Pr=0.000 (Children at public schools are more likely to attend school in their home SP) SP: Pr=0.000 (Children at public schools are more likely to attend school in their home SP) SP: Pr=0.000 (Children at public schools are more likely to attend school in their home SP) SP: Pr=0.001 (Children at public schools are more likely to attend school in their home SP) MP: Pr=0.000 (Children at public schools are more likely to attend school in their home MP) MP: Pr=0.000 (Children t public schools are more likely to attend school in their home MP) MP: Pr=0.000 (Children at public schools are more likely to attend school in their home MP) MP: Pr=0.004 (Children at public schools are more likely to attend school in their home MP) Nearest school Pr=0.000 (Children at public schools are more likely to attend their nearest school) Pr=0.000 (Children at public schools are more likely to attend their nearest school) Pr=0.000 (Children at public schools are more likely to attend their nearest school) Pr=0.000 (Children at public schools are more likely to attend their nearest school) Table 7.22: Relationship between sector of attended school and learner mobility 7.4.2 School quintile As indicated by the data in Table 7.23, children attending high quintile schools are more likely to engage in learner mobility than children attending lower quintile schools. However, in some tests, particularly relating to movement between areas, there is evidence that the children attending the very lowest quintile schools are as likely to engage in mobility as children at the attending the highest quintile schools. This has two potential explanations: firstly, it may relate to poor quality data on those children attending the most disadvantaged schools, or secondly, it may be that the children attending the most disadvantaged schools are living in areas with particularly few educational opportunities, requiring them to travel further. The second hypothesis is 239 supported by the fact that higher mobility amongst children attending most disadvantaged schools largely disappears at higher levels of geography. School quintile 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0001 (Children at higher quintile schools are likely to travel further than children at lower quintile schools) Pr=0.0001 (Children at higher quintile schools are likely to travel further than children at lower quintile schools) Pr=0.0001 (Children at higher quintile schools are likely to travel further than children at lower quintile schools) Pr=0.0001 (Children at higher quintile schools are likely to travel further than children at lower quintile schools) Movement between areas SAL: Pr=0.042 (Children at mid-quintile schools are more likely to attend a school in their home SAL) SAL: n.s. SAL: n.s. SAL: n.s. SP: Pr=0.000 (Children at mid-quintile schools are more likely to attend a school in their home SP) SP: Pr=0.000 (Children at mid-quintile schools are more likely to attend a school in their home SP) SP: Pr=0.000 (Children at mid-quintile schools are more likely to attend a school in their home SP) SP: Pr=0.000 (Children at mid-quintile schools are more likely to attend a school in their home SP) MP: Pr=0.000 (Children at low and mid- quintile schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at low and mid- quintile schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at low and mid- quintile schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at low and mid- quintile schools are more likely to attend a school in their home MP) Nearest school Pr=0.000 (Children at low and mid- quintile schools more likely to attend Pr=0.000 (Children at low and mid- quintile schools more likely to attend Pr=0.000 (Children at low and mid- quintile schools more likely to attend Pr=0.000 (Children at low and mid- quintile schools more likely to attend 240 their nearest school) their nearest school) their nearest school) their nearest school) Table 7.23: Relationship between quintile of attended school and learner mobility 7.4.3 Section 21 status The evidence for a relationship between the Section 21 status of a child‘s school and his or her mobility is somewhat weaker, as evident in Table 7.24 below. While there is evidence of a significant relationship between Section 21 status and mobility for all definitions of mobility for those children attending secondary school in 2003, the evidence for primary school children is less clear. There does appear to be a weakly significant relationship between Section 21 status and distance travelled, but there is no evidence that Section 21 status is related to movement between areas or to whether a child attends his or her nearest school. In all cases where there is evidence for a relationship between Section 21 status and mobility, attending a school with Section 21 status is associated with increased mobility. School Section 21 status 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0214 (Children at schools without Section 21 status travel less far to school than children at Section 21 schools) Pr=0.0540 (Children at schools without Section 21 status travel less far to school than children at Section 21 schools) n.s. Pr=0.0716 (Children at schools without Section 21 status travel less far to school than children at Section 21 schools) Movement between areas SAL: n.s. SAL: n.s. SAL: n.s. SAL: n.s. SP: n.s. SP: n.s. SP: n.s. SP: Pr=0.001 (Children at schools without Section 21 status more 241 likely to attend school in their home SP) MP: n.s. MP: n.s. MP: n.s. MP: Pr=0.014 (Children at schools without Section 21 status more likely to attend school in their home MP) Nearest school n.s. n.s. n.s. Pr=0.038 (Children at Section 21 schools less likely to attend their nearest school) Table 7.24: Relationship between Section 21 status of attended school and learner mobility 7.4.4 School Enrolment The direction of the relationship between a child‘s mobility and the size of his or her school is heavily moderated by schooling phase (see Table 7.25 below). At the secondary school level, as distance increases, the size of the school attended tends to fall. By contrast, at the primary level, as distance increases, school size also tends to rise. This is likely to relate to the tendency, described in Chapter 4, for more advantaged areas to have primary and secondary schools that are roughly equivalent in size, while less advantaged areas tend to have a few particularly large secondary schools, and a large number of far smaller primary schools. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0000 (As children travel further to Pr=0.0010 (As children travel further to Pr=0.0000 (As children travel further to Pr=0.0005 (As children travel further to 242 school, school size increases) school, school size increases) school, school size increases) school, school size falls) Movement between areas SAL: n.s. SAL: n.s. SAL: n.s. SAL: n.s. SP: Pr=0.0000 (Children at a school outside of their home SP attend larger schools) SP: n.s. SP: Pr=0.0008 (Children at a school outside of their home SP attend larger schools) SP: Pr=0.0024 (Children at a school outside of their home SP attend larger schools) MP: Pr=0.0000 (Children at a school outside of their home MP attend larger schools) MP: Pr=0.0226 (Children at a school outside of their home MP attend larger schools) MP: Pr=0.0000 (Children at a school outside of their home MP attend larger schools) MP: Pr=0.0008 (Children at a school outside of their home MP attend smaller schools) Nearest school n.s. Pr=0.0000 (Children at their nearest school are likely to be attending a larger school) n.s. Pr=0.0005 (Children at their nearest school are likely to be attending a larger school) Table 7.25: Relationship between size of attended school and learner mobility 7.4.5 Proportion black learners Predictably, given that the majority of sample members live in predominantly black areas, those attending school close to their homes are likely to attend schools with a higher proportion of black learners than those who are travelling to schools further afield (see Table 7.26 below). There are, however, two exceptions to this pattern. Firstly, when looking at primary school children who attend school within the same SAL as their home, they are likely to attend a school with a lower proportion of black learners than those children travelling outside of the SAL in which their home is. This is likely to be due to the extremely small numbers of children attending school in their home SAL, and the earlier finding children 243 living in more affluent areas were more likely to attend school in their home SAL. Secondly, there is no evidence for a relationship between whether or not children attend their nearest school, and the proportion of black learners in that school. This absence of a relationship is likely to be because quite a few of those children not attending their nearest school are still attending a school in the same, relatively disadvantaged area in which they live. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0000 (Children at schools with a lower proportion of black learners travel further to school) Pr=0.0000 (Children at schools with a lower proportion of black learners travel further to school) Pr=0.0000 (Children at schools with a lower proportion of black learners travel further to school) Pr=0.0000 (Children at schools with a lower proportion of black learners travel further to school) Movement between areas SAL: Pr=0.0433 (Children at schools with a lower proportion of black children more likely to attend a school in their home SAL. SAL: n.s. SAL: Pr=0.0133 (Children at schools with a lower proportion of black children more likely to attend a school in their home SAL. SAL: n.s. Pr=0.0000 (Children at schools with a higher proportion of black learners are more likely to attend school in their home SP) Pr=0.0000 (Children at schools with a higher proportion of black learners are more likely to attend school in their home SP) Pr=0.0000 (Children at schools with a higher proportion of black learners are more likely to attend school in their home SP) Pr=0.0031 (Children at schools with a higher proportion of black learners are more likely to attend school in their home SP) Pr=0.0000 (Children at schools with a Pr=0.0000 (Children at schools with a Pr=0.0000 (Children at schools with a Pr=0.0000 (Children at schools with a 244 higher proportion of black learners are more likely to attend school in their home MP) higher proportion of black learners are more likely to attend school in their home MP) higher proportion of black learners are more likely to attend school in their home MP) higher proportion of black learners are more likely to attend school in their home MP) Nearest school n.s. n.s. n.s. n.s. Table 7.26: Relationship between proportion of black students at attended school and learner mobility 7.4.6 School fees Table 7.27 below provides evidence for a significant positive relationship between the fees charged by a child‘s school, and the mobility of that child. One finding, that children attending school inside their home SAL are likely to attend schools with lower fees, slightly inconsistent with the finding that these same children are likely to attend a school with a lower proportion of black learners. One plausible explanation is the nature of the distribution of the values of school fees, with only a very few very high values present. School fees 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0000 (Children at schools with higher fees travel further) Pr=0.0000 (Children at schools with higher fees travel further) Pr=0.0000 (Children at schools with higher fees travel further) Pr=0.0000 (Children at schools with higher fees travel further) Movement between areas SAL: Pr=0.0917 (Children at schools with lower fees are more likely to attend school in their home SAL) SAL: n.s. SAL: n.s. SAL: n.s. SP: Pr=0.0000 (Children at schools with SP: Pr=0.0000 (Children at schools with SP: Pr=0.0000 (Children at schools with SP: Pr=0.0000 (Children at schools with 245 lower fees are more likely to attend school in their home SP) lower fees are more likely to attend school in their home SP) lower fees are more likely to attend school in their home SP) lower fees are more likely to attend school in their home SP) MP: Pr=0.0000 (Children at schools with lower fees are more likely to attend school in their home MP) MP: Pr=0.0000 (Children at schools with lower fees are more likely to attend school in their home MP) MP: Pr=0.0000 (Children at schools with lower fees are more likely to attend school in their home MP) MP: Pr=0.0000 (Children at schools with lower fees are more likely to attend school in their home MP) Nearest school Pr=0.0192 (Children at schools with lower fees are more likely to attend their nearest school) Pr=0.0002 (Children at schools with lower fees are more likely to attend their nearest school) Pr=0.0004 (Children at schools with lower fees are more likely to attend their nearest school) Pr=0.0130 (Children at schools with lower fees are more likely to attend their nearest school) Table 7.27: Relationship between fees of attended school and learner mobility 7.4.7 Historical DET Table 7.28, below, presents evidence for a significant negative relationship between the historical DET status of the school a child attends, and that child‘s mobility. However, this relationship does not hold when mobility is defined at the SAL level, or by attendance at the nearest school. As discussed with reference to the findings on the proportion of black students enrolled at a school, this is likely to relate to the fact that many of the children engaging in these two forms of mobility are still attending schools that are no less disadvantaged than the school nearest to their home, or within their home SAL. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0000 (Children at former DET schools travel Pr=0.0000 (Children at former DET schools travel Pr=0.0000 (Children at former DET schools travel Pr=0.0000 (Children at former DET schools travel 246 less far) less far) less far) less far) Movement between areas SAL: n.s. SAL: n.s. SAL: n.s. SAL: n.s. SP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home SP) SP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home SP) SP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home SP) SP: Pr=0.007 (Children at former DET schools are more likely to attend a school in their home SP) MP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home MP) MP: Pr=0.000 (Children at former DET schools are more likely to attend a school in their home MP) Nearest school n.s. n.s. n.s. n.s. Table 7.28: Relationship between historical DET status of attended school and learner mobility 7.4.8 Matric pass rate There is strong evidence for a positive relationship between mobility and school performance, as documented in Table 7.29 below. 1997 2003 (all data) 2003 (primary schools only) 2003 (secondary schools only) Distance travelled Pr=0.0000 (Children at schools with higher pass rates travel further) Pr=0.0000 (Children at schools with higher pass rates travel further) Pr=0.0000 (Children at schools with higher pass rates travel further) Pr=0.0000 (Children at schools with higher pass rates travel further) Movement between areas SAL: Pr=0.0033 (Children at schools with higher pass rates are less likely to attend a school in SAL: Pr=0.0216 (Children at schools with higher pass rates are less likely to attend a school in SAL: n.s. SAL: Pr=0.0046 (Children at schools with higher pass rates are less likely to attend a school in 247 their home SAL) their home SAL) their home SAL) SP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home SP) SP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home SP) SP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home SP) SP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home SP) MP: Pr=0.0010 (Children at schools with higher pass rates are less likely to attend a school in their home MP) MP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home MP) MP: Pr=0.0048 (Children at schools with higher pass rates are less likely to attend a school in their home MP) MP: Pr=0.0000 (Children at schools with higher pass rates are less likely to attend a school in their home MP) Nearest school Pr=0.0047 (Children at schools with higher pass rates are less likely to be attending their nearest school) Pr=0.0004 (Children at schools with higher pass rates are less likely to be attending their nearest school) Pr=0.0811 (Children at schools with higher pass rates are less likely to be attending their nearest school) Pr=0.0002 (Children at schools with higher pass rates are less likely to be attending their nearest school) Table 7.29: Relationship between quintile of attended school and learner mobility 7.4.9 Relationships between school attributes and mobility behaviour: Discussion Overall, the results presented in this section provide support for the hypothesis that learner mobility is related to the pursuit of higher quality education. Using a number of different school variables associated with educational quality, and a number of different approaches to the measurement of learner mobility, the analyses described above found positive, largely consistent and highly significant relationships between engagement in learner mobility, and the quality of the school a child attends. The one definition of mobility which did produce inconsistent results was that based on whether or not a child attended 248 school in the same SAL in which he or she lived. One possible explanation is simply that SAL mobility is not a good measure of learner mobility, as it does not differentiate sufficiently between children. This is supported by the very small numbers of children who do attend school in the SAL in which they live. 7.5 Conclusion This chapter began by contrasting those schools closest to the homes of study sample members with the schools actually attended by study sample members. This provided clear evidence that, on average, the schools that children attend are somewhat more advantaged than the schools that are closest to their homes. This suggests that children and families are actively engaging in school choice and mobility to pursue higher quality educational opportunities than would otherwise be accessible to them. It does, however, also raise questions around who is attending the most poorly performing of schools. One possibility is that these are predominantly filled by in-migrants from other areas. Alternatively, these schools may simply be extremely under-enrolled. Secondly, the chapter explored the distribution of children with different individual, household and community attributes across schools. This provided strong evidence that more advantaged children typically attend more advantaged schools. This suggests that even though parents and children are able to use learner mobility as a tool to access higher quality educational opportunities, even these enhanced educational opportunities remain strongly related to the affluence of a family. This ties in well with the hypothesis that two different forms of mobility exist in Johannesburg-Soweto, and that these require the investment of different levels of resources, but also providing access to different levels of schooling quality. The more resource intensive form of mobility requires substantial investment in travel, school fees and associated costs, but allows children to access historically advantaged schools. Given the resource requirements associated with this form of mobility, only 249 children from relatively advantaged families are able to engage in it. The second form of mobility, which requires less in the way of resources and is therefore open to a wider group of children, is the one in which choices are made between a number of fairly local schools. While this may still mean that children attend a better school than the school closest to their home, it does not give them access to the most advantaged schools of all. Finally, in testing the relationships between learner mobility and the attributes of the school a child attends, the chapter has also provided evidence that learner mobility is associated with enrolment at those schools expected to provide higher educational quality. This again suggests that children and families are using mobility, in at least two different forms, to gain access to higher quality educational opportunities than would otherwise be accessible to them. 250 Chapter 8: Changes in educational mobility over time 8.1 Introduction This chapter documents changes in the educational mobility of children over time. It begins by looking at those sample members who have changed schools between 1997 and 2003. School change is a prerequisite to mobility change in this sample of children with constant residential addresses. Correlates of school change at the individual, household, community and school levels are documented. Tests are also conducted to determine whether children who change schools behave differently with regards to mobility than those who do not change schools. Secondly, the chapter explores the nature of the changes in mobility resulting from school change. These results are then disaggregated by whether the school change is a change between two primary schools, or a change from a primary to a high school. The implications of school change and changes in mobility over time are then discussed. 8.2 Changing schools Given that residential addresses for the study sample are by definition constant over the period under examination, if children are enrolled in the same school in both 1997 and 2003, their mobility will also be constant. When children move between schools, however, their mobility will change. This means that school change provides a window onto mobility change. Of the 1210 children for whom full schooling information is available in both 1997 and 2003, 373 (30.93%) were attending the same school at both points in time, whereas 833 (69.07%) were attending different schools (see Table 8.1 below). Of course, a number of these school changes relate to the 433 children 251 who have moved from primary school to high school by 2003. When only children still in primary school by 2003 are examined, 357 (45.5%) are enrolled in the same school at both points in time, while 427 (54.39%) have changed schools. While this is a lower level of change than that found in the full study sample, it indicates that the majority of children who did not move between schooling phases still changed schools at least once between 1997 and 2003. Remained in same school from 1997-2003 Changed schools between 1997 and 2003 All children with full schooling data (n=1210) 373 (30.93%) 833 (69.07%) Children in primary school in 2003 (n=781) 357 (45.71%) 424 (54.39%) Children in high school in 2003 (n=415) 14 (3.37%) 401 (96.63%) Table 8.1: Proportion of sample members remaining in the same school from 1997 to 2003, and proportion changing schools (note that information on schooling phase is only available for 1196 individuals) 8.2.1 Correlates of school change during primary schooling Individual, household and community correlates A range of tests was conducted to determine whether, amongst the group of children still enrolled in primary school in 2003, those who had changed schools at least once differed systematically from those who had never changed schools. A chi-square test indicates that black children were significantly more likely to change schools than coloured children (χ2(1)= 17.9177, Pr=0.000). There was, however, no evidence for a relationship between school changing and gender, age at first enrolment, or grade repetition. Maternal education was also significantly related to school change (χ2(4)=13.6864, Pr=0.008), with children of mothers with some secondary schooling the most likely to change schools. There was no evidence for a 252 relationship between school change and maternal marital status, or, more surprisingly, household SES in either 1997 or 2003, or change in household SES between 1997 and 2003. There was, however, strong evidence for a relationship between school change and the poverty of the area in which a child lives, whether measured as the SAL (χ2(4)=25.7324, Pr=0.000), SP (χ 2 (4)=29.4967, Pr=0.000) or MP (χ2(2)=18.0766, Pr=0.000) level. At the SAL level, children living in poorer areas more likely to change schools. At the SP and MP levels, by contrast, while children in poor areas are still more likely to change schools than children in affluent areas, it is the children living in mid-level areas who are most likely to change schools. Overall, contrary to expectations, the data does not suggest that disadvantaged children are any more likely to change schools during primary schooling than their more advantaged peers. In fact, children with more poorly educated mothers appear to be somewhat protected against school change during this period. School level correlates With the exception of the few children enrolled in quintile 1 schools (the least advantaged schools), who are very unlikely to change schools during primary schooling, there is a negative relationship between school quintile and school change, with school change decreasing as quintile increases, in both 1997 (χ2(4)=18.3413, Pr=0.001) and 2003 (χ 2 (4)=8.5163, Pr=0.074). There is no evidence for a relationship between school change and Section 21 status in 1997, but a weakly significant relationship is found in 2003 (χ2(1)=3.1192, Pr=0.077). Wilcoxon rank-sum tests indicate that children attending smaller schools in 1997 (Pr=0.0000) or in 2003 (Pr=0.0043) are more likely to change schools. 253 Children attending schools with a higher proportion of black learners in 1997 (Pr=0.0000) or 2003 (Pr=0.0000) are also more likely to change schools. There is also a significant positive relationship between school change and the school fees in 1997 (Pr=0.0002) and 2003 (Pr=0.0000). Children attending former DET schools in either 1997 (χ2(1)=42.5454, Pr=0.000) or 2003 (χ 2 (1)=19.6727, Pr=0.000) were also significantly more likely to change schools. Finally, children attending schools with lower imputed pass rates in 1997 (Pr=0.0713) or 2003 (Pr=0.0185) were also more likely to change schools. Interestingly, all of the school properties associated with increased levels of school change are also typically associated with schools located in township areas. It appears, therefore, that a large proportion of school change occurring during the primary school years is between primary schools located in the townships. As relationships in 2003 are typically weaker than in 1997, this suggests that to the extent that children are changing between schools in different areas, they are tending to leave township schools in favour of schools in other, presumably more advantaged areas. Mobility related correlates Mobility in 1997 Amongst children who remained in primary school throughout the study period, school-changers travelled average marginally further than non- changers in 1997, although this difference was not statistically significant. There was no evidence for a relationship between school change and whether he or she attended a school in the same SAL, SP or MP as his or her home in 1997. Finally, there was also no evidence for a relationship between school change and whether a child attended his or her nearest school in 1997, regardless of whether private schools were included in the analysis. Overall, therefore, data about a child‘s mobility in 1997 is unlikely to serve as a useful predictor of whether he or she is likely to change primary schools between 1997 and 2003. 254 Mobility in 2003 A Wilcoxon rank-sum test indicated that there was no significant relationship between school change and distance from home to school in 2003. Children attending a school outside of their home SAL in 2003 were, however, significantly more likely to have changed schools (χ2(1)=14.5194, Pr=0.000). This relationship is inverted at the SP level (χ2(1)=3.714, Pr=0.054), with children attending a school in their home SP more likely to have changed schools. There is no evidence for a relationship between children attending schools in their home MP and school change. Finally, children who change schools are significantly less likely to be attending their nearest school in 2003, regardless of whether only public schools (χ2(1)=8.0904, Pr=0.004), or both public and private schools (χ2(1)=10.0275, Pr=0.002) are considered. Overall, while a child‘s mobility in 2003 is a better predictor of school change than mobility in 1997, there is still not much evidence for a relationship between school change and mobility. The strongest finding is that children who attended their nearest school in 2003 were less likely to change schools, which is consistent with earlier findings that the children most likely to attend their nearest schools were those living in the affluent areas, and who would therefore be unlikely to have much incentive to change schools. Discussion Black children, and children with mothers with some secondary schooling appear to be the most likely to change school during the primary schooling period. Children living in areas with high or intermediate poverty levels are also more likely to change schools than their peers living in more affluent areas. While children changing schools seem to be more likely to attend township schools, there is little evidence to suggest that their patterns of mobility in either 1997 or 2003 differ from children who do not change 255 schools. Overall, change between primary schools does not seem to be strongly related to mobility. 8.2.2 Correlates of school change associated with the transition to high school The transition to high school typically requires that a child change schools. The correlates of school change associated with the transition to high school will therefore simply echo the correlates of high school status itself. Furthermore, as school change and transition to high school are not independent events for this group of children, but are different aspects of the same phenomenon, it does not make sense to try to determine the role of each, independently, in shaping mobility. Rather, it is appropriate to see both the transition to grade 8 and the school change that typically accompanies it as a single event. The data detailing the correlates of reaching high school by 2003, presented in Appendix C, Chapter 5 and Chapter 7, is briefly reviewed in the following sections. Individual, household and community correlates As documented in Appendix C, children who have reached high school by 2003 differ systematically from children still in primary school at this point. They are more likely to be coloured, to be girls, and to have started school early, and less likely to have ever repeated a grade. Their mothers are likely to be more highly educated, with an attainment of at least Grade 11, and they are more likely to have lived in comparatively advantaged homes in 1997. There is, however, no evidence that they differ from their peers who are still in primary school in 2003 with regards to the poverty levels of the areas in which they live. 256 School level correlates As documented in Chapter 7, secondary schools attended by study sample members differ systematically from the primary schools attended. Secondary schools are more likely to be in quintile 4, and less likely to be in quintile 3 or quintile 5 than those attended at the primary level. They also tend to have a substantially larger number of learners enrolled, and a slightly lower proportion of black learners. Fees are higher, while the schools are slightly less likely to be former DET schools, and have slightly higher pass rates. Mobility related correlates Mobility in 1997 As detailed in Chapter 5, there is no evidence that group of children who have reached high school in 2003 differ from their peers with regards to the distance from home to school in 1997, or their likelihood of attending the school closest to their home in 1997. Those children who transition to high school by 2003 were, however, more likely to attend a school in the same SAL or SP as their home in 1997. At the MP and MN levels, however, no differences are evident. Mobility in 2003 Children at the high school level in 2003 do, however, live significantly further from their schools than their peers still at primary school level do. However, there is no evidence that patterns of travel between different areas, as defined by census geography, change with schooling phase. Children at the high school level, however, are more likely to be attending their nearest school. These changes are thought to be related to the different density distributions of primary and high schools in Johannesburg-Soweto. Discussion As is clear from the data reviewed above, children who have reached high school by 2003 differ from their peers who have not, with regards to a number of variables associated with mobility, as well as differing with regards to 257 certain aspects of mobility itself. School changes associated with the transition to high school seem to be more strongly related to changes in mobility than school changes associated with movement between two primary schools. However, this appears to be due primarily to the different distributions of primary and high schools in urban Gauteng province. 8.3 The nature of changes in mobility 8.3.1 Straight-line distance Of the 1210 children with full schooling data for 1997 and 2003, 1177 also have full residential data, allowing the changes in their mobility over time to be calculated. These children are distributed across primary and high schools as illustrated in Table 8.2 below. Just over two thirds of them experience a change in distance from home to school between 1997 and 2003. No change in mobility between 1997 and 2003 Change in mobility between 1997 and 2003 All children with full schooling and residential data (n=1177) n=368 (31.27%) n=809 (68.73%) Children in primary school in 2003 (n=763) n=353 (46.26%) n=410 (53.74%) Children in high school in 2003 (n=404) n=13 (3.21%) n=391 (96.78%) Table 8.2: Distribution of sample members with full residential and schooling data for both 1997 and 2003 across schooling phases, and stability of mobility behaviour There is a small overall decrease in distance from home to school between 1997 and 2003 (see Table 8.3 below). This, however, obscures a very broad distribution of changes in distance, and the 31% of children who experience no change in distance at all. When only those children who have experienced a change in mobility are examined (n=807), the mean decrease in travel distance becomes larger, and the distribution as a whole becomes more spread out. The 258 distribution of distance change for both the full sample and for school-changers only is illustrated in Figures 8.1 and 8.2 below. It is evident from these figures that although changes in distance range from extreme values at each end (both very large increases and decreases), the majority of the sample is concentrated around very moderate levels of change, even once all zero values have been removed. Mean change in distance from 1997 to 2003 Standard Deviation 25% Median 75% Full Sample (n=1177) -0.167km 9.723km -0.178km 0.000km 0.530km School changers only (n=807) -0.244km 11.743km -0.754km 0.121km 1.111km Children moving between primary and high school only (n=404) 0.543km 12.439km -0.488km 0.137km 1.420km Children moving between primary and high school and changing schools only (n=390) 0.560km 12.660km -0.529km 0.211km 1.519km All primary school children (n=763) -0.568km 7.951km 0.000km 0.000km 0.181km Children moving between different primary schools (n=409) -1.060km 10.841km -1.314km 0.070km 0.795km Table 8.3: Changes in distance from home to school between 1997 for all sample members, disaggregated by schooling phase and school change status 259 Figure 8.1: Kernel density plot of the change in distance from home to school between 1997 and 2003 for all sample members with full mobility information Figure 8.2: Kernel density plot of the change in distance from home to school between 1997 and 2003 for all sample members who changed schools 260 When change in distance is disaggregated by whether it is due to transition from primary to high school, or to movement between two primary schools, both distributions remain centred around zero, but otherwise become quite distinct (see Table 8.3 above). The mean change in distance for children moving between primary schools remains negative, but the mean change in distance for children moving to a high school becomes positive, at just over half a kilometre. A Wilcoxon rank-sum test indicates that children moving between primary schools, and those moving from primary to high schools, represent different populations in terms of the distribution of change in distance from home to school (Pr=0.0211). It is evident however, that both of these distributions, and particularly their means, are quite heavily influenced by particularly large values – as can be seen from the values of the 25th and 75th percentiles. For this reason, it is also useful to examine counts of children travelling higher and lower distances in 2003 than in 1997 (Table 8.4 below). These counts indicate that in both the group of children moving between primary schools, and those transitioning to high school, roughly the same proportions are travelling further in 2003. Similarly, the proportions of children travelling less far in the two groups are also much the same. A chi-square test confirms that there is no evidence that the distribution of children travelling both further and less far differs for these two groups. Given that the Wilcoxon test referred to earlier found a significant difference between these groups with regards to change in distance experienced, but the proportion of children increasing and decreasing their travel distance in both groups is the same, the difference between these two groups must be found in the extent of the increase or decrease in distance travelled. As evident in Figure 8.3 below, amongst children who travel further in 2003, the children moving between primary schools are far more concentrated around very small increases than the children moving from primary to high school. By contrast, 261 when looking at those children travelling less far in 2003, this time it is the children moving from primary to high school that are far more concentrated around the very low decreases (see Figure 8.4 below). Wilcoxon rank-sum tests confirm that for both the group travelling further (Pr=0.0146), and those travelling less far (Pr=0.0373), the distributions of the change in distance are significantly different. Overall, this evidence suggests that although both groups of children are approximately equally likely to travel either further or less far, the average decrease experienced by primary school children is larger. By contrast, the average increase in distance experience by high school children is larger. Mean change in distance from 1997 to 2003 Number (%) travelling further in 2003 Number (%) travelling less far in 2003 School changers only (n=807) (1 child changed between two schools located in the same place) -0.244km 448 (55.51%) 358 (44.36%) Children moving between different primary schools (n=409) -1.060km 221 (54.03%) 187 (45.72%) Children moving between primary and high school only (n=390) 0.560km 223 (56.92%) 168 (43.08%) Table 8.4: Counts of sample members who live closer to or further from their school in 2003 as compared to 1997, disaggregated by phase of schooling 262 Figure 8.3: Distribution of change in distance from home to school for children travelling less far in 2003, by schooling phase Figure 8.4: Distribution of change in distance from home to school for children travelling further in 2003, by schooling phase 263 Categories of distance Another way to think about changes in mobility over time is to group children according to their distance between home and school at each point in time, and explore the numbers who remain in the same category, and those who change. Particularly salient categories are those travelling under 2.5km – that is, those who are able to walk to school; those travelling 2.5 to 5km – these can be thought of as children attending ‗local schools‘; and those travelling over 5km – that is, those attending non-local schools. Table 8.5, below, shows that the proportion of children found in the various distance categories is fairly consistent over time. Group Number (%) travelling less than 2.5km Number (%) travelling 2.5- 5km Number (%) travelling over 5km Time point 1997 2003 Both 1997 2003 Both 1997 2003 Both Full Sample 1997: n=1214 2003: n=1281 Both: n=1177 796 (65.57%) 823 (64.25%) 655 (55.65%) 83 (6.84%) 87 (6.79%) 40 (3.40%) 335 (27.59%) 371 (28.96%) 244 (20.73%) School changers only 1997: n=810 2003: n=812 Both: n=807 539 (66.54%) 529 (65.15%) 424 (53.54%) 51 (6.30%) 50 (6.16%) 12 (1.49%) 220 (27.16%) 233 (28.69%) 133 (16.48%) Children moving between two primary schools 1997: n=410 2003: n=413 both: n=409 263 (64.15%) 272 (65.86%) 207 (50.61%) 29 (7.07%) 25 (6.05%) 6 (1.47%) 118 (28.78%) 116 (28.09%) 68 (16.63%) Children moving between primary and high school 1997: n=392 2003: n=391 both: n=390 269 (68.62%) 250 (63.94%) 210 (53.85%) 21 (5.36%) 25 (6.39%) 6 (1.54%) 102 (26.02%) 116 (29.67%) 65 (16.67%) Table 8.5: Children attending schools in the same categories of distance from their homes in 1997 and 2003, disaggregated by schooling phase 264 However, there is less consistency over the actual individuals who are found in each category over time. Chi-square tests conducted on all children who changed schools (χ2(4)= 185.9723, Pr=0.000), those moving between different primary schools (χ2(4)=83.7371, Pr=0.000), and those moving between primary and high schools (χ2(4)=103.0342, Pr = 0.000), all found highly significant differences in distribution across these mobility categories between the two points in time. Amongst children who changed schools between 1997 and 2003, of the 644 children who were travelling less than 2.5km at any point in time, 424, or 65.84%, were travelling less than 2.5km at both points in time. Of the 89 children travelling between 2.5 and 5km at either point in time, 12, or 13.48%, were found in this distance category at both points in time. Finally, of the 320 children travelling over 5km at either point in time, 133, or 41.56%, fell into this category for both points in time. This data suggests that while the overall proportion of children found in the different distance categories is fairly constant over time, a fairly high proportion of individuals are actually moving between categories. There is no evidence of a significant difference in movement between distance categories for those children moving between primary schools, and those transitioning to high school. This is particularly interesting, as a proportion of children moving to high school are expected to be doing so because they have completed their primary schooling. By contrast, change during primary schooling is more likely to be related to dissatisfaction with current schooling arrangements, providing more of a reason for a child to change the distance he or she travels. Nonetheless, both groups seem to be equally likely to move between different categories. Overall, just under 56% of the sample falls into the 0-2.5km at both points in time. This means that 44% of the children in the sample are attending a school over 2.5km away from their home during some part of their schooling. 20% of 265 the sample attends schools over 5km away from their home at both points in time, meaning that a full fifth of the sample is consistently educated over 5kms away from their home. Clearly the assumption that children attend schools within walking distance of their homes does not hold for a fairly sizeable proportion of the current sample at various points in their schooling, and fairly consistently for almost 25% of the sample who never attend a school within a 2.5km radius of their homes. 8.3.2 Census geography This section explores changes in mobility over time with regards to whether a child‘s school falls into the same area as his or her home. Table 8.6, below, illustrates the numbers of children experiencing mobility changes between 1997 and 2003, when this approach is used. No. (%) attending schools in different SALs in 1997 & 2003 No. (%) attending schools in different SPs in 1997 & 2003 No. (%) attending schools in different MPs in 1997 & 2003 Full Sample with school location data (n=1206) 782 (64.84%) 561 (46.52%) 244 (20.23%) School changers only (n=831) 779 (93.74%) 560 (67.39%) 244 (29.36%) Children moving between two primary schools (n=423) 398 (94.09%) 282 (66.67%) 128 (30.26%) Children moving between primary and high school (n=400) 373 (93.25%) 275 (68.75%) 116 (29.00%) Table 8.6: Numbers of children moving between schools in different geographical areas, disaggregated by schooling phase What is particularly striking about Table 8.6 is that the proportion of children experiencing changes in the area in which they attend school again appears to be very similar for children moving between primary schools, and those transitioning to secondary schools. Chi-square tests find no significant 266 difference in the distribution of mobility at the SAL, SP or MP level on the basis of whether a child is moving between primary schools or to a high school. Exploring the changes in distances travelled for each of these groups of mobile children, however, reveals interesting results (see Table 8.7 below). Primary school children who are moving between schools in two different geographical areas appear to be doing so primarily by decreasing their distance travelled, while high school children appear to be doing so by increasing their distance travelled. This is particularly significant for the SP level. It may be that any relationship at the MP level is being partially obscured by the relatively small numbers moving between different MPs, as well as the typically larger distance that needs to be travelled to bring someone across an MP boundary. Mean change in distance Median change in distance No. (%) travelling further in 2003 χ2 for children travelling further Children changing from school in one SP to another High school (n=267) 0.830km 0.584km 163 (61.05%) χ2(1)=7.2998 Pr = 0.007 Primary schools (n=269) -1.683km -0.716km 133 (49.44%) Children changing from school in one MP to another High school (n=110) 0.820km 5.329km 66 (60.00%) χ2(1)=3.1536 Pr = 0.076 Primary schools (n=122) -2.814km -1.096km 59 (48.36%) Table 8.7: Changes in distance from home to school experienced by children moving between schools in different geographic areas, disaggregated by schooling phase (Note: Sample size is not the same as in Table 8.6 as only children for whom full schooling and residential data is available are included in Table 8.7) The final question explored in this section is whether there is change over time with regards to the proportions of children attending school in the same area in which they live, and whether these are the same children in 1997 and 2003. 267 Table 8.8, below, shows the numbers of children attending schools in the same area in which they live. At the SAL level, very few children attend school in the same area as their home at either point in time, and the proportion of children who change schools but continue to attend school in the same SAL in which they live is extremely small. Given the small size of the SALs, this is unsurprising. At the SP level, a higher proportion of children, roughly a quarter of the sample, are found attending school in the same SP in which they live at both points of time, even when only those children who have changed schools are considered. Finally, at the MP level, over half the sample is attending school in the MP in which they live in both 1997 and 2003. Chi-square tests indicate that the differences in the distributions of sample members attending school in their home SP or MP is highly significant, regardless of the group considered. Results at the SAL level are less clear, which is probably related to the extremely small numbers of children attending school in their home SAL. Number (%) attending school in same SAL as home Number (%) attending school in same SP as home Number (%) attending school in same MP as home Time point 1997 2003 Both 1997 2003 Both 1997 2003 Both Full Sample (n=1180) 80 (6.78%) 47 (3.98%) 33 (2.80 %) 479 (40.59%) 437 (37.03%) 335 (28.39%) 851 (72.12%) 829 (70.25%) 748 (63.39%) School changers only (n=810) 56 (6.91%) 23 (2.84%) 9 (1.11 %) 350 (43.21%) 309 (38.15%) 207 (25.56%) 585 (72.22%) 563 (69.51%) 482 (59.51%) Children moving between primary schools (n=411) 20 (4.87%) 5 (1.22%) 1 (0.24 %) 165 (40.15%) 168 (40.88%) 105 (25.55%) 286 (65.59%) 289 (70.32%) 239 (58.15%) 268 Children moving between primary and high school (n=391) 36 (9.21%) 18 (4.60%) 8 (2.05 %) 180 (46.04%) 137 (35.04%) 98 (25.06%) 292 (74.68%) 267 (68.29%) 236 (60.36%) Table 8.8: Children attending school in the same area as their home, for three different levels of geography, disaggregated by schooling phase In line with earlier findings, there is again little evidence for an impact of schooling phase on the proportions of children attending schools within the area in which their home is located. However, there is evidence that the members of these groups change quite substantially over time, particularly at smaller levels of geography. The finding that only about a quarter of the sample is educated within the SP in which they live at both points in time is particularly clear evidence for the limited proportion of children who are consistently educated at local schools. Similarly, less than two thirds of children attend schools within their home MP at both points in time. 8.3.3 Nearest school A final approach to understanding changes in mobility is to look at the proportion of children attending their nearest school in 1997 and 2003. Table 8.9, below, shows the numbers of sample members attending their nearest public school in 1997, in 2003, and at both points in time. Although results are not presented below, the same tests were conducted including private schools, and in all instances produced extremely similar results. For all time points, as has been discussed previously, the numbers of children attending their nearest public school are extremely low, and the numbers who attended their nearest schools at both points in time are lower still. 269 No. (%) attending nearest school in 1997 No. (%) attending nearest school in 2003 No. (%) attending nearest school in 1997 & 2003 Chi-squared test for similarity in 1997 and 2003 Full Sample (n=1178) 211 (17.91%) 214 (18.17%) 100 (8.49%) χ2(1)= 147.6961 Pr=0.000 School changers only (n=808) 136 (16.83%) 141 (17.45%) 28 (3.47%) χ2(1)= 1.1176 Pr=0.290 Children moving between primary schools (n=414) 67 (16.18%) 51 (12.32%) 0 (0.00%) χ2(1)=11.2308 Pr=0.001 Children moving between primary and high school (n=394) 69 (17.51%) 90 (22.84%) 28 (7.11%) χ2(1)= 14.9315 Pr=0.000 Table 8.9: Children attending their nearest grade-appropriate public school in 1997, 2003 and at both points, disaggregated by schooling phase When only children moving between two primary schools are considered, there is a highly significant decrease in the proportion of children attending their nearest school between 1997 and 2003. By contrast, amongst children transitioning to secondary school, there is a significant increase in the proportion attending their nearest school. There are several potential explanations for this pattern. Firstly, obviously, if a child changes schools, but not home location or phase of schooling, he or she cannot possibly attend the nearest school at both points in time. By contrast, if the child changes schooling phase, he or she is able to attend the nearest school at both points in time. This is one reason to expect a higher proportion of children attending their nearest school to be found amongst those transitioning to high school. Secondly, as mentioned previously, high schools tend to be larger, and more widely spaced, than primary schools. As a result, a child choosing between high schools has a smaller number of options available to him or her. The chances of attending the nearest school are therefore higher. The different sizes and distributions of primary and high schools also explains why the finding that children transitioning to high school are simultaneously 270 more likely to experience an increase in travel distance and also increased likelihood of attending their nearest school, is not inconsistent. Of the 354 children in the full sample who attended their nearest school at either point in time, 100, or 28.25%, attended their nearest school at both time points. Of the 250 school changers who attended their nearest school at either point in time, 28, or 11.20%, attended their nearest school at both points in time. Finally, of the 131 children transitioning to high school who attended their nearest school at either point in time, 28, or 21.37%, attended their nearest school at both points in time. These figures echo the findings presented earlier in this chapter that children who change school, whether at the primary or high school level, are less likely to be attending their nearest school in 2003 than children who do not change school. 8.4 Correlates of type of mobility change for primary school school-changers The previous section has documented the differences between mobility changes related to the transition to high school, and those related to children moving between two primary schools. This section now focuses on those children changing between primary schools, and explores the correlates associated with different patterns of mobility change. Results presented below are based on the 409 sample members known to have changed primary schools between 1997 and 2003. Of course, given that we only have primary school change data for the non-random sub-group of the study sample who have not progressed to high school by 2003, it is not appropriate to assume that findings will be representative of all children who move between schools during their primary schooling. However, findings will provide a preliminary idea of correlates of changing mobility, which will assist with hypothesis and theory development. 271 8.4.1 Straight-line distance Individual level variables There was no evidence for any relationship between race or gender and the nature of mobility change associated with primary school change. Children with a later first school enrolment were more likely to decrease distance from home to school when changing primary schools (Wilcoxon rank-sum test; Pr=0.0089). Children who have never experienced a grade repetition were also more likely to decrease distance from home to school (Wilcoxon rank-sum test; Pr=0.0092). Household level variables Children with mothers with very little education (functionally illiterate), and children whose mothers have completed grade 10 or higher are significantly more likely to experience an increase in distance from home to school than children whose mothers have intermediate levels of education (χ2(1)= 8.4919, Pr=0.075). Children whose mothers have completed grade 10 or higher also experience a mean increase in distance, compared to a mean decrease in distance for all other groups. There was no evidence for any difference in mobility change on the basis of maternal marital status, SES in either 1997 or 2003, or change in SES between these two years. Community level variables Children living in the wealthiest and poorest SAL areas were most likely to experience a substantial decrease in distance travelled (Wilcoxon rank-sum test; Pr=0.0577). By contrast, children in areas with moderate poverty levels were more likely to experience a small increase in travel. The same pattern, although with a lower significance level (Pr=0.0752), was found at the SP level. There was, however, no evidence for a relationship between MP area poverty and change in distance from home to school. 272 8.4.2 Census geography Two ways of measuring change in mobility through the use of census data were presented earlier in this chapter. The first was to explore whether a child moved between schools in two different levels of geography between 1997 and 2003. The second was to explore whether a child moved either into, or out of, a school in the same geographic area in which he or she lived. In the first part of this section, children who moved between primary schools in different geographic areas will be compared to primary school changers who did not move between areas. In the second part of this section, children who moved into a school in the same area as their home will be compared to those children who moved out of a school in the same area as their home. Children moving between schools in different areas SAL Of the 423 children still in primary school in 2003 who changed school between 1997 and 2003, only 25 moved between two schools in the same SAL, while the remaining 398 moved between schools in different SALs. There was no evidence that these two groups of children differed in any way with regards to race, gender, age at first enrolment in school, and whether or not they had ever experienced grade repetition. There was also no evidence for any difference with regards to any of the household level variables explored, namely maternal education, maternal marital status, household SES in 1997, household SES in 2003, or change in household SES between 1997 and 2003. Finally, although there is no evidence that the groups differ with respect to SAL poverty, there is evidence that they differ in both SP area poverty (χ2(4)= 10.8582, Pr=0.021) and MP area poverty (χ2(2)=5.2592, Pr=0.057). Those children who move between schools in different SALs tend to live in somewhat more affluent SP and MP areas. 273 SP Of the 423 school-changers still in primary school in 2003, 282 (66.67%) moved between schools in different SPs, while 141 (33.33%) moved between two schools in the same SP. As was the case at the SAL level, there is no evidence that children moving between schools in the same SP differed significantly from those moving between schools in different SPs with regards to any of the individual level variables considered (race, gender, age at first enrolment, and grade repetition). Children moving between schools in two different SPs did, however, have more highly educated mothers (χ2(4)=10.6102, Pr=0.031), although there was no significant difference with respect to maternal marital status. Those children who moved between schools in different SPs were likely to have a higher SES in both 1997 (χ2(4)=20.2271, Pr=0.000) and 2003 (χ2(4)=13.7227, Pr=0.008). There was no evidence that the two groups of children differed with respect to the poverty levels of the SALs or SPs in which they lived. Children moving between schools in different SP areas were, however, more likely to live in an extremely advantaged or disadvantaged MP (χ2(2)=14.5783, Pr=0.001). In summary, movement between schools in different SPs was associated with higher maternal education and higher household SES, and with living in either a particularly advantaged, or disadvantaged MP. MP 128 (30.26%) of the school changers still in primary school in 2003 moved between schools in two different MPs, while the remaining 295 (69.74%) moved between two schools in the same MP. There was no evidence that these two groups of children differed with respect to race, gender, age at first enrolment, grade repetition, maternal education, or maternal marital status. They did, however, differ significantly with respect to household SES in 1997 (χ2(4)=13.2473, Pr=0.010) and in 2003 (χ 2 (4)=22.0211, Pr=0.000), although not with respect to change in SES between these two points. In both 1997 and 2003, those children moving between schools in two different MPs were more 274 likely to be from more affluent homes. There was no evidence that the two groups of children differed with respect to the poverty area of the SALs in which they lived, but children moving between schools in different MPs were more likely to be living in particularly disadvantaged SP (χ2(4)=10.3958, Pr=0.034) and MP (χ2(2)=11.2848, Pr=0.004) areas. Children moving into, and out of, schools in the same area as their home SAL At the SAL level, the number of children attending school in the same area as their home is extremely small (n=20 in 1997; n=5 in 2003). Due to this small sample size, no further analyses will be conducted on these groups. SP At the SP level, the numbers are somewhat higher, with 60 children attending primary school in the same SP as their home in 1997, but not 2003, and 63 doing so in 2003, but not 1997. There is no evidence that these two groups of children differ with respect to race or gender. The group of children attending local schools in 2003, but not 1997, is however significantly more likely to have first enrolled in school at an older age (χ2(1)=4.2981, Pr=0.038). They are also less likely to have ever repeated a grade (χ2(1)=7.9301, Pr=0.005). There is no evidence that the two groups of children differ with respect to maternal education, maternal marital status, or household SES in either 1997 or 2003. There is, however, evidence for a weakly significant relationship with the change in household SES between 1997 and 2003 (χ2(6)=11.4982, Pr=0.063, with children whose SES has fallen between 1997 and 2003 being more likely to move to a school in the same SP as their home. A weakly significant relationship was also found with both SAL (χ2(4)=8.3838, Pr=0.076) and SP (χ2(4)=9.6659, Pr=0.046) poverty levels. In both cases, children in areas with intermediate poverty levels were more likely to be moving away from schools located in the same area as their home, while children in areas with either very 275 high or low poverty levels were more likely to be moving into schools in the same area as their home. There was no evidence of any relationship between mobility change and the poverty level of the MP in which the child lived. MP At the MP level, of the 427 children still in primary school in 2003 who changed schools at least once, 47 had been attending a school in their MP in 1997, but by 2003 no longer did so. A further 50 children who had not been attending a school in their MP in 1997 were doing so by 2003. There was no evidence that these two groups of children differed with respect to either race or gender. Children moving into a school in the same MP as their home were, however, more likely to have started school later (χ2(1)=9.9730, Pr=0.002), and never have repeated a grade (χ2(1)=5.8091, Pr=0.016), than those children moving away from schools in the same MP as their home. The two groups of children did not differ with respect to any of the household level variables considered (maternal education, maternal marital status, household SES in either 1997 or 2003, and the change in SES between 1997 and 2003). Children moving out of schools in their home SAL (χ2(4)=8.1504, Pr=0.090) and SP (χ2(4)=12.7700, Pr=0.012) areas were more likely to be living in areas with intermediate levels of poverty, while those children moving into schools in the same area as their home were more likely to be living in areas with either very high or very low poverty levels. There was, however, no evidence for any difference in MP poverty. 8.4.3 Nearest school The final approach to measuring changes in mobility is to look at whether children move into, or out of, their nearest grade-appropriate school. Between 1997 and 2003, 67 primary school school-changers moved out of their nearest primary school into another primary school further from home, while 51 276 moved from a primary school further away into their nearest school. There was no evidence that these two groups of children differed systematically with regards to race, gender, age at first enrolment or grade repetition. Children moving out of their nearest school tending to have mothers with higher levels of education than those moving into their nearest schools (χ2(4)=10.2479, Pr=0.034). There was no evidence of a relationship with maternal marital status, household SES in either 1997 or 2003, change in household SES between 1997 and 2003, or poverty at the SAL, SP or MP area levels. 8.4.4 Conclusion: primary school school-changers Children who enrolled in school late for their age, and those who had never repeated a grade were more likely to experience a decrease in distance from home to school. Children living in the most advantaged and disadvantaged SAL and SP areas were also more likely to experience a decrease in distance from home to school. By contrast, children moving to schools further from home between 1997 and 2003 tended to have mothers with particularly low, or particularly high, levels of education. Children moving between schools in different areas appeared to differ from their peers moving between two schools in the same area primarily with respect to maternal education, household SES, and the poverty level of the area in which they lived. Moving between schools in different areas was typically associated with relative advantage, although in some cases, there was a non- linear relationship in which both the most and least advantaged children were particularly likely to move between areas. Children moving into schools in the same area as their homes differed from those moving away from these schools in that they tended to have been older at their first enrolment in school, were less likely to have repeated grades, and lived in areas with either particularly low or high levels of poverty. 277 Children moving out of their nearest primary schools tended to have mothers with higher levels of education than children moving into their nearest primary schools, but otherwise the two groups did not differ significantly. It is not clear that analyses presented here have generated any consistent findings. This may be due to a non-representative sample, and fairly small sample size in some of the analyses, or may simply indicate that mobility change is not significantly related to any of the variables considered here. However, extreme levels of SES, later first enrolment in school, and grade repetition, were the variables most prominent in these analyses, which suggests that they are likely to have stronger relationship to schooling change than the other variables considered. Unfortunately, the nature of this relationship is not clear at this point. 8.5 Correlates of type of mobility change for children transitioning to high school In this section, the correlates of different types of mobility associated with the transition to high school are documented. Unless otherwise indicated, analyses presented here refer to the 390 children in the sample known to have moved from a primary to a high school between 1997 and 2003. Children enrolled at combined schools are excluded from these analyses. Again, the non-random nature of the sub-sample considered here must be emphasized, and findings should be used primarily as a basis for the development of hypotheses requiring further testing. 8.5.1 Straight-line distance Individual level variables A Kruskal-Wallis test found a significant relationship between change in mobility and ethnicity (Pr=0.0365), with black children experiencing greater increases in distance from home to school than coloured children. There was 278 no evidence of any relationship between change in distance from home to school and child gender, age at first school enrolment, or grade repetition. Household level variables There was no evidence for a relationship between change in distance and maternal education, but children of unmarried mothers were more likely to experience larger increases in distance from home to school (Wilcoxon ranksum, Pr=0.0028). Children in the most advantaged and disadvantaged SES quintiles in 1997 were most likely to experience the largest fall in distance from home to school (Kruskal Wallis, Pr=0.0040). Those in the middle quintile in 2003, by contrast, were those experiencing the largest fall in distance (Kruskal Wallis, Pr=0.0247). There was also no evidence for any relationship between change in SES from 1997 to 2003 and change in distance from home to school. Community level variables There was no evidence for any relationship between area poverty, at the SAL, SP or MP level, and change in distance between home and school. 8.5.2 Census geography Two sets of analyses relating to mobility change as defined by census geography are presented for those children transitioning to high school in 2003. The first set of analyses compares children who move between schools in the same area to those who move between schools in different areas. The sample size for these analyses is 400, as the home address data is not used. The second set of analyses compares those children who move away from a school in their home area, and those children who move into a school in their home area. 279 Children moving between schools in different areas SAL As was the case for children moving between primary schools, very few children moving between a primary and high school remained in the same SAL (n=27). Coloured children are more likely to remain in the same SAL than black children (χ2(1)= 4.7763, Pr = 0.029). There is no evidence of a relationship between whether children move between schools in the same or different SALs and any of the other variables examined (gender, age at first enrolment, grade repetition, maternal education, maternal marital status, household SES in 1997 and 2003, change in household SES between 1997 and 2003, and poverty level of the SAL, SP and MP in which they child lives). SP 125 (31.25%) of the children who moved from primary to high school moved between two schools in the same SP, while the remaining 275 children moved between two schools in different SPs. These proportions are again very similar to those found amongst the children moving between two primary schools. Coloured children were again more likely to move between two schools in the same SP than black children (χ2(1)= 30.8927, Pr = 0.000). There is no evidence for a relationship between moving between schools in different SPs and child gender, age at first enrolment, grade repetition, maternal education, or maternal marital status. In both 1997 (χ2(4)=12.3664, Pr=0.015) and 2003 (χ 2 (4)=16.7466, Pr=0.002), children in the most extreme SES quintiles were most likely to be moving between schools in the same SP, while children in the middle quintiles were more likely to be moving between schools in different SPs. There was no evidence for a relationship with change in household SES over time. Finally, children living in poorer SAL (χ2(4)=8.0566, Pr=0.090), SP (χ 2 (4)=14.9720, Pr=0.005) and MP (χ2(2)=9.9770, Pr=0.007) areas were more likely to be moving between schools in two different SPs. 280 MP Of the children moving from primary to high school, 284 (71.00%) move between two schools within the same MP, while 116 (29.00%) move between schools in two different MPs. Again, these figures are very similar to those found for children moving between primary schools. Once again, coloured children are significantly more likely to move between two schools in the same MP than black children (χ2(1)= 9.8932, Pr = 0.002). Again, there is also no evidence for a relationship with gender, age at first enrolment, grade repetition, maternal education, or maternal marital status. In both 1997 (χ2(4)=11.8570, Pr=0.018) and 2003 (χ2(4)=20.6648, Pr=0.000), the children from the poorest households are more likely to move between schools within the same MP, while children in the middle quintiles are more likely to move between schools in different MPs. There is no evidence for a relationship with change in household SES over time however. Children living in poorer SP (χ2(4)=20.9806, Pr=0.000) and MP (χ 2 (2)=11.7971, Pr=0.003) areas are more likely to move between schools in two different MPs than children in more affluent areas, but there is no evidence for a relationship with SAL poverty. Children moving into, and out of, schools in the same area as their home SAL Altogether, 46 children transitioning to high school attend schools in the same SAL as their home in either 1997 or 2003. Of these, 8 attend school in the same SAL as their home at both points in time, while 28 do so only in 1997 and 10 only in 2003. Movement appears to generally be away from the home SAL. Given the small numbers involved, no additional analysis is conducted at this level. SP 219 of the children transitioning to high school attend school in the same SP as their home at some point. 98 attend a school in their home SP in both 1997 and 281 2003, 82 do so only in 1997, and 39 only in 2003. The analyses presented here compare the group of 82 children attending only primary school in their home SP, and the group of 39 children attending only high school in their home SP. There is no significant difference between the two groups of children with respect to race, gender, age at first enrolment, or grade repetition. Children moving into the local SP for high school were more likely to have mothers with intermediate levels of education, while those moving out of the local SP were more likely to have mothers with particularly high or low levels of education (χ2(4)=11.3383, Pr=0.014). There is no evidence that the two groups of children differ with respect to any of the other variables considered (maternal marital status, household SES in 1997 or 2003, change in household SES over time, and area poverty in the home SAL, SP and MP). MP Almost all of the children transitioning to high school, 325, attend a school in the same MP as their home at some point, and 236 are attending school in their home MP at both points in time. Only 56 attend a school in their home MP in 1997 but not 2003, while even fewer, 31, do so only in 2003. The analyses presented below compare the group of 56 children moving out of their home MP for high school with the 31 children moving into their home MP. The two groups of children do not differ with regards to gender, age at first enrolment, grade repetition, or maternal education. Children moving to a high school in their home MP are, however, substantially more likely to be coloured than black (χ2(1)= 11.4483, Pr = 0.001), and are more likely to have married mothers (χ2(1)=6.2554, Pr=0.012). Although the two groups of children do not differ with regards to household SES in 2003, or change in SES over time, children moving to a high school in their home MP were likely to come from either particularly advantaged or disadvantaged households in 1997 (χ2(4)=10.9760, Pr=0.024). Children moving to high schools in their home MP were also more likely to live in particularly advantaged or disadvantaged SAL (χ2(4)=14.1911, 282 Pr=0.008), SP (χ2(4)=20.4014, Pr=0.000), and MP (χ 2 (2)= 14.8044, Pr=0.001) areas. 8.5.3 Nearest school The final set of tests compares those children in high school in 2003, who had been attending their nearest school in 1997 and had moved to a school further away by 2003 (n=41), and those children who attended their nearest school in 2003, but had not done so in 1997 (n=62). Movement towards the nearest school appears to be somewhat more prevalent than movement away from the nearest school, which may be explained by the smaller number of high schools available. The two groups of children do not differ significantly with respect to gender, race, age at first school enrolment, maternal education, maternal marital status, or the poverty of the SAL, SP or MP areas in which they live. They do, however, differ with respect to household SES in both 1997 (χ2(4)=7.8985, Pr=0.092) and 2003 (χ 2 (4)=11.5980, Pr=0.024), with children from more affluent households more likely to be moving away from their nearest school when transitioning to high school. 8.5.4 Conclusion: mobility change associated with transition to high school Somewhat surprisingly, many of the findings for children transitioning to high school echo closely those for children changing between two primary schools. This suggests that the two processes may not be as distinct as initially hypothesized. Change in distance from home to school was related to race, with black children typically moving further afield, as well as with household SES, with children at extreme levels of both advantage and disadvantage tending to move closer to home. There was also evidence that the children of unmarried mothers experienced a greater increase in distance than those with married mothers. Movement between different geographic areas was also associated in the same ways with race and household SES, although in this 283 case an additional relationship with area poverty was also identified. Children living in poorer areas were more likely to move between schools in different geographic areas. Children moving away from their nearest school were more likely to be black and to have mothers who had either particularly high or low levels of education than their peers moving into schools in the same area as their homes. Finally, children moving away from their nearest schools for high school tended to be more affluent than those moving towards their nearest school. 8.6 Conclusion This chapter has provided an overview of how schooling mobility changes over time, and how these changes relate to child and family variables. One of the most striking findings of this chapter is the prevalence of school changes during primary schooling. This high level of school changes – in a sample of children with consistent residential addresses– has important mobility implications. Most notably, it means that not only are many primary school children travelling considerable distances, but also that these distances are not constant with time, despite the widespread preconception that this is a relatively stable phase of schooling. Children living in poor areas, whose household SES increases with time, and whose mothers have intermediate levels of education appear to be the most likely to move between different primary schools. A second important outcome of this chapter is that there are both similarities and differences in the changes to mobility associated with the transition to high school and those associated with a transition between two different primary schools. The overall tendency appears to be for children transitioning to high school to increase their travel distance somewhat, even as a higher proportion of these children also begin to attend the school nearest to their home. By contrast, although a similar proportion of children moving between primary 284 schools increase their travel distance, these increases are typically smaller, and the average change in distance is negative. These primary school children also become less likely to attend the school nearest to their home. Although the patterns of change at each level of census geography is similar for children moving between primary schools, and those moving to high schools, this disguises the fact that children at the primary school level are on average moving to areas closer to home, while children at the high school level are moving to areas further away from home. Despite these differences, however, the correlates of particular kinds of changes to mobility are extremely consistent across the two groups of children. Thirdly, the different correlates associated with the different types of changes to mobility substantiate the notion that the three measures of mobility used in this project capture slightly different aspects of the phenomenon. The evidence continues to be consistent with the notion that there are two separate processes of school choice, and by extension mobility behaviours, in play in contemporary urban Soweto-Johannesburg. One involves substantial travel, and typically involves those children with access to the greatest resources, and those living in areas in which services are not available. The other involves mobility at relatively local levels, and seems to engage children with more intermediate levels of resources and maternal education. 285 Chapter 9: Modelling educational mobility 9.1 Introduction Having explored the bivariate relationships between educational mobility and a range of variables at the child, household, community and school levels, this chapter combines these variables to develop a multivariate model predicting educational mobility. This is done through a series of regression analyses, using child, household, community and schooling data as independent variables, and the various measures of educational mobility as dependent variables. As most of the school attribute variables presented in Chapter 7 were closely related to mobility, but were also highly correlated with each other, principal component analysis (PCA) was conducted on all school attribute variables that were consistently found to be significantly related to mobility (school quintile, school fees, school enrolment, percent black learners, school sector, historical DET status, and pass rate) to generate a school quality index. This process was repeated for the school attended by each child in 1997 and 2003, as well as for the nearest grade-appropriate school to the child‘s home in 1997 and 2003. In all cases, the eigenvalues of the first two components of the PCA were both greater than 1, and were therefore both retained. Transformation of some additional variables was also conducted to allow for their use in a regression context. Race was re-coded into a binary variable, coded 1 if the child was black African and 0 otherwise. Gender was similarly recoded to 1 if the child was a boy and 0 if the child was a girl. Poverty data for the area in which a child lived was used in its raw form, as opposed to the quintile form, as was household SES. The only non-binary categorical independent variable used was maternal education, and this was converted to a 286 series of dummy variables, with a maternal education level of Grade 5 or below used as the base category. In the first section of this chapter, regression models are developed to predict the straight-line distance between home and school for both 1997 and 2003. In the second section, logistic regression models are used to predict the likelihood that that children attend schools located in the same Sub-Place (SP) and Main Place (MP) areas in which they live, again for both 1997 and 2003. In the third section of the chapter, logistic regression models are again developed, this time to predict the likelihood that a child will attend his or her nearest grade- appropriate school. Finally, the implications of these various models for the hypotheses around mobility presented in previous chapters are discussed. 9.2 Straight-line distance As noted in Chapter 5, the distribution of straight-line distance from home to school across the sample is highly non-normal. A range of transformations of this variable were generated and tested for normality, but none were found to be normal. The log transformation, however, was closest to a normal curve, and as a result, is used as the dependent variable in this set of models. All of the child, household and community level variables discussed in Chapter 5, along with the PCA-generated variables relating to the quality of a child‘s attended and nearest school, were considered for inclusion as independent variables. 9.2.1 1997 Initially, a standard OLS regression was run, including all child, household and community variables, as well as the derived scores for schools attended and nearest to a child‘s home. This model, however, featured extensive multicollinearity. The correlation matrix for all variables was examined, and 287 one major area of concern was identified. This was the strong relationship between area poverty and the attributes of the school closest to the child‘s home. As the attributes of the nearest school were also closely related to the attributes of the school attended, a decision was made to retain the area poverty level variables only. Due to the high correlation between poverty levels at the various different levels of geography (SP and MP in particular), each was tested for significance. , and the decision was made to retain only SP poverty in the final model. Results for this model are described below, and presented in Table 9.1. Tests were conducted to explore concerns around cases exerting an undue influence on the models. A number of outliers and cases with high leverage were identified. As these cases appeared to be features of the data rather than errors, they were not removed, but it should be noted that they may have biased the results reported below. Although a Breusch-Pagan test found no evidence of heteroskedasticity, both a visual examination of the variances and a White test provided evidence of heteroskedasticity. The distribution of the residuals also appeared normal on inspection, but a Shapiro-Wilk test rejected this hypothesis. Due to these various concerns, the regression was repeated using regression with robust standard errors, and also a robust regression which weights cases differently to minimize the effects of cases with particularly high influence. These results are also presented in Table 9.1, and it is clear that they are not substantially different from the results of the standard regression. For all models, tests indicated that omitted variable bias was present, and this persisted even with the inclusion of a range of other variables. However, a model specification link test failed to reject the assumption that the model was correctly specified. The results of these regressions, presented in Table 9.1 below, suggest that the coefficients on race, maternal education, and the attributes of the school attended are most significant in predicting the distance a child travels from 288 home to school in 1997. Across all models, black children are significantly more likely to travel further than children of other race groups. The relationship between distance and maternal education is somewhat more complex, and slightly counterintuitive, as it suggests that children whose mothers have very limited formal education (grades 5 or 6 only) tend to travel the furthest, followed by those whose mothers have reached grade 11-12. As shown by the coefficient on the first school attributes variable, children attending more advantaged schools, are also likely to travel further than children attending less advantaged schools. There is some weak evidence that maternal marital status may also be related to mobility, with children of married mothers tending to travel somewhat further. There is, however, no evidence that household SES shapes distance, although it seems more likely that the effect of household SES has already been captured in other relatively highly correlated variables, such as school quality and maternal education, than that it doesn‘t relate to mobility at all. There is also evidence that the poverty level of the area in which a child lives plays a limited role in predicting travel distance, with children in poorer areas tending to travel further. Overall, these findings sustain the hypothesis that black children from homes where mothers have at least some education, attending relatively privileged schools, but still living in less advantaged areas, tend, overall, to travel somewhat further to school than their peers. Standard regression Standard regression with robust errors Robust regression Black African race 2.053 (0.174) *** 2.053 (0.187) *** 2.438 (0.160) *** Male gender 0.005 (0.092) 0.005 (0.094) 0.010 (0.084) Later age at first school enrolment 0.097 (0.094) 0.097 (0.094) 0.064 (0.086) Maternal education grade 5-7 0.793 (0.248) *** 0.793 (0.235) *** 0.527 (0.227) ** Maternal education grade 8-10 0.395 (0.200) ** 0.395 (0.167) ** 0.242 (0.183) Maternal education grade 11-12 0.561 (0.210) *** 0.561 (0.184) *** 0.369 (0.192) * Maternal education post-school 0.404 (0.260) 0.404 (0.240) * 0.393 (0.238) * 289 Maternal marital status 0.176 (0.104) * 0.176 (0.108) 0.184 (0.095) * Household SES 1997 0.013 (0.032) 0.013 (0.034) 0.026 (0.029) School attended 1997 attributes component 1 9 0.644 (0.037) *** 0.644 (0.034) *** 0.696 (0.034) *** School attended 1997 attributes component 2 0.057 (.0481) 0.057 (0.042) 0.051 (0.044) SP poverty (raw score) 0.068 (0.038) * 0.068 (0.045) 0.080 (0.035) ** Constant -1.940 (0.247) *** -1.940 (0.223) *** -2.205 (0.226) *** *significant at P<0.1 level ** significant at P<0.05 level ***significant at P<0.01 level No. of obs = 742 F(12, 729) = 36.70 Prob > F = 0.0000 R-squared = 0.3766 Adj R-squared = 0.3663 Root MSE = 1.236 Number of obs = 742 F(12, 729) = 54.78 Prob > F = 0.0000 R-squared = 0.3766 Root MSE = 1.236 No. of obs = 742 F(12, 729) = 52.10 Prob > F = 0.000 Table 9.1: 1997 regression results. Figures in parentheses are standard errors. 9.2.2 2003 A similar process to that described above was used to develop the model for 2003. The major difference in model construction for 2003 is that a variable indicating whether the child repeated a grade between 1996 and 2003 is included. As was the case with the 1997 model, tests indicated concerns around cases with particularly high influence, as well as the presence of some heteroskedasticity. As a result, Table 9.2, below, reports the results of a standard regression, along with the results of the regression re-run with robust errors, and a robust regression to reduce the impact of particularly influential cases. Once again, evidence for omitted variable bias could not be eliminated, although a model specification link test failed to reject the assumption that the model was correctly specified. 9 The strongest loadings for school component 1 in 1997 were historical DET status and percentage black students, both loading negatively. Component 2 loadings were matric pass rate and school fees, loading positively, and school enrollment, loading negatively. 290 The patterns identified in the 2003 regressions are very similar to those identified for 1997, with race, school attributes, and maternal education continuing to be the strongest predictors of distance from home to school. The role of maternal education in shaping mobility has, however, changed somewhat, becoming more linear. Maternal education at the grade 5 to 7 level no longer contributes to mobility, and higher maternal education is typically associated with higher mobility. Black children continue to travel substantially further than members of other race groups, as do children attending more advantaged schools. Evidence for a role of residential area poverty or maternal marital status in determining mobility has largely disappeared in 2003. Overall, however, the relative similarity of these results over time, despite the substantial change in the age of the children, seems to suggest that the determinants of mobility remain fairly consistent, throughout the primary school years, and even into secondary schooling. Standard regression Standard regression with robust errors Robust regression Black African race 1.876 (0.216) *** 1.876 (0.269) *** 2.275 (0.209) *** Male gender -0.153 (0.096) -0.153 (0.094) -0.212 (0.093) ** Late age at first school enrolment 0.028 (0.100) 0.028 (0.100) -0.003 (0.097) Grade repetition 0.018 (0.104) 0.018 (0.104) -0.026 (0.101) Maternal education grade 5-7 0.327 (0.255) 0.327 (0.249) 0.152 (0.247) Maternal education grade 8-10 0.398 (0.187) ** 0.398 (0.165) ** 0.324 (0.181) * Maternal education grade 11-12 0.488 (0.196) ** 0.488 (0.175) *** 0.378 (0.190) ** Maternal education post-school 0.415 (0.248) * 0.415 (0.232) * 0.280 (0.240) Maternal marital status 0.086 (0.104) 0.086 (0.101) 0.064 (0.101) Household SES 2003 0.112 (0.097) 0.112 (0.055) ** 0.077 (0.094) School attended 2003 attributes component 110 0.696 (0.037) *** 0.696 (0.039) *** 0.746 (0.036) *** 10 For 2003, historical DET status and percent black students both loaded strongly negatively on component 1, while school quintile and school fees loaded strongly positively. School 291 School attended 2003 attributes component 2 0.068 (0.053) 0.068 (0.053) 0.081 (0.051) SP poverty (raw score) 0.071 (0.038) * 0.071 (0.047) 0.024 (0.037) Constant -1.355 (0.283) *** -1.355 (0.282) *** -1.565 (0.274) *** *significant at P<0.1 level ** significant at P<0.05 level ***significant at P<0.01 level No. of obs = 543 F(13, 529) = 40.87 Prob > F = 0.0000 R-squared = 0.5011 Adj R-squared = 0.4888 Root MSE = 1.0728 Number of obs = 543 F(13, 529) = 43.06 Prob > F = 0.0000 R-squared = 0.5011 Root MSE = 1.0728 No. of obs = 543 F(13, 529) = 51.80 Prob > F = 0.0000 Table 9.2: 2003 regression results. Figures in parentheses are standard errors. 9.3 Census area geography In this section, models are developed for whether children attend schools in the same geographical area in which they live. Due to the very low numbers of children attending school in their home Small Area Level (SAL), and the very low numbers who do not attend school in their home Municipality (MN), models are only pursued for mobility at the Sub-Place (SP) and Main-Place (MP) levels. As the dependent variable used in these models is binary, a logistic regression approach is used. Initially, the regression was run using the same set of variables as presented in the distance to school regressions above. However, tests indicated specification errors, which were reduced by replacing the raw household SES scores with dummy variables for each quintile of household SES. This improvement may be related to nonlinearities in the relationship between household SES and the census area based measure of mobility. In both 1997 and 2003, household SES quintile 1 (highest poverty) was used as the base category. SP poverty was also treated as a categorical variable, again to allow for nonlinearity in the relationship between SP poverty and mobility behaviour. Additional tests for multicollinearity, goodness of fit, and particularly influential cases were also conducted on all models before they were finalised (Chen, Ender et al. 2011). Although at the end of this enrolment loaded strongly negatively on component 2, while school fees again loaded strongly positively. 292 process all of the models detailed below continued to provide some evidence of specification error, there was no theoretically justifiable way to further improve their specification. 9.3.1 Sub-Place level Results for both 1997 and 2003 are presented in Table 9.3 below. Regressions were run both with and without robust standard errors. However, as the coefficients in both cases are identical, and significance levels did not change substantially, only the results using robust standard errors are presented in Table 9.3. Note that grade repetition was not included in the 1997 model. The 1997 model used the household SES poverty quintiles based on 1997 data, while the 2003 model used the quintiles based on the 2003. Each model also used the school attributes score based on the school attended at that point in time. The coefficients returned by the logistic regression for SP mobility in 1997 indicate that black children are again significantly more mobile, as black race is negatively associated with the probability that a child attends a school in the same SP as his or her home. By contrast, the association between maternal education and mobility has disappeared. There is some evidence that children in household SES quintiles 2 and 4 are more likely to attend schools outside of their home SP. The evidence that the attributes of the school attended shape mobility remains strong. The coefficients on both components of school attributes indicate that children attending more advantaged schools are more likely to attend a school located outside of their home SP. Finally, area poverty is also significantly associated with mobility. Those children living in wealthier areas are less likely than their peers living in poorer areas to be attending schools in their home SPs. The 2003 results are very similar to those for 1997. Black African race remains a strongly significant predictor of mobility at the SP level, with black children 293 less likely to attend a school in their home SP. There is again no evidence that maternal education is related to mobility at the SP level in 2003. Children attending more advantaged schools, and those living in the least advantaged areas, continue to be significantly more likely to be travelling to attend a school outside of their home SP in 2003. Children with mid-range household SES are also particularly likely to be attending school outside of their home SP. There is also weak evidence that children with married mothers are more likely to be travelling out of their home SP in order to attend school in 2003. Overall, determinants of mobility at the SP level, in both 1997 and 2003, appear to be fairly similar to determinants of distance from home to school, with the exception that maternal education no longer appears to play a significant role. This is critical, because of the close relationship between maternal education and affluence, and suggests that household resource levels may be less important in shaping mobility at the fairly small SP level of geography, than in shaping distance from home to school. 1997: Logistic regression with robust errors 2003: Logistic regression with robust errors Black African race -3.314 (0.480) *** -3.422 (0.725) *** Male gender -0.222 (0.173) 0.005 (0.229) Late age at first school enrolment -0.233 (0.178) 0.280 (0.238) Grade Repetition 0.015 (0.245) Maternal education grade 5-7 -0.624 (0.434) -0.276 (0.573) Maternal education grade 8-10 -0.413 (0.358) -0.458 (0.436) Maternal education grade 11-12 -0.266 (0.380) -0.566 (0.471) Maternal education post-school -0.258 (0.463) -0.231 (0.539) Maternal marital status -0.164 (0.197) -0.433 (0.248) * Household SES quintile 2 -0.554 (0.244) ** -0.531 (0.323) Household SES quintile 3 -0.182 (0.250) -1.130 (0.330) *** 294 Household SES quintile 4 -0.468 (0.274) * -0.269 (0.342) Household SES quintile 5 (most advantaged) -0.465 (0.376) 0.249 (0.391) School attended attributes component 1 -0.947 (0.102) *** -1.240 (0.131) *** School attended attributes component 2 -0.202 (0.099) ** -0.443 (0.147) *** SP poverty quintile 2 (relatively low area poverty) -1.033 (0.349) *** -0.983 (0.627) SP poverty quintile 3 -0.690 (0.422) -0.684 (0.645) SP poverty quintile 4 -0.975 (0.427) ** -1.409 (0.669) ** SP poverty quintile 5 (highest poverty areas) -1.474 (0.464) *** -2.059 (0.698) *** Constant 3.967 (0.535) *** 3.602 (0.760) *** *significant at P<0.1 level ** significant at P<0.05 level ***significant at P<0.01 level No. of obs = 742 Wald chi2(18) = 94.79 Prob > chi2 = 0.0000 Pseudo R-squared = 0.1940 Log likelihood = -410.34555 No. of obs = 543 Wald chi2(19) = 124.39 Prob > chi2 = 0.0000 Pseudo R-squared = 0.2849 Log likelihood = -252.24727 Table 9.3: 1997 & 2003 SP mobility regression results. Figures in parentheses are standard errors. 9.3.2 Main Place level Results for logistic regressions at the MP level for both 1997 and 2003 are presented in Table 9.4 below. Again, only results with robust standard errors are presented. Grade repetition was not included in the 1997 model. The 1997 model used the household SES poverty quintiles based on 1997 data, while the 2003 model used the quintiles based on the 2003. Each model also used the school attributes score based on the school attended at that point in time. 295 In 1997, black children, and those attending comparatively advantaged schools were less likely to be attending a school in their home MP. Although there is weak evidence that maternal education at the grade 11 to 12 level is associated with a higher probability of attending school outside of the MP in which the home is located, there appears to be little role for maternal education in determining MP level mobility. Similarly, evidence for a role of household SES is also limited, with only children falling into quintile 2 being significantly more likely to attend a school outside of their home MP. By contrast, however, and as was the case at the SP level, there is strong evidence that attending a more advantaged school, and living in an area with higher poverty levels, were both associated with an increased probability of attending school outside of the home MP. The 2003 results for mobility at the MP level are fairly similar to those for 1997. Once again, black children are significantly more likely to be attending a school outside of their home MP. There is some evidence that maternal education at the post-school level increases the likelihood of a child attending a school outside of their home MP. Children enrolled in more affluent schools are also more likely to be travelling to a school outside of the MP in which they live. Finally, children living in SP areas with higher levels of poverty are more likely to be attending a school outside their home MP. Overall, these results suggest that the determinants of MP mobility do not change substantially over time, and indicate that black children, attending fairly advantaged schools, but living in poor areas, are most likely to travelling outside of the MP in which their home is located in order to go to school. Interestingly, there was some evidence that particularly high levels of maternal education were predictive of mobility at the MP level, which was not the case for mobility at the SP level. This may be reflective of the typically longer distances associated with MP mobility compared to SP mobility, which would cause it to require greater levels of resource investment. 296 1997: Logistic regression with robust errors 2003: Logistic regression with robust errors Black African race -3.820 (0.533) *** -6.816 (3.574) * Male gender 0.037 (0.255) -0.019 (0.292) Late age at first school enrollment -0.305 (0.244) -0.002 (0.341) Grade repetition -0.091 (0.350) Maternal education grade 5-7 -1.064 (0.802) -0.465 (1.043) Maternal education grade 8-10 -1.057 (0.700) -0.441 (0.649) Maternal education grade 11-12 -1.340 (0.707) * -0.529 (0.626) Maternal education post- school -1.254 (0.773) -1.472 (0.735) ** Maternal marital status -0.056 (0.269) -0.137 (0.339) Household SES quintile 2 -0.985 (0.389) ** -0.243 (0.504) Household SES quintile 3 -0.484 (0.390) -0.798 (0.519) Household SES quintile 4 -0.449 (0.403) -0.505 (0.518) Household SES quintile 5 (most advantaged) -0.598 (0.511) 0.029 (0.546) School attended attributes component 1 -1.324 (0.116) *** -1.931 (0.202) *** School attended attributes component 2 0.042 (0.110) 0.133 (0.180) SP poverty quintile 2 (relatively low area poverty) -1.532 (0.611) ** -2.466 (0.812) *** SP poverty quintile 3 -1.883 (0.694) *** -3.013 (0.820) *** SP poverty quintile 4 -2.018 (0.708) *** -3.410 (0.809) *** SP poverty quintile 5 (highest poverty areas) -3.033 (0.723) *** -3.716 (0.836) *** Constant 8.188 (0.904) *** 11.037 (3.790) *** *significant at P<0.1 level ** significant at P<0.05 level ***significant at P<0.01 level No. of obs = 742 Wald chi2(18) = 201.40 Prob > chi2 = 0.0000 Pseudo R-squared = 0.4315 Log likelihood = -232.40251 No. of obs = 543 Wald chi2(1) = 201.40 Prob > chi2 = 0.0000 Pseudo R-squared = 0.56102 Log likelihood = -148.17552 Table 9.4: 1997 and 2003 MP mobility regression results. Figures in parentheses are standard errors. 297 9.3.3 Census area mobility discussion At both the SP and MP level, and in both 1997 and 2003, black race is a strong predictor for increased mobility, as is attending a comparatively advantaged school, and living in a less affluent SP. Interestingly, maternal education, which did predict distance from home to school, does not predict SP mobility at all, and only the highest levels of maternal education are associated with MP mobility. This may indicated that the role of household resources in determining mobility is more limited when looking at travel between smaller areas. This fits well with the argument that two patterns of mobility – one involving substantial travel to historically advantaged schools, and demanding more resources, and one involving more localized travelling and requiring fewer resources – are evident in urban South Africa. SP mobility appears to fit into the lower-resource pattern of mobility, while MP mobility is more closely linked to the higher-resource pattern. 9.4 Nearest school analysis The final set of models presented in this chapter use attendance at a child‘s nearest grade-appropriate school as the dependent variable. Logistic regressions were conducted using both the variable indicating whether the child attended his or her nearest public school, and the variable indicating whether the child attended his or her nearest public or independent school. As the results in both cases were essentially identical, only the results for the variable including both public and independent schools are presented here. Constructing this model was challenging, as it was not possible to obtain a particularly good fit for either the 1997 or 2003 models. This may be due to omitted variables helping to determine whether a child attends his or her nearest school, and may also be due to a fairly high level of randomness in this particular outcome. However, given that both models pass all other goodness of fit tests, and give no indication of specification errors, the low R-squared in the context of a logistic regression is not necessarily of great concern. 298 Again, both a standard logistic regression, and a logistic regression with robust errors were conducted for each time point, and provided very similar results. Table 9.5, below, presents the results for the regressions with robust errors for 1997 and 2003. Grade repetition was only included in the 2003 model. The 1997 model was conducted using 1997 household SES and school attributes, while the 2003 model used these variables for the 2003 time point. The 1997 model indicates that black children are significantly less likely to attend their nearest school. Children who start school late for their age are more likely to attend their nearest schools than those who started early. This is interesting, as it the effect of age at enrolment appears to be moving in the opposite direction than that suggested in the bivariate analyses. The coefficient may be driven by the fact that more affluent children are likely to start school later, and are also more likely to attend their nearest school, even though their average travel distance is greater than that of their less affluent peers. There is no evidence for any relationship between likelihood of attending the nearest school, and either household SES or maternal education. There is, however, a strong significant relationship between the nature of the school the child attends in 1997, and whether this is the school closest to his or her home. Children attending more advantaged schools are less likely to be attending the school nearest to their home. Finally, there is also evidence that children living in areas with mid-range levels of poverty are least likely to attend their nearest schools. Children living in the most disadvantaged areas are, however, still less likely to attend their nearest school than their peers in more advantaged areas. This is in line with the bivariate findings presented in earlier chapters. As in 1997, black children in 2004 remain significantly less likely to attend their nearest grade appropriate schools than children of other races. The relationship between later age at first enrolment and attending the nearest school has, however, disappeared by 2003. There is weak evidence that 299 children of mothers with limited primary school education (Grades 5-7) may be more likely to attend their nearest school than children of mothers with no formal education, but other than this, there is again no evidence for a relationship between maternal education and whether a child attends his or her nearest school. Similarly, there is also some evidence that children in households with mid-range SES are more likely than the most disadvantaged children to attend a school other than their nearest school, but other than this, no evidence for a relationship between household SES and whether or not a child attends his or her nearest school. A statistically significant negative relationship remains between how advantaged the school a child attends is, and the likelihood that the child is attending his or her nearest school. Finally, there is also a strong negative relationship between area poverty and probability of attending the nearest school, with children living in the poorest areas least likely to do so. The 1997 and 2003 results are once again largely consistent over time, and highlight the role of race, school quality, and area poverty in determining whether a child attends his or her nearest school. The very limited evidence for any role of maternal education or household SES in determining whether a child attends his or her nearest school also supports the argument that choosing not to attend the nearest school is a form of school choice which requires fairly little in terms of the investment of resources. 1997: Logistic regression with robust errors 2003: Logistic regression with robust errors Black African race -2.251 (0.431) *** -1.477 (0.643) ** Male gender -0.138 (0.199) 0.145 (0.263) Late age at first school enrollment 0.504 (0.201) ** -0.150 (0.280) Grade repetition -0.181 (0.285) Maternal education grade 5-7 -0.617 (0.509) 1.110 (0.641) * Maternal education grade 8-10 -0.371 (0.385) 0.151 (0.512) Maternal education -0.388 (0.424) 0.242 (0.539) 300 grade 11-12 Maternal education post-school -0.325 (0.553) 0.636 (0.657) Maternal marital status -0.242 (0.241) 0.100 (0.280) 1997 household SES quintile 2 -0.281 (0.300) -0.215 (0.356) 1997 household SES quintile 3 -0.326 (0.323) -0.858 (0.416) ** 1997 household SES quintile 4 0.194 (0.318) -0.172 (0.397) 1997 household SES quintile 5 (most advantaged) 0.062 (0.421) -0.311 (0.481) School attended 1997 attributes component 1 -0.626 (0.099) *** -0.711 (0.143) *** School attended 1997 attributes component 2 -0.233 (0.102) ** -0.658 (0.172) *** SP poverty quintile 2 (relatively low area poverty) -0.693 (0.333) ** -0.566 (0.582) SP poverty quintile 3 -1.550 (0.433) *** -1.263 (0.587) ** SP poverty quintile 4 -0.962 (0.403) ** -1.291 (0.599) ** SP poverty quintile 5 (highest poverty areas) -0.914 (0.436) ** -1.612 (0.658) ** Constant 1.477 (0.541) *** 0.349 (0.733) *significant at P<0.1 level ** significant at P<0.05 level ***significant at P<0.01 level No. of obs = 738 Wald chi2(18) = 87.39 Prob > chi2 = 0.0000 Pseudo R-squared = 0.1241 Log likelihood = -327.56921 No. of obs = 541 Wald chi2(19) = 67.58 Prob > chi2 = 0.0000 Pseudo R-squared = 0.1631 Log likelihood = -207.7699 Table 9.5: 1997 and 2003 nearest school attendance logistic regression results. Figures in parentheses are standard errors. 9.5 Conclusion The data presented in previous chapters of this thesis have suggested that two patterns of learner mobility are operating in the Johannesburg-Soweto area. Firstly, there are children who are travelling particularly long distances to attend school, typically at a fairly high economic and social cost. The second pattern involves travel at a more local level, with children and their families 301 making active choices between those schools accessible from their homes. This second pattern of mobility represents a substantially less resource-intensive approach to engaging in school choice, and can be engaged in by a broader group of children, and most notably those children whose mothers have only fairly limited education. The results presented in this chapter are summarized in Table 9.6, below, and provide additional support for presence of these two, different, patterns of mobility. Distance from home to school School outside of home SP School outside of home MP Not attending nearest school Variables associated with increased mobility Black race Maternal education Higher school quality Black race 1997 Household SES quintiles 2 & 4 (1997 only) 2003 Household SES middle quintiles Higher school quality Poorer SP area Black race Maternal education Grade 11-12 (1997 only) Maternal education post- school (2003 only) 1997 Household SES quintile 2 (1997 only) Higher school quality Poorer SP area Black race Younger age at first school enrolment (1997 only) Very low maternal education (2003 only) 2003 Household SES quintile 3 (2003 only) Higher school quality Poorer SP area Table 9.6: Summary of variables associated with increased mobility in the regression models presented in this chapter 302 Evidence for the first, longer-distance pattern is provided in the regression models for distance between home and school, and to a lesser degree, mobility at the MP level of census geography. In addition to black race, and attending a high quality school, distance from home to school is strongly related to higher maternal education. The role of higher maternal education in increasing mobility, however, largely disappears in all other models, with the exception of mobility at the MP level. Maternal education, therefore, appears to be critical to determining whether a child engages in a more costly form of mobility or not. Although the absence of significant coefficients on household SES variables is of some concern to the hypothesis that these forms of mobility are pursued by more advantaged families, this is probably due to the strength of the correlation between household SES and school quality, and the very substantial coefficients attached to school quality in these regressions. Overall, the models for distance from home to school, and for MP mobility suggest that these forms of mobility are more demanding on families, and at the very least require the additional social and human capital associated with a more highly educated mother. When looking at the results for SP mobility, and a child not attending his or her nearest school, black race and the quality of the school attended continue to be important predictors. Although, as noted, there is little or no evidence for a role of maternal education in predicting these types of mobility, there is some evidence that intermediate levels of household SES may play a role. These combinations of significant variables support the argument that these are forms of mobility that do not require particularly high levels of social or economic capital, even though they are not completely cost free. The role of area poverty in these models has also become more significant, suggesting that these forms of mobility are likely to be more strongly driven by the nature of local educational opportunities than by the resources available to a child and his or her family. 303 Chapter 10: Conclusion 10.1 Introduction This chapter begins with a brief overview and synthesis of the key findings presented in this thesis, and then discusses the implications of these findings. The original contributions – methodological, empirical and theoretical – that this thesis makes to the existing body of scholarly literature are highlighted. The contextual relevance of the study findings is discussed, with a brief discussion of potential implications for school policy in South Africa. Finally, limitations to the work presented here, along with suggestions for future work, are presented. 10.2 Overview of key findings Table 10.1 below outlines the key findings of this thesis, with respect to each of the study‘s major aims. These findings are discussed at greater length in the subsequent sections. Objective Chapter Thesis Findings To develop approaches to the measurement of learner mobility appropriate to the South African context 3 & 5 -- Three different approaches to measuring learner mobility were tested -- Each approach provided different, but complementary data -- Using multiple approaches to measuring learner mobility allowed for the identification of two distinct patterns of mobility – one based primarily on choice between local schools, and one based on travel of substantial distances to schools in more advantaged areas To measure the extent of learner mobility in post- Apartheid Johannesburg- Soweto, South Africa 5 -- Each approach to the measurement of learner mobility provided evidence that learner mobility is highly prevalent in Johannesburg-Soweto -- Approximately a quarter of children travel over 5km to school each way on a daily basis -- Roughly 25% of children attend school outside of the MP in which they live, while 60% attend 304 school outside of the SP in which they live -- Less than 20% of children attend the grade- appropriate school that is closest to their home. -- This data suggests two distinct patterns of mobility – one due primarily to choice between fairly local schools, and involving relatively limited travel, and one involving the choice of historically advantaged schools substantially further afield To identify potential determinants of learner mobility at the child, family and community level 6 &9 -- At the child level, bivariate analyses indicated that mobility behaviour was related to race, age at school enrolment, grade repetition, and phase of schooling -- At the family level, there was evidence for a relationship between mobility and maternal education and household SES -- At the community level, there was evidence for a non-linear relationship between mobility and area poverty -- Multivariate analysis suggested that race, maternal education and area poverty where the strongest and most consistent determinants of mobility behaviour To explore the relationship between school attributes and mobility behaviour 7 & 9 -- On average, children were found to be attending a school more advantaged than the school closest to their home -- More advantaged children were found to be attending more advantaged schools -- Attending a comparatively advantaged school was associated with greater engagement in all forms of mobility -- Children attending private schools, higher quintile schools, schools with a lower proportion of black learners, schools charging higher fees, schools that did not historically fall under the DET, and schools with higher pass rates were more likely to be engaged in learner mobility To identify whether and how learner mobility changes as children age 8 -- There was no clear evidence to suggest that mobility behaviour changes substantially as children age -- Although there was some evidence that children typically travelled slightly further to high schools than primary schools, and were slightly more likely to attend their nearest school at the high school level, this appeared to be due to differences in the sizes and distribution of primary and high schools To generate a preliminary 9 -- Models were developed using each of the 305 model of the determinants of learner mobility three definitions of learner mobility investigated in this thesis -These models suggested that black race and attending a high quality school were strong predictors of all forms of mobility -- Mobility requiring substantial travel was also predicted by maternal education, while mobility at the more local level was most strongly predicted by local area poverty To develop an evidence-based conceptual framework to support ongoing research into learner mobility and school choice 3 & 9 & 10 -- The evidence presented in this thesis supports the argument that child, family, community and school level variables all play a role in shaping school choice decision making -- However, given that two patterns of learner mobility are in operation in Johannesburg- Soweto, and that they appear to be driven by different variables, the evidence also suggests that a conceptual framework which does not differentiate between these forms of mobility may be insufficient Table 10.1: Overview of the key findings presented in this thesis 10.2.1 Developing approaches to measuring learner mobility The thesis made use of three different approaches to measuring learner mobility: straight-line distance between home and school, whether children attend school in the area in which they live, and whether children attend the grade-appropriate school nearest to their homes. Although all of these measures provided consistent evidence for high levels of learner mobility amongst urban South African primary school children, they also captured different aspects of this mobility. As such, the measures proved to be complementary, providing evidence that learner mobility in Johannesburg- Soweto should be understood to consist of two distinct components. These are firstly the mobility involving fairly limited travel distances associated with choice between a number of local schools, and secondly the mobility involving much greater travel distances, and relating to the choice of schools much 306 further afield, typically in historically more advantaged areas than the child‘s home. 10.2.2 Measuring the extent of learner mobility in Johannesburg-Soweto Although there has long been substantial reason to believe that South African learners are motivated to travel relatively long distances to attend particular schools (Cosser and du Toit 2002; Fiske and Ladd 2004; Maile 2004; Msila 2005; Woolman and Fleisch 2006; Msila 2009), this thesis presents the first population-based evidence on what these distances actually are, and how widespread engagement in mobility is. This evidence does suggest that primary school learners in the Johannesburg-Soweto area are extremely mobile. At both ages 7 and 13, over 25% of children were travelling to schools over 5km away from their homes. Almost 60% of children were travelling outside of the Census Sub-Place (SP) area in which they lived (roughly equivalent to a suburb), to attend school in a different SP. At the Main Place (MP) level, equivalent to a small town, roughly 25% of children were travelling to attend a school in an MP other than the one in which they lived. Finally, fewer than 20% of children were found to be attending the grade-appropriate school closest to their home. These figures provide evidence that learner mobility, and school choice, is widespread amongst primary school children and their families in urban South Africa. Certainly, the numbers of children travelling on a daily basis to schools over 5km from their homes is notable, and particularly at 7 years of age was not anticipated by existing information. As corroborated by the overlap between children travelling over 5km, and those mobile at the MP level, most of these children are travelling to areas in very different parts of Johannesburg from their homes, and will be attending school in a very different linguistic and socio-economic context. The data presented also suggest that while some 307 families engage in school choice by sending children to schools further afield, a substantial proportion of others engage in choice at a local level. A full 50% of children lived less than 1.5km from their school, but less than half of them were attending the school closest to their home. This suggests that even less advantaged families are making active decisions in pursuit of the best possible educational opportunities for their children, even in the context of an extremely poorly performing public schooling system. Again, this provides support for the argument that there are two distinct patterns of learner mobility in evidence in Johannesburg-Soweto. 10.2.3 Potential child, household and community-level determinants of learner mobility This thesis tested a range of variables at the child, family and community levels to explore their relationship to learner mobility. At the child level, race, gender, age at first school enrolment and grade repetition, and school phase in 2003 were examined. For all definitions of mobility, race was strongly related to mobility. Although small sample sizes meant that conclusions could not be drawn about the mobility of white and Indian children, there was clear evidence that black children were substantially more likely to engage in mobility than their coloured peers. There was some indication that girls, especially on reaching high school, tended to travel slightly further than boys, but overall there was no compelling evidence for a relationship between gender and mobility. There was evidence that children who first enrolled in school at an older age travelled further at age 7, though not at age 13. This may be due to less wealthy parents sending their children to school at a younger age, to minimize the need to provide childcare. It may also relate to wealthier parents enrolling their children in primary school at a slightly older age, when they might be expected to cope more easily with academic and social challenges associated 308 with attending a school outside of the local area. Grade repetition was also strongly related to distance from home to school, at both ages 7 and 13, as well as to mobility at the SP and MP levels. By contrast, there was no evidence for a relationship between grade repetition and enrolment at the nearest school. Finally, children attending high schools did travel further than those still enrolled at the primary level, and were also significantly more likely to attend their nearest school. At the family level, the thesis examined maternal education, maternal marital status, and household SES. Maternal education was strongly linked to all measures of mobility, with the children of more educated mothers tending to travel further, in both 1997 and 2003. However, there was some attenuation of this relationship at the very highest level of maternal education, perhaps because these families tended to live in more affluent areas, closer to high quality schools. There was no clear evidence for a relationship between maternal marital status and a child‘s engagement in learner mobility. Finally, again in both 1997 and 2003, there was a strong relationship between household SES and learner mobility, with children living in more advantaged families being substantially more likely to engage in mobility, and tending to travel greater distances. Finally, at the community level, the poverty level of the area in which the child lived was explored. Although this data was fairly complex, two general patterns were discernable. Firstly, there was a clear, but non-linear, relationship between distance from home to school, and area poverty. Those children travelling furthest tended to be living in areas that were either relatively affluent, or particularly poor. Secondly, however, there was also a fairly linear, and positive, relationship between the likelihood of a child attending his or her nearest school, and the affluence of the area in which the child lived. 309 The relationships between learner mobility and the variables discussed here substantiate the notion that two different types of school choice are in play in contemporary urban South Africa. Firstly, there is clear evidence that certain measures of mobility – particularly those relating to distance travelled – are associated with indicators of affluence, such as socio-economic status, maternal education, or living in a comparatively advantaged residential area. The extent to which a child‘s family has the means to engage in mobility is clearly one determinant of school choice involving mobility. Secondly, however, there is also evidence of mobility, particularly at a relatively local level, that is not strongly linked to affluence. Engagement in this more local mobility appears to be more closely related to the poverty of the area a child lives in, and by extension the quality of the local schools. This second type of mobility is particularly evident in the data around whether or not a child attends his or her nearest school, which suggests that even families with relatively limited means appear to be making use of school choice to obtain the best educational opportunities possible. 10.2.4 Potential school-level determinants of learner mobility The thesis also explored the relationship between a child‘s mobility, and a range of attributes of the school he or she attended. The first important finding here was that, on average, children attended schools that were more advantaged than would be expected on the basis of the schools closest to their homes. The second important finding was that there was a very strong relationship between child, family and community attributes, and the attributes of the school attended, with more advantaged children tending to attend more advantaged schools. Higher maternal education and household SES were particularly strong predictors of attending higher quality schools. By contrast, 310 children living in areas with high poverty levels were significantly more likely to attend poorer schools. The third important finding was the clear evidence for a strong relationship between mobility and the attributes of the school a child attended. Children attending private schools, higher quintile schools, schools with a lower proportion of black learners, schools charging higher fees, schools that did not historically fall under the DET, and schools with higher pass rates, were more likely to be engaged in learner mobility, largely regardless of how it was measured. This group of findings substantiates the notion that learner mobility and school choice are typically used to enable a child to access higher quality education than would ordinarily be the case. This appears to hold regardless of the type of mobility explored. While more advantaged children remain substantially more likely to attend more advantaged schools, it does appear to be the case that even less advantaged children and families are able to make use of school choice and learner mobility in such a way as to improve the educational opportunities that they are able to access. 10.2.5 Changes in mobility as children age While it was initially anticipated that mobility would increase substantially between 1997 and 2003, as the children aged, became more independent, and began to transition to high schools, this did not appear to be the case. Although children moving from a primary school to a high school did tend to experience a slight increase in distance, this appeared to be due to the smaller number of high schools available. There was, however, little evidence that children‘s travel behaviour changed substantially when the transitioned to high school. Children who were enrolled in primary schools at both points in time did also not experience any significant changes to their mobility. 311 Despite the relative consistency in children‘s mobility over time, school change was widespread across the sample, with over half of the sample making school changes other than those required by the transition to high school. Children living in poor areas, whose household SES increases with time, and whose mothers have intermediate levels of education appear to be the most likely to move between different primary schools. Although changing between two primary schools typically resulted in a small decrease in distance travelled to school, substantial changes in mobility were not evident. This suggests that school change during the primary school period may also be a strategy used in pursuit of the best available educational opportunities by children and families with resources to allow for substantial travel. These findings also provide support for the hypothesis that two different patterns of learner mobility are in play. Firstly, there is a group of children travelling fairly long distances. These children typically experience more stable primary schooling, and their mobility does not change substantially with the transition to high school. Secondly, there is a larger group of children attending relatively local schools. These children tend to experience more school changes during the primary school years, although these changes typically do not involve substantial changes in mobility. This data additionally suggests that not only are at least two patterns of school choice in play, but that the pattern in which a child is engaged appears to be fairly path-dependent. The pattern of mobility in which a child engages at the beginning of their schooling appears to remain relatively consistent even during the transition to high school. 10.2.6 Predicting mobility In Chapter 9 of the thesis, all of the variables discussed above were combined to generate models for each of the forms of mobility behaviour discussed. The results of these models are summarized in Table 10.2 below. 312 Variables Distance SP mobility MP mobility Nearest school Child level Race 1997 Black race associated with greater travel Black race associated with mobility Black race associated with mobility Black race associated with mobility 2003 Black race associated with greater travel Black race associated with mobility Black race associated with mobility Black race associated with mobility Gender 1997 -- -- -- -- 2003 -- -- -- -- Age at first enrolment 1997 -- -- -- Earlier enrolment associated with mobility 2003 -- -- -- -- Repetition 2003 -- -- -- -- Household level Maternal education 1997 Maternal education between Gr5 & Gr12 associated with greater travel -- Maternal education between Gr11 & 12 associated with mobility -- 2003 Maternal education beyond Gr8 associated with greater travel -- Post-school maternal education associated with mobility Maternal education between Gr5 & 7 associated with less mobility Maternal marital status 1997 -- -- -- -- 2003 -- Married mother associated with mobility -- -- Household SES 1997 -- SES quintile 2 & 4 associated with mobility SES quintile 2 associated with mobility -- 2003 -- SES quintile 3 associated with -- SES quintile 3 associated with mobility 313 mobility Community level Community poverty (SP) 1997 Living in a higher poverty area associated with slightly greater travel Living in a high or low poverty area associated with mobility Living in a higher poverty area associated with mobility Living in a higher poverty area associated with mobility 2003 -- Living in a higher poverty area associated with mobility Living in a higher poverty area associated with mobility Living in a higher poverty area associated with mobility School level School resources 1997 Attending a better school associated with greater travel Attending a better school associated with mobility Attending a better school associated with mobility Attending a better school associated with mobility 2003 Attending a better school associated with greater travel Attending a better school associated with mobility Attending a better school associated with mobility Attending a better school associated with mobility Table 10.2: Summarized results for the models of mobility developed in Chapter 9. Results presented are for regressions with robust errors. When all variables are controlled, being black, having a more educated mother, attending a more advantaged school, and living in a comparatively disadvantaged area all appeared to predict a greater distance between home and school, in both 1997 and 2003. That is, more advantaged children living in relatively disadvantaged areas, are likely to travel the greatest distances to get to school. When mobility was measured in a way that picked up local level school choice, all the variables listed above retained significance, with the exception of maternal education. The loss of a significant coefficient for maternal education suggests that household resource levels may be less critical 314 in shaping school choice and mobility at the local level. Again, this provides support for the notion that two patterns of travel are in evidence, one requiring substantially more in the way of resources than the other. 10.2.7 Developing an evidence based conceptual framework for the study of learner mobility Much of the evidence presented in this thesis has indicated that there are two distinct patterns of learner mobility in operation in contemporary urban South Africa. The first form appears to involve roughly 25% of children, and is fairly resource intensive, involving often substantial travel, typically to well- resourced schools in historically advantaged areas. The second form of learner mobility is far less resource intensive, and relates primarily to children who are attending local schools, but not the schools that are closest to their homes. Typically these children are travelling less than 1.5km each way. The conceptual framework proposed in Chapter 3 of this thesis is largely appropriate, in that child, family and community variables all do appear to be shaping decision making related to learner mobility, within the context of history, geography and policy. In light of the strong evidence for two patterns of mobility, however, it is appropriate to modify the outcome of the model to indicate that, typically, one of two distinct paths is followed. A child and his or her family are likely either to embark on the high resources mobility path, or the low resource mobility path, and are likely to remain on this path throughout the child‘s primary schooling, and during the child‘s transition to secondary schooling. The revised framework is presented in Figure 10.1 below. 315 Figure 10.1: Conceptual framework revised on the basis of study findings 10.3 Key contributions The findings summarized above provide a number of important original contributions to the body of scholarly work on school choice and learner mobility, both in South Africa and internationally. These contributions can be categorized as methodological, empirical and theoretical, and are discussed in these categories below. 10.3.1 Methodological contributions The thesis has made three innovative methodological contributions to the study of school choice and learner mobility. Firstly, as discussed above, it explored three different approaches to the measurement of educational mobility: straight-line distance from home to school; movement between different areas; Historical, geographical and policy context Decision making process - Desired educational outcomes - Investment required for desired outcomes - Investment constraints Residential location Child Househol d & Family Maternal High resource mobility – selection of historically advantaged schools Low resource mobility – selection of local schools 316 and whether or not a child attends the grade-appropriate school nearest to his or her home. As far as I can determine, combining the use of these three approaches has not previously been reported in either the South African or international literature. By using these three different operationalizations of mobility, it was possible to explore different dimensions of the phenomenon, leading to the observation that there are at least two distinct patterns of mobility in place in urban South Africa. Secondly, this thesis is innovative with regards to the type of data used. As discussed in Chapter 2, it is the first work of which I am aware to make use of panel data to explore school choice and educational mobility in South Africa. It is also the first work to make use of population level data for this purpose. Finally, it is the first project on school choice in South Africa that I have been able to identify that combines data drawn from a number of different sources to simultaneously explore the relationship of household, community and school- level variables to school choice and educational mobility. It has illustrated that these types of data can be used for these purposes, and, as will be discussed below, can provide novel theoretical and empirical contributions to current knowledge. As such, the study contributes to filling the gap created by the lack of empirical studies into the determinants of school choice, both in South Africa and internationally, that was identified in Chapter 2. 10.3.2 Empirical contributions As highlighted above, this thesis provides, for the first time, detailed quantitative data on learner mobility in contemporary urban South Africa, obtained at the population level. The study findings provide clear evidence of how widespread learner mobility is, and furthermore that mobility is not limited strictly to the most advantaged children, as was anticipated. Instead, it suggests that two patterns of school choice and learner mobility are fairly widespread: firstly, school choice requiring significant travel to historically advantaged schools, and by extension the investment of substantial economic 317 and other resources; and secondly, school choice at a more local level, which is less constrained by access to financial and social resources. This has important implications for dominant narratives about the lives of urban, working class children in South Africa. These children have typically been portrayed as disadvantaged (which they are), and even where the literature focuses on resilience, it is resilience in the context of hardship and disadvantage (Barbarin and Richter 2001). This narrative tends to portray children and families as passive in the face of difficult circumstances, and largely devoid of choice with regards to services. The data presented here, however, suggests that for a fairly substantial proportion of these children – perhaps 30% - engagement in social mobility, particularly through education, is evident. Although their home lives are located in a context of deprivation, they attend school in more advantaged areas, socializing with more advantaged children, and typically receiving a better education than they would in the school nearest to their home. This thesis has also presented clear evidence that the patterns of school choice and mobility identified are highly path-dependent. Children who begin their primary schooling on a socially mobile path typically continue to attend advantaged schools. By contrast, those who begin their schooling close to home are also likely to remain at these more local schools at least until the end of the primary phase. Additional original empirical contributions include the finding that mobile children typically attend a school more advantaged than the one nearest to their home, and that certain groups of children (particularly black children, from relatively well-off households and with more educated mothers and living in relatively disadvantaged areas) are more likely to engage in educational mobility than others. Finally, the finding that mobility behaviour is fairly stable and consistent over time, even following the transition to high school, is also novel, and largely unanticipated. This counters the widespread assumption that mobility increases as children age and become more independent, and 318 particularly once they transition to high school, and provides further evidence that mobility behaviour is fairly path dependent. 10.3.3 Theoretical contributions Finally, at a more theoretical level, the thesis presents a preliminary theoretical model detailing potential determinants of school choice and mobility. This model raises some questions about the traditional market orientation of school choice literature, both in South Africa and internationally. Certainly, in the sample explored here, a large proportion of the children engaged in school choice appeared to be doing so in ways that did not demand a very high level of economic investment. That is, many children are travelling within a constrained radius, making choices between a number of public schools with much the same fee structures and associated costs. While wealth appears to shape mobility, it is not as centrally important as might have been expected. School choice work in South Africa has tended to focus very heavily on those children able to access particularly advantaged schools, which typically requires the investment of substantial financial and social resources (Paterson and Kruss 1998; Sekete, Shilubane et al. 2001; Fiske and Ladd 2004). The current thesis, however, suggests that a great number of children are also engaged in another, less costly, form of school choice. As this form of choice still has implications for the opportunities available to children, is seems important that it receives closer attention in the future. At the international level, work exploring school choice in developing countries has also tended to focus very heavily on access to privileged schools, and paid less attention to the ways in which children and families seek to maximize their educational opportunities even in a context of very limited resources (Carnoy and McEwan 2003; Tsang 2003; Elacqua, Schneider et al. 2006).). Again, this thesis suggests that looking at school choices made by children facing resource constraints may prove very valuable. 319 It also raises questions about the series of hypotheses, common in both developing and developed country contexts, about schools and children who are ‗left behind‘ in the context of choice (Bridge and Blackman 1978; Capell 1981; Henig 1994; Witte and Thorn 1996; Levin 1998; Goldhaber 1999; Hoxby 2003; Peterson, Howell et al. 2003). If engagement in school choice is as widespread as this thesis suggests, that we have to widen our understanding of choice and its parameters in order to grasp the degree of agency exercised by parents and children, and how this might be leveraged to create higher demand for quality education. This feeds into the international debate, summarized in Chapter 2, about the relationship between school choice, particularly in relatively unregulated contexts, and educational segregation and inequality. This thesis suggests that although engagement in choice may be extremely widespread, and relatively unconstrained by economic and social resources, these resources still shape the ways in which choice can be exercised. Different socio-economic groups appear to access different forms of school choice, which in turn are likely to result in differing educational outcomes for their children. The international literature tends to suggest that more educated and advantaged parents are more strongly involved in school choice than less advantaged parents (Carnoy and McEwan 2003; Elacqua, Schneider et al. 2006). The findings presented here suggest that this may not actually be the case. Instead, it appears that less advantaged parents are highly engaged in choice, but simply lack the resources to pursue choice in the ways that are likely to be most beneficial to their children. 320 10.4 Contextual relevance 10.4.1 Relevance to South Africa The findings of this thesis suggest a range of implications for individuals, families, communities and society more broadly. These implications have some relevance to policy, and suggest a range of avenues for additional research and investigation. At the level of children and their families, the most important implications relate to child well-being. Engaging in school choice and mobility provides a child with access to educational opportunities they might not otherwise receive. In the South African context, learner mobility is also likely to be a path to social mobility for at least some children. However, these educational and social benefits do come at some cost. Economic costs, related to the cost of school fees and travel, may mean that a child‘s family has fewer resources available to meet other needs. In the South African context, the safety of a young child travelling substantial distances, typically alone, is a potential risk, and travel time is certainly a cost. An additional risk relates to the fact that for most children travelling substantial distances, their schooling will take place in what is essentially a foreign social context, and often in a language that they are not very familiar with. They will not have neighbourhood friendships and it will be more difficult for their parents to monitor their friendships and activities. In a society with as extensive societal and educational disparities as South Africa, it seems like that over the long term, the benefits of learner mobility for most children would tend to outweigh the costs. In the shorter term, however, the risks and costs for individual children are likely to remain quite high. The implications of widespread learner mobility in urban South Africa are, however, likely to be quite different at the community and societal levels. Here, it seems likely that while the short-term costs and risks are reduced, the longer term costs may be quite substantial. One group of particular concern is those children who are not able to engage the more resource-intensive forms of 321 educational mobility, for whatever reason, and are obliged to attend schools close to their homes. These children, along with the schools they attend, are likely to be negatively impacted by the tendency of all the more advantaged children in the area to attend schools further afield, as this will further reduce the resources available at the local level. With the out-migration of more advantaged children, and a relatively captive market of disadvantaged children, poorly performing schools may lose the incentive to try to improve. Over the longer term, high levels of mobility are also likely to have harmful implications for community coherence, and may well contribute to growing levels of inequality, both economic and educational, within historically disadvantaged areas. The current distribution of educational opportunities, which motivates the high levels of mobility identified in this thesis, are also hugely inefficient, requiring large investments on the parts of families and children, even while they potentially exacerbate already high societal inequality. This pattern, in which learner mobility appears likely to be beneficial for individual children, particularly over the longer term, but also very costly at the level of their community and society, poses real challenges for policy makers. How to balance these various costs and benefits is a challenging question, particularly in a society already so deeply challenged by inequality, and a poorly performing educational system. This is further complicated by the very strong incentives faced by individuals to continue engaging in mobility, regardless of the policy environment. The findings presented in this thesis do raise questions about the validity of certain core elements of South African educational policy, such as the concept of our schools as ―community schools‖, and the notion that a school‘s access to resources can be determined by looking at its location, as opposed to the composition of its student body. The complexity of the results presented, however, combined with the preliminary, hypothesis-building nature of the study itself does, however, suggest that much more needs to be known about how school choice operates in South Africa, 322 and how its costs and benefits play out at the individual and societal levels, before making substantial changes to policy. For this reason, this thesis does not provide a comprehensive discussion of policy relevance at this point. 10.4.2 International relevance The findings of this thesis also have implications for both policy and future research internationally. The thesis has highlighted just how widespread it is possible for school choice to be in a developing country context with limited regulation. South Africa has some distinctive features which limit the generalizability of this study‘s results, such as the fact that in South Africa school performance is very closely associated with geography, and the fact that South Africa‘s public schooling system is much larger than those found in most other countries at similar levels of development, and contains a subset of very well performing schools. Nonetheless, South Africa‘s high level of inequality, limited school choice regulation (or capacity to enforce regulation) and a generally poorly performing public schooling system are all shared with a number of other countries at similar levels of developments. The extent of school choice in South Africa suggests that levels of choice in other similar countries can also be expected to be high. Similar studies in other low and middle income countries would therefore be extremely illuminating. One question that would be particularly useful to ask is whether school choice in other countries, particularly those where private schooling is more widespread, is more strongly economically driven than choice in South Africa. Additionally, understanding whether less advantaged children in other countries also exercise local level school choice will be critical in determining how best to finance public schools and enhance access to high quality education. Understanding the extent of school choice, the forms which it takes, and its determinants is essential to developing appropriate ways to capture this engagement of children and families with educational systems in ways to enhance the performance of both schools and their students. 323 If indeed school choice is widespread in other developing countries, this also suggests a need to think about ways of protecting the most vulnerable groups of children from potential choice-related harm. For example, if more advantaged children are enrolling predominantly in private schools, this deprives the public system of resources, and is likely to enhance societal inequality. This raises questions as to whether voucher systems, such as those found in Chile, or other innovative approaches to school funding should be tested more widely in other developing countries. 10.5 Project limitations and future work 10.5.1 Sample composition One of the major limitations of the thesis is the relatively constrained sample that has been used for analysis. Although, as discussed, this was unavoidable for practical reasons, an ideal next step is to broaden the study sample so that it also includes children who do change their residential addresses during the period under consideration. This more representative sample would ensure that findings can be more broadly generalized. In addition, this sample would also allow for the exploration of potential interactions between residential and educational mobility. 10.5.2 Study end point and longitudinal analysis A second limitation is that only a proportion of the study sample members progressed to high school by 2003. As those children who had progressed to high school by 2003 differed systematically from those who had not, this introduced challenges around using the data to understand whether mobility during primary and high school differed. Although preliminary analysis suggested a great degree of path-dependency and consistency over time, it also appeared to be the case that secondary school status had implications on 324 mobility behaviour, at the very least due to the different geographical distributions of primary and secondary schools. Expanding the sample longitudinally so that all study sample members reach secondary schooling would facilitate analyses exploring the extent to which mobility changes at the secondary school level, and why this is the case. It would also facilitate the use of more complex longitudinal analyses, which would provide clearer and more valuable data on the nature of changes in mobility behaviour over time. The construction of a genuine longitudinal dataset, in which home and schooling data were available for each year during a child‘s schooling would also enhance the value of further analysis, by allowing for the use of more advanced analytical techniques. This would also provide clearer evidence around school change during primary and secondary schooling, and potential motivating factors for this. Introducing data for a more contemporary cohort of children, although likely to be extremely difficult and costly, would also allow exploration of whether patterns of mobility have changed over time, since the beginning of the post-Apartheid period. 10.5.3 Methodological approach Finally, as with any methodological approach, the use of quantitative secondary analysis in this project imposed a number of limitations. Perhaps most importantly, it makes it extremely difficult to provide answers to questions around the individual decision-making processes underlying the decision to engage in learner mobility. However, it is valuable in highlighting correlates of mobility, and by extension generating hypotheses about the types of decisions which individuals may be making. These data-driven hypotheses can then be tested by subsequent research using different methodological approaches. Population based quantitative secondary analysis is also fairly limited in the extent to which it can describe the implications and outcomes of learner mobility. School and community level outcomes, in particular, can only really be tested with data collected at those levels. To understand individual 325 level outcomes, a far more longitudinal approach, along with far more detailed and specialised data, particularly regarding outcomes, than that used here would be necessary. 10.5.4 Future work There are a number of ways in which the work presented in this thesis could be usefully extended. Firstly, as alluded to above, broadening the study sample will produce more generalizable findings, and allow for an exploration of the interactions between learner mobility and residential mobility. Secondly, extending the dataset longitudinally will allow for the examination of learner mobility in the high school period, as well as for the application of more sophisticated tools for longitudinal analysis11. Thirdly, study methodology could be developed further in a few directions, to strengthen study findings. For example, it would be possible to generate a measure of ‗non-essential‘ travel to school, by looking at the difference between the distance to a child‘s nearest school, and the school a child attends. This might provide a more accurate measure of travel related to choice. Additional approaches to dealing with the highly non-normal distribution of the travel data could also be usefully explored, and in particular, more systematic approaches to dealing with outliers. Developing measures of practical or substantive significance to accompany the presentation of measures of statistical significance would also contribute usefully to the interpretation of study findings.12 This study would also greatly benefit from the introduction of a qualitative component. This could be used to test the hypotheses that this thesis has generated. For example, do motivations for mobility (and constraints in engaging in mobility), as experienced by children and their families, tie in with 11 Funding for this extension of the project has been obtained through an ESRC Pathfinders grant, starting in June 2011. 12 I am indebted to my examiners for suggesting these methodological developments that would contribute to strengthening future work. 326 the results presented in the previous chapters?13 Do children and their families identify with the notion that there are two distinct patterns of school choice, and by extension mobility, in play in contemporary urban South Africa? Qualitative work would also allow for further unpacking of the decision making process, providing insight into ways in which families select the schools at which they pursue enrolment for their children. Finally, it would provide a way to integrate the current work more strongly with the international literature, by examining how and why school choice in the South African context differs from experiences in other countries. A final enhancement to work using the Bt20 data would be to develop more sophisticated approaches to the measurement of variables used in the study. In particular, alternative measures of school resource levels, community affluence and coherence, and household SES could all usefully be explored. An important set of questions which this thesis has not adequately explored are those relating to the supply-side geography of schools in South Africa. A clearer understanding of how schools with different attributes are distributed across space would enrich our understanding of how and why children travel to go to school. Incorporating supply side variables into the models of mobility presented in this thesis would strengthen them considerably. For example, poor supply of schools in local areas may be one of the reasons that mobility is fairly high amongst the least advantaged sample members. An exploration of the correlation between school density and population density in different areas would also provide some indication as to whether supply side issues are behind much of the mobility documented in this thesis. Additionally, mapping the distribution of schools by the languages that they operate in will answer questions about whether high levels of local level mobility might be explained in part by language of schooling. These types of analyses can be accomplished, 13 Given the difficulty of accurately measuring school characteristics, answering this question is particularly important, as it would address the concern that the high levels of mobility documented in this study relate more to a highly stochastic schooling environment. Many thanks to my examiners for highlighting this concern to me. 327 at least in part, with the data already compiled for this thesis, and would provide some valuable extensions to study findings14. Moving beyond the current study, similar work applied to populations in other parts of South Africa would also be very useful in establishing the extent to which the patterns identified here are prevalent throughout the country. A similar study in a rural area, where school choice would be expected to be far more limited due to the lower density of available schools would be particularly valuable. Similar analyses applied to other low and middle income countries would also help to shed light on just how widespread school choice is in other contexts, and particularly the extent to which less advantaged children are able to participate in it. This study also suggests a number of additional, related questions that might usefully be pursued. One question relates to the roles of schools in shaping school choice outcomes. For example, to what extent are children not able to enrol in the school they select, for example for reasons of overcrowding, lack of social capital or knowhow, or even overt discrimination? A second question relates to the implications of school choice for academic outcomes, both for learners, and for entire schools. This issue lies at the core of much of the international debate around school choice, and is a critical question which this thesis has not been able to address. This issue is likely to be particularly complex in the South African context, where the potential costs of mobility (ranging from economic, to travel time, to learning in a language not used at home) are extremely high, but variations in public school quality mean that potential benefits are also substantial. Enhancing the dataset by the inclusion of data on academic outcomes at the individual level would allow for an exploration of the relationship between mobility, school choice, and academic outcomes. A final set of additional questions relate to the implications of 14 I am indebted to my examiners for highlighting to me the importance that work of this type would have, and suggesting ways in which it might usefully be approached in the near future.. 328 learner mobility and school choice for social mobility. To what extent does the ability to access education at a historically advantaged school determine the opportunities available to a child as he or she moves through school, and then into higher education or the workplace? 10.6 Conclusion This final chapter has provided a brief overview and summary of the key findings presented in this thesis. These highlight the original contributions that this thesis makes to the scholarly literature on learner mobility and school choice. Methodological contributions have included new approaches to the identification and measurement of learner mobility, as well as the use of population-based panel data, combined with data from other sources, to study the phenomenon. At the empirical level, the study has contributed data on the extent, correlates and determinates of learner mobility. Finally, at the theoretical level, the study has contributed a conceptual framework to support other work on the topic, and the insight that in contemporary urban South Africa, there are at least two forms of learner mobility in play. The study also feeds into broader international debates about the implications of school choice to educational segregation and inequality. The chapter concludes by mapping out a spectrum of further research possibilities. 329 Appendix A: Alternative data sources considered for the thesis A.1 Cape Area Panel Study (CAPS) CAPS data collection began in 2002, and has been focused largely on a sample of almost 5000 young adults who were then aged between 14 and 22. Most of these participants have been interviewed four times since 2002, and additional interviews have been conducted with their households, as well as some additional households and older individuals. As indicated by the study‘s name, the sample was drawn from the greater Cape Town area. Participants have been asked a wide range of questions over the four waves of the study for which data is available to date. These questions include school enrolment, both current and historical, as well as reasons for enrolment at particular schools, and for changes in school enrolment over time. The study‘s second wave also includes a module in which perceptions of school quality for a range of neighbourhood schools are explored for each child (Lam, Ardington et al. 2008; Lam, Ardington et al. 2008). While the nature of the schooling data available in the study made it an extremely strong candidate for use in this dissertation, the decision against using it was finally made largely on the basis of its focus on the Cape Town area, which is well known for having a far stronger educational system than that found anywhere else in South Africa (Fiske and Ladd 2004)15. Additionally, the Cape Town area differs substantially from the rest of the country in terms of its population and its socio-economic conditions. For this reason, it was felt that while the data from the study would certainly enable a clear understanding of learner mobility in the Western Cape, this understanding would be unlikely to travel well across the rest of South Africa. 15 Although the 2010 Matric examination results, in which Gauteng province outperformed the Western Cape, may signal a shift in this pattern. 330 An additional concern related to the study‘s focus on data collection on youth older than 14, as this would make it difficult to explore learner mobility during primary schooling, and during the transition to secondary schooling. A.2 Kwa-Zulu Natal Income Dynamics Survey (KIDS) The KIDS study is a longitudinal dataset, with a focus primarily on poverty and inequality in the KwaZulu Natal (KZN) region of South Africa. It developed out of a cross-sectional study, the 1993 Project for Statistics on Living Standard and Development (PSLSD), and the PSLSD data collected in KZN in 1993, covering 1558 households, forms the first wave of KIDS. Households interviewed for PSLSD were followed up, and re-interviewed in 1998, forming the second wave of KIDS data. A 3rd round of interviews was conducted in 2004, with 865 households (including households previously interviewed, and next-generation households which had split off from households previously interviewed). During each interview wave, a detailed household roster was completed, which in 1998 and 2004, along with socio- economic information included current school enrolment of all school-aged children. For each household, address information and GIS coordinates were also collected (May, Carter et al. 1999; May, Aguero et al. 2007). While the KIDS dataset did include all the information essential for this dissertation, there were a number of ways in which the sample was not ideal. Firstly, while the study itself is longitudinal in nature, this was focused at the household level, rather than at the level of particular children or other individuals. Therefore, while longitudinal data is available on some children, this is not true of all children across the sample, and depends heavily on their mobility and relationship to the household head. This would have made identifying a sample appropriate to longitudinal analysis very challenging. The relatively high levels of attrition experienced by KIDS (around 38% over 11 years), while unsurprising given the long intervals between the waves of data collection and the longitudinal nature of the study, also raise concerns about 331 how representative the remaining sample is, particularly when considering that the relevant data would only be available for a sub-sample of children, unlikely to be randomly distributed amongst households. Nonetheless, I had initially hoped to make some use of the KIDS data, to provide some insight into the variations in learner mobility between urban and rural areas, as it appeared to be the best potential source of data on the enrolment of rural learners in South Africa. However, preliminary communication with the KIDS research team revealed that a substantial amount of preparatory work on the KIDS school enrolment data would be required for it to become useable for this dissertation, which, in light of time and resource constraints, resulted in a final decision not to use this data for the current project. A.3 National Income Dynamics Survey (NIDS) NIDS was developed to answer very similar types of questions to those addressed in KIDS, but at a national level. Areas of primary focus included household wealth creation, demographic dynamics, social heritage, and access to services and cash transfers. During its first wave of data collection NIDS collected data on 7305 households, consisting of 28255 individuals, distributed across all nine provinces of South Africa (Leibbrandt, Woolard et al. 2009). While much of the essential data on residential addresses and schooling was collected, the same limitations detailed in the discussion of KIDS, relating to the focus on households rather than individual children or youth, continue to hold. In addition, ethical protections on the NIDS data appear to prohibit the release of residential address data to researchers altogether. However, the main reason that it was not possible to seriously consider making use of NIDS data for this dissertation was that formal data collection only began in 2008, and data release only occurred from mid 2009. Waiting for this data would have therefore substantially delayed this project. In addition, even the data available 332 in 2009 would only have been cross-sectional, preventing any longitudinal analysis. A.4 Agincourt Health and Demographic Surveillance System (Agincourt) As suggested by the name, the Agincourt data is concerned primarily with health and demographics. Data is collected on the full population of around 82000 individuals living in the Agincourt sub-district of Bushbuckridge, South Africa. A baseline census of the area was conducted in 1992 (Kahn, Tollman et al. 2007). This data has subsequently been updated 12 times, most recently in 2008. Unfortunately, this dataset did not prove to be suitable for this project, as the education data collected was extremely limited. School enrolment data was collected at 4 points during the study, but was not in a form that made it feasible to use for this study, particularly given the large sample size. A.5 Africa Centre for Health and Population Studies The Africa Centre is located in rural KwaZulu Natal, South Africa, and serves as the base for a number of research project, including the longitudinal collection of demographic and health data through the Africa Centre Demographic Information System (ACDIS). ACDIS data collection began in 2000, and is ongoing, with data collection each year. ACIDS covers about 90 000 individuals in approximately 11 000 households, including those household members who are not resident in the area. Unfortunately, the educational data collected in this study is limited to attainment, and details regarding the school enrolment of individuals are not available (Herbst, Newell et al. 2010). For this reason it was not possible to pursue the use of this data set for this dissertation. 333 Appendix B: Letter confirming approval of ethics clearance for thesis, received from the School of Education, University of the Witwatersrand 334 Wits School of Education STUDENT NUMBER: 386849 Protocol: 2009ECE01 7-May-09 Ms. Julia de Kadt 86 Glenroy Road MANOR GARDENS DURBAN 4001 Dear Ms. de Kadt Application for Ethics Clearance: Doctor of Philosophy I have pleasure of advising you that the Ethics Committee in Education of the Faculty of Humanities, acting on behalf of the Senate has agreed to approve your application for ethics clearance submitted for your proposal entitled: Learner Migration in South Africa: Dimensions and implications for Educational Practice and Policy. 27 St Andrews Road, Parktown, Johannesburg, 2193 • Private Bag 3, Wits 2050, South Africa Tel: +27 11 717-3007 • Fax: +27 11 717-3009 • E-mail: enquiries@educ.wits.ac.za • Website: www.wits.ac.za 335 Recommendation: Ethics clearance is granted Yours sincerely Matsie Mabeta Wits School of Education Cc Supervisor: Prof. B Fleisch (via email) 336 Appendix C: Relationships within the study sample between variables hypothesized to act as determinants of learner mobility C.1 Race and other variables Given the small numbers of white and Indian children in the study sample, all further discussion of race is limited to the black and coloured groups only. There is no evidence for a relationship between race and gender. Similarly, there is no evidence for a difference between age at first enrolment for black and Coloured children, with slightly over half the members of each group enrolling on time. Coloured children are slightly more likely to have reached secondary school by 2003 than black children (χ2(1)= 3.4241, Pr=0.064), and are somewhat less likely to have repeated any grades (χ2(1)= 4.0985, Pr=0.043). Race is also related to the various household and family variables considered. Maternal education is slightly higher for black children than coloured children (Wilcoxon rank-sum test, Pr=0.0781). Black mothers were also less likely to be married (χ2(1)= 44.4026, Pr=0.000) than coloured mothers. Wilcoxon rank- sum tests indicate that ethnicity and household SES, for both 1997 and 2003, are significantly related (Pr=0.0000), with coloured children having lower household SES than black children. By contrast, however, black children live in SAL, SP and MP areas with higher poverty levels than coloured children (Wilcoxon rank-sum tests, Pr=0.0000). 337 C.2 Gender and other variables Gender of the child is significantly related to age at first enrolment (χ2(1)= 5.1999, Pr=0.023) and grade repetition (χ2(1)=36.0374, Pr=0.000), with girls more likely to start early or on time, and only about half as likely as boys to repeat a grade. Given that girls tend to start their schooling earlier, and are less likely to repeat grades, it comes as no surprise that they are also significantly more likely to have reached high school by 2003 (χ2(1)=23.3055, Pr=0.000). There is no evidence for any relationship between child gender and maternal education, household SES in 1997 or 2003, or the poverty level of the area in which the child lives. Although a weakly significant relationship between child gender and maternal marital status is found (χ2(1)=3.5817, Pr=0.058), this seems likely to be spurious. C.3 Age at first enrolment and other variables Predictably, there is a strong relationship between age at first enrolment and phase of schooling in 2003 (χ2(1)=503.7355, Pr=0.000), with children who started school late being unlikely to have reached high school at this point. There is, however, no evidence of a relationship between age at first enrolment and grade repetition. There is also no evidence that age at first enrolment is related to maternal education, maternal marital status, or household SES in 1997. By contrast, there is a significant relationship between age at first enrolment and household SES in 2003 (χ2(4)=15.8754 Pr = 0.003), with more advantaged children tending to enrol later. Finally, there are significant relationships between age at first enrolment and area poverty at the SAL (χ2(4)=8.3949, Pr=0.078) and SP (χ 2 (4)=9.8775, Pr=0.043) levels, with children living in wealthier areas being more likely to start school late. There was no relationship between MP level poverty and age at first enrolment. 338 C.4 Phase of education in 2003 and other variables Predictably, children who repeated a grade between 1997 and 2003 are significantly less likely to have reached high school by 2003 (χ2(1)= 313.9621, Pr=0.000). Children of more highly educated mothers are also more likely to have reached high school by 2003 (χ2(1)=32.9057, Pr=0.000). There is no evidence of a relationship between maternal marital status and the child‘s phase of education by 2003. There is a weakly significant relationship between phase of education in 2003 and household SES for 1997 (χ2(4)=8.9023, Pr=0.064), but when 2003 household SES is used, this effect disappears. There is no association between phase of education in 2003 and the poverty of the area, SAL, SP or MP, in which a child lives. C.5 Grade repetition and other variables Children with less highly educated mothers are more likely to have repeated a grade (χ2(4)=45.6920, Pr=0.000). Children whose mothers were unmarried at their birth are also more likely to have repeated a grade (χ2(1)=3.7110, Pr = 0.054). Repetition is also significantly related to household SES, both in 1997 (χ2(4)=50.9931, Pr=0.000) and 2003 (χ 2 (4)=45.7106, Pr=0.000), with children in more affluent households being less likely to have repeated a grade. Finally, there is also a positive relationship between repetition and the poverty level of the area in which a child lives, whether this is calculated at the SAL (χ2(4)=33.9833, Pr=0.000), SP (χ 2 (4)=28.7967, Pr=0.000) or MP (χ 2 (2)=17.6835, Pr=0.000) level. However, these relationships are not strictly linear throughout all poverty quintiles, and are most marked at the extremes. C.6 Maternal education and other variables A positive, significant, but non-linear relationship exists between maternal education and maternal marital status, both measured at the time of the child‘s 339 birth (χ2(4)= 16.3190, Pr = 0.003). The proportion of married mothers is highest amongst mother with post-school education, followed by those with primary schooling or less, while rates are lowest amongst mothers with partial or complete secondary schooling. There is also a significant, positive relationship between maternal education and household SES, both for 1997 (Kruskal- Wallis test, Pr=0.0001) and for 2003 (Kruskal-Wallis test, Pr=0.0001). Maternal education is also significantly related to area poverty, measured at the SAL (χ2(16)=74.1141, Pr=0.000 ), SP (χ 2 (16)=57.4465, Pr=0.000 ), and MP (χ2(8)=21.5601, Pr=0.006 ) levels, with more educated mothers tending to live in more advantaged areas. C.7 Maternal marital status and other variables Maternal marital status is significantly related to household SES in both 1997 (χ2(4)=130.5849, Pr=0.000) and 2003 (χ2(4)=54.9065, Pr=0.000), with mothers in more affluent households more likely to be married. Similarly, mothers living in more advantaged areas are also significantly more likely to be married, whether area poverty is measured at the SAL (χ2(4)=121.2915, Pr=0.000), SP (χ2(4)=134.1958, Pr=0.000) or MP (χ 2 (2)=55.5876, Pr=0.000) level. C.8 Household SES and residential area poverty levels Household SES in 1997 and 2003 are strongly related, with a correlation of 0.7655 (Pr=0.000). Household SES, measured in both 1997 and 2003, is also strongly and inversely related to the poverty area in which the child lives (see Table A3.1 below), with household SES tending to be higher in the areas with the lowest poverty levels. 340 Relationship between household SES and residential area poverty SAL poverty level SP poverty level MP poverty level Household SES, 1997 χ2(16)=381.0466 Pr=0.000 χ2(16)=298.4818 Pr=0.000 χ2(8)=164.0330 Pr=0.000 Household SES, 2003 χ2(16)=229.4532 Pr=0.000 χ2(16)=230.2197 Pr=0.000 χ2(8)=68.0648 Pr=0.000 Table A.1: Significance of relationships between household SES and residential area poverty levels C.9 Conclusion All relationships documented here appear to operate in the expected direction. There is evidence that a number of the variables considered are strongly related to each other, as expected. This implies that during the modelling component of the thesis, attention must be paid to avoiding multicollinearity. 341 References Ahmed, R. and Y. Sayed (2009). "Promoting access and enhancing education opportunities? The case of ‗no-fees schools‘ in South Africa." Compare: A Journal of Comparative and International Education 39(2): 203 - 218. Alderman, H. and E. M. King (1998). "Gender differences in parental investment in education." Structural Change and Economic Dynamics 9(4): 453-468. Andrabi, T., J. Das, et al. (2009). What Did You Do All Day? Maternal Education and Child Outcomes. Astin, A. W. (1992). "Educational "Choice": Its Appeal May be Illusory." Sociology of Education 65(4): 255-260. Bagley, C. (1996). "Black and White Unite or Flight? The Racialised Dimension of Schooling and Parental Choice." British Educational Research Journal 22(5): 569-580. Barbarin, O. A. and L. M. Richter (2001). Mandela's children: Growing up in post-Apartheid South Africa. New York, Routledge. Bifulco, R. and H. Ladd (2006). School choice, racial segregation and test- score gaps: evidence from North Carolina's charter school programme. . Annual Meeting of Allied Social Science Associations, Boston, MA. Bosetti, L. (2004). "Determinants of school choice: understanding how parents choose elementary schools in Alberta." Journal of Education Policy 19(4): 387 - 405. Bradley, S. and J. Taylor (2002). "The effect of the quasi-market on the efficiency-equity trade-off in the secondary school sector." Bulletin of Economic Research 54(5): 295-314. Bray, R., I. Gooskens, et al. (2010). Growing up in the new South Africa: Childhood and adolescence in post-apartheid Cape Town. Cape Town, South Africa, HSRC Press. Bridge, R. and J. Blackman (1978). A study of alternatives in American education: Vol IV: Family choice in schooling. Santa Monica, CA RAND Corporation. Bryman, A. (2004). Social research methods. Oxford, United Kingdom, Oxford University Press. Capell, F. (1981). A study of alternatives in American education: Vol VI: Student outcomes at Alum Rock, 1974-1976. Santa Monica, CA RAND Corporation. Carnoy, M. (1999). Globalizaton and educational reform: what planners need to know. Paris, France UNESCO. Carnoy, M. and P. McEwan (2003). Does privatization improve education? The case of Chile's national voucher plan. Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers College Press, Columbia University: 24-44. 342 Centre for Development and Enterprise (2010). Hidden assets: South Africa's low-fee private schools. Johannesburg, South Africa. Chen, X., P. Ender, et al. (2011). Stata web books: Logistic regression with Stata, UCLA Academic Technology Services. Chisholm, L. (2004). Changing Class: Education and Social Change in post- Apartheid South Africa. Cape Town HSRC Press. Chisholm, L. (2005). The state of South Africa's schools. State of the nation: South Africa 2004-2005. J. Daniel, R. Southall and J. Lutchman. Cape Town, HSRC Pres: 201-226. Coleman, J. S. (1992). "Some Points on Choice in Education." Sociology of Education 65(4): 260-262. Cosser, M. and J. du Toit (2002). From school to higher education? Factors affecting the choices of Grade 12 learners. Cape Town, HSRC Publishers Daun, H. (2003). Market forces and decentralization in Sweden: Impetus for school development or threat to comprehensiveness and equity? Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers College Press, Columbia University: 92-111. De Jong, G., F. (2000). "Expectations, Gender, and Norms in Migration Decision-Making." Population Studies 54(3): 307-319. Denessen, E., G. Driessena, et al. (2005). "Segregation by choice? A study of group-specific reasons for school choice." Journal of Education Policy 20 (3): 347-368. Department of Education (2000). Brochure for the 2000 School Register of Needs Report. du Toit, J. (2003). Independent Schooling. Human Resources Development Review 2003: Education, employment and skills in South Africa. A. Kraak and H. Perold. Cape Town, HSRC Press: 380-395. Elacqua, G. (2006). Enrollment practices in response to vouchers: evidence from Chile. Elacqua, G., M. Schneider, et al. (2006). "School choice in Chile: Is it class or the classroom?" Journal of Policy Analysis and Management 25 (3): 577-601. Fedderke, J. W., R. de Kadt, et al. (2000). "Uneducating South Africa: The failure to address the 1910-1993 legacy." International Review of Education 46(3): 257-281. Filer, R. and D. Munich (2003). Public support for private schools in Post- Communist Central Europe. Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers Colllege Press, Columbia University: 196-222. Filmer, D. and L. Pritchett (2001). "Estimating wealth effects without expenditure data—or tears: an application to educational enrollments in states of India." Demography 38(1): 115-132. Fiske, E. and H. Ladd (2000). When schools compete: a cautionary tale. Washington DC, Brookings Institution Press 343 Fiske, E. and H. Ladd (2004). Elusive equity : education reform in post- apartheid South Africa. Washington D.C. , Brookings Institute. Fiske, E. B. and H. F. Ladd (2005). Racial equality in education: how far has South Africa come? Terry Sanford Institute of Public Policy Working Paper series, Duke University. Fleisch, B. (2008). Primary Education in Crisis: Why South African Schoolchildren Underachieve. Cape Town, South Africa, Juta. Fleisch, B. and J. Schindler (2008). Gender repetition: School access, transitions and equity in the 'Birth-to-Twenty' cohort panel study in urban South Africa. Fleisch, B. and J. Schindler (2009). Patterns and prevalence of school access, transitions and equity in South Africa: Secondary analyses of BT20 large-scale data sources. CREATE Pathways to Access Research Monographs. Brighton, United Kingdom, Consortium for Research on Educational Access, Transitions and Equity. Garcia, D. R. (2008). "Academic and Racial Segregation in Charter Schools: Do Parents Sort Students Into Specialized Charter Schools?" Education and Urban Society 40(5): 590-612. Ginsburg, C., S. Norris, et al. (2009). "Patterns of residential mobility amongst children in greater Johannesburg-Soweto, South Africa: observations from the Birth to Twenty cohort." Urban Forum 20 397-413. Ginsburg, C., L. M. Richter, et al. (2010). "An analysis of associations between residential and school mobility and educational outcomes in South African urban children: The Birth to Twenty cohort." International Journal of Educational Development In Press. Glazerman, S. M. (1998). School Quality and Social Stratification: The Determinants and Consequences of Parental School Choice. Annual Meeting of the American Educational Research Association. San Diego, CA,: April 13-17, 1998. Godwin, K., F. Kemerer, et al. (1998). "Liberal Equity in Education: A Comparison of Choice Options." Social Science Quarterly (University of Texas Press) 79(3): 502-522. Godwin, K., S. Leland, et al. (2006). "Sinking Swann : public school choice and the resegregation of Charlotte's public schools." Review of Policy Research 23 (5): 983-997. Goldhaber, D. (2000). "School Choice: Do We Know Enough?" Educational Researcher 29(8): 21-22. Goldhaber, D. and E. Eide (2002). "What do we know (and need to know) about the impact of school choice reforms on disadvantaged students?" Harvard Educational Review 72(2): 157-176. Goldhaber, D. D. (1999). "School Choice: An Examination of the Empirical Evidence on Achievement, Parental Decision Making, and Equity." Educational Researcher 28(9): 16-25. Goldring, E. B. and C. S. Hausman (1999). "Reasons for parental choice of urban schools." Journal of Education Policy 14: 469-490. 344 Gorard, S. and J. Fitz (2000). "Investigating the determinants of segregation between schools." Research Papers in Education 15(2): 115-132. Gorard, S. and J. Fitz (2006). "What counts as evidence in the school choice debate?" British Educational Research Journal 32(6): 797-816. Gorard, S., J. Fitz, et al. (2001). "School Choice Impacts: What Do We Know?" Educational Researcher 30(7): 18-23. Greenberg, J. P. (2011). "The impact of maternal education on children's enrollment in early childhood education and care." Children and Youth Services Review In Press, Corrected Proof. Greene, J., T. Loveless, et al. (2010). Expanding choice in elementary and secondary education: A report on rethinking the federal role in education, Brown Center on Education Policy at Brookings. Hanson, K. and L. Litten (1982). Mapping the road to academia: A review of research on women, men, and the college selection process. The undergraduate woman: Issues in Educational Equity. P. Perun. Lexington, Lexington Books. Henig, J. (1994). Rethinking school choice: Limits of the market metaphor. Princeton, NJ Princeton University Press. Herbst, K., M. L. Newell, et al. (2010). "Study summary: ACDIS." Hofmeyr, J. and S. Lee (2004). The new face of private schooling. Changing Class. L. Chisholm. Cape Town, HSRC Press: 143-174. Holmes, J. (2002). "Buying homes, buying schools: school choice and the social construction of school quality." Harvard Educational Review 72 (2): 177-205. Howe, L. D., J. R. Hargreaves, et al. (2008). "Issues in the construction of wealth indices for the measurement of socio-economic position in low- income countries." Emerging Themes in Epidemiology 5(3). Hoxby, C. M. (1998). "What Do America's 'Traditional' Forms of School Choice Teach Us About School Choice Reforms?" Economic Policy Review, Vol.4, No.1, March 1998. Hoxby, C. M. (2002). "School Choice and School Productivity (or Could School Choice be a Tide that Lifts All Boats?)." SSRN eLibrary. Hoxby, C. M. (2003). The economics of school choice. Chicago, The University of Chicago Press. Hoxby, C. M. (2003). Introduction. The economics of school choice. C. M. Hoxby. Chicago, The University of Chicago Press: 1-22. Hunter, M. (2010). Racial desegregation and schooling in South Africa: contested geographies of class formation. Johnson, D. (2007). "Building citizenship in fragmented societies: The challenges of deracialising and integrating schools in post-Apartheid South Africa." International Journal of Educational Development 27(3): 306-317. Kahn, K., S. M. Tollman, et al. (2007). "Research into health, population and social transitions in rural South Africa: Data and methods of the Agincourt Health and Demographic Surveillance System." Scandinavian Journal of Public Health 35(3 supp 69): 8-20. 345 Kanjee, A. (2007). Improving learner achievement in schools: applications of national assessments in South Africa. State of the Nation: South Africa 2007. S. Buhlungu, J. Daniel, R. Southall and J. Lutchman. Cape Town, HSRC Press: 470-499. Kanjee, A. and A. Chudgar (2009). Accuracy of the poverty quintile system for classifying South African schools. 2nd Monitoring and Evaluation Colloquim, Gauteng Department of Education. Sandton, Johannesburg. Kanjee, A. and A. Chudgar (2009). "School money: funding the flaws." HSRC Review 7(4). Karlsson, J. (2007). Mobility and equity: school transport, cost of schooling and class formation in post-apartheid South Africa. Dilemmas of implementing education reforms: Explorations from South Africa and Sweden. C. Odora Hoppers, U. P. Lundgren, J. Pampallis, E. Motala and E. Nihlfors. Uppsala, Sweden, STEP, Uppsala Universitet. Karsten, S., C. Felix, et al. (2006). "Choosing Segregation or Integration?: The Extent and Effects of Ethnic Segregation in Dutch Cities." Education and Urban Society 38(2): 228-247. Klasen, S. (2002). "Low Schooling for Girls, Slower Growth for All? Cross‐Country Evidence on the Effect of Gender Inequality in Education on Economic Development." The World Bank Economic Review 16(3): 345-373. Kolenikov, S. and G. Angeles (2008). Socioeconomic status measurement with discrete proxy variables: is principal component analysis a reliable answer? Kristen, C. (2005). School choice and ethnic school segregation: Primary school selection in Germany. Munster, Waxmann Verlag GmbH. Ladd, H. (2003). Introduction. Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers College Press: Columbia University: 1-23. Lam, D., C. Ardington, et al. (2008). The Cape Area Panel Study: a very short introduction to the integrated waves 1-2-3-4 data, The University of Cape Town. Lam, D., C. Ardington, et al. (2008). The Cape Area Panel Study: Overview and Technical Documentation, Waves 1-2-3-4 (2002-2006). University of Cape Town. Lam, D., C. Ardington, et al. (2008). Schooling as a lottery: racial differences in school advancement in urban South Africa. Southern Africa Labour and Development Research Unit Working Paper, SALDRU, University of Cape Town. Le, A. T. and P. W. Miller (2003). "Choice of School in Australia: Determinants and Consequences." Australian Economic Review 36(1): 55-78. Leibbrandt, M., I. Woolard, et al. (2009). Methodology: Report on NIDS wave 1, Technical paper no. 1. University of Cape Town, National Income Dynamics Study. 346 Lemon, A. (2005). "Shifting geographies of social inclusion and exclusion: Secondary education in Pietermaritzburg, South Africa." African Affairs 104: 69-96. Levin, H. M. (1991). "The economics of educational choice." Economics of Education Review 10(2): 137-158. Levin, H. M. (1998). "Educational Vouchers: Effectiveness, Choice, and Costs." Journal of Policy Analysis and Management 17(3): 373-392. Lombard, B. (2007). "Reasons why educator-parents based at township schools transfer their own children from township schools to former Model C schools." Education as Change 11(1): 43 - 57. Lubienski, C., P. Weitzel, et al. (2009). "Is There a "Consensus" on School Choice and Achievement?: Advocacy Research and the Emerging Political Economy of Knowledge Production." Educational Policy 23(1): 161-193. Magnuson, K. (2007). "Maternal education and children's academic achievement during middle childhood." Dev Psychol 43(6): 1497-1512. Maile, S. (2004). "School Choice in South Africa." Education and Urban Society 37(1): 94-116. Martin, P. (2010). Government-funded programmes and services for vulnerable children in South Africa. Cape Town, South Africa. May, J. D., J. Aguero, et al. (2007). "The KwaZulu-Natal Income Dynamics Study (KIDS) Third Wave: Methods, First Findings and an Agenda for Future Research." Development Southern Africa 24(5): 629-648. May, J. D., M. Carter, et al. (1999). KwaZulu-Natal Income Dynamics Study (KIDS) 1993-1998. Working Paper No. 21. McMillan, J. and S. Schumacher (2005). Research in education: evidence based enquiry, Allyn & Bacon. Motala, S. (1995). "Surviving the System - a critical appraisal of some conventional wisdoms in primary education in South Africa." Comparative Education 31(2): 161 - 180. Motala, S. (2009). "Privatising public schooling in post-apartheid South Africa – equity considerations." Compare: A Journal of Comparative and International Education 39(2): 185 - 202. Motala, S., V. Dieltiens, et al. (2009). "Physical access to schooling in South Africa: mapping dropout, repetition and age-grade progression in two districts." Comparative Education 45(2): 251-263. Msila, V. (2005). "The education exodus: the flight from township schools." Africa Education Review 2(2): 173-188. Msila, V. (2009). "School choice and intra-township migration: black parents scrambling for quality education in South Africa." Journal of Education 46: 81-98. Narodowski, M. and M. Nores (2002). "Socio-economic segregation with (without) competitive education policies. A comparative analysis of Argentina and Chile." Comparative Education 38 (4): 429-451. 347 Nelson Mandela Children's Fund (2005). Emerging Voices: A report on Education in South African rural communities. Cape Town, HSRC Press. Nkomo, M., C. McKinney, et al. (2004). Reflections on school integration: colloquium proceedings. Cape Town, HSRC Press Noden, P. (2000). "Rediscovering the Impact of Marketisation: Dimensions of Social Segregation in England's Secondary Schools, 1994-99." British Journal of Sociology of Education 21(3): 371-390. Norris, S. A., L. M. Richter, et al. (2007). "Field Report: Panel studies in developing countries: case analysis of sample attrition over the past 16 years within the Birth to Twenty cohort in Johoannesburg, South Africa." Journal of International Development 19 1143-1150. Osborne, C. and S. McLanahan (2007). "Partnership Instability and Child Well-Being." Journal of Marriage and Family 69: 1065-1083. Pampallis, J. (2003). Education reform and school choice in South Africa. Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers College Press, Columbia University: 143-163. Pampallis, J. (2008). School fees. Issues in Education Policy. Johannesburg, Centre for Education Policy Development. Paterson, A. and G. Kruss (1998). "Educational migration and its effect on access to schooling in South Africa." South African Journal of Education 18 (3): 149-155. Pendlebury, S. and N. Rudolph (2008). Children's access to education. South African Child Gauge 2007/2008. P. Proudlock, M. Dutschke, L. Jamieson, J. Monson and C. Smith. Cape Town, Children's Institute, University of Cape Town: 74 -77. Peterson, P., W. Howell, et al. (2003). School vouchers: results from randomized experiments. The economics of school choice. C. M. Hoxby. Chicago, The University of Chicago Press: 107-144. Plank, D. and G. Sykes (2003). Choosing choice: School choice in international perspective. New York, NY, Teachers College Press, Columbia University. Reddy, V. (2006). Mathematics and science achievement at South African schools in TIMMS 2003. Cape Town, HSRC Press. Reschovsky, A. (2006). "Financing Schools in the New South Africa." Comparative Education Review 50(1): 21-45. Richter, L., S. Norris, et al. (2004). "Transition from Birth to Ten to Birth to Twenty: the South African cohort reaches 13 years of age." Paediatric and Perinatal Epidemiology 18 290-301. Richter, L., S. Norris, et al. (2007). "Cohort Profile: Mandela's children: The 1990 birth to twenty study in South Africa." International Journal of Epidemiology: dym016. Richter, L., S. Norris, et al. (2006). "In-migration and living conditions of young adolescents in greater Johannesburg, South Africa " Social Dynamics 32 (1): 195-216. 348 Richter, L., S. Panday, et al. (2009). "Factors influencing enrolment: A case study from Birth to Twenty, the 1990 birth cohort in Soweto- Johannesburg." Evaluation and Program Planning 32 (3 ): 197-203. Richter, L., S. Panday, et al. (2009). "Adolescents in the city: material and social living conditions in Johannesburg-Soweto, South Africa." Urban Forum 20 319-334. Rinne, R., J. Kivirauma, et al. (2002). "Shoots of revisionist education policy or just slow readjustment? The Finnish case of educational reconstruction." Journal of Education Policy 17(6): 643-658. Rutstein, S. O. and K. Johnson (2004). The DHS wealth Index. DHS Comparative Reports. Calverton, MD, Measure DHS+. Saporito, S. (2003). "Private Choices, Public Consequences: Magnet School Choice and Segregation by Race and Poverty." Social Problems 50(2): 181-203. Schneider, M. and J. Buckley (2002). "What Do Parents Want From Schools? Evidence From the Internet." Educational Evaluation and Policy Analysis 24(2): 133-144. Schneider, M., M. Marschall, et al. (1998). "School Choice and Culture Wars in the Classroom: What Different Parents Seek from Education." Social Science Quarterly (University of Texas Press) 79(3): 489-501. Scott, D. W. (1992). Multivariate density estimation: theory, practice, and visualization. New York, New York, John Wiley. Sekete, P., M. Shilubane, et al. (2001). Deracialisation & Migration of learners in South African schools: Challenges and Implications. Pretoria HSRC Press. Sinnott, R. W. (1984). "Virtues of the Haversine." Sky and Telescope 68(2): 159. Smith, E. (2006). Using secondary data in educational and social research. New York, NY McGraw Hill/Open University Press. Soderstrom, M. and R. Uusitalo (2005). School choice and segregation: evidence from an admission reform. Uppsala, Sweden, Institute for Labour Market Policy Evaluation. 7 Soudien, C. (2003). Inclusion and exclusion in some South African schools: Preliminary findings. Soudien, C., N. Carrim, et al. (2004). School inclusion and exclusion in South Africa: some theoretical and methodological considerations. Reflections on school integration:. M. Nkomo, C. McKinney and L. Chisholm. Cape Town, HSRC Press: 19-42. South African Human Rights Commission (2004). The right to education, South African Human Rights Commission. South African Human Rights Commission (2006). Report of the public hearing on the right to basic education, South African Human Rights Commission Spaull, N. (2011). A preliminary analysis of SACMEQ III South Africa. Stellenbosch Economic Working Papers. Stellenbosch, University of Stellenbosch. 349 Sujee, M. (2004). Deracialisation of Gauteng schools - a quantitative analysis. Reflections on school integration: Colloquium proceedings. M. Nkomo, C. McKinney and L. Chisholm. Cape Town, South Africa, HSRC Publishers. Teske, P. and M. Schneider (2001). "What Research Can Tell Policymakers about School Choice." Journal of Policy Analysis and Management 20(4): 609-631. Tsang, M. (2003). School choice in the People's Republic of China. Choosing choice: School choice in international perspective. D. Plank and G. Sykes. New York, NY, Teachers College Press, Columbia University: 164-195. Unterhalter, E. (2005). Gender equality and education in South Africa: Measurements, scores and strategies. Gender equity in South African education 1994-2004: Perspectives from research, government and unions. L. Chisholm and J. September. Cape Town, South Africa, HSRC Press. van der Berg, S., C. Burger, et al. (2011). Low quality education as a poverty trap. Stellenbosch, Department of Economics, Stellenbosch University. van der Berg, S., L. Wood, et al. (2002). "Differentiation in black education." Development Southern Africa 19: 289-306. Veriava, F. (2005). Free to learn: A discussion paper on the School Fee Exemption policy. Cape Town, South Africa, Children's Institute, University of Cape Town. Viteritti, J. (2005). School choice: how an abstract idea became a political reality, Brookings Institute. Vyas, S. and L. Kumaranayake (2006). "Constructing socio-economic status indices: how to use principal components analysis." Health Policy and Planning 21(6): 459-468. Waslander, S. and M. Thrupp (1995). "Choice, competition and segregation: an empirical analysis of a New Zealand secondary school market, 1990-93." Journal of Education Policy 10(1): 1-26. Weiher, G. and K. Tedin (2002). "Does choice lead to racially distinctive schools? Charter schools and household preferences." Journal of Policy Analysis and Management 21(1): 79-92. West, A. and A. Hind (2007). "School Choice in London, England: Characteristics of Students in Different Types of Secondary Schools." Peabody Journal of Education 82(2): 498 - 529. Witte, J. F. and C. A. Thorn (1996). "Who Chooses? Voucher and Interdistrict Choice Programs in Milwaukee." American Journal of Education 104(3): 186-217. Woolman, S. and B. Fleisch (2006). "South Africa's unintended experiment in school choice: how the National Education Policy Act, the South Africa Schools Act and the Employment of Educators Act create the enabling conditions for quasi-markets in schools." Education and the Law 18: 31-75. 350 Yach, D., N. Cameron, et al. (1991). "Birth to Ten: Child health in South Africa in the nineties: rationale and methods of a birth cohort study." Paediatric and Perinatal Epidemiology 5: 211-233. Yamauchi, F. (2004). Race, equity and public schools in post-Apartheid South Africa: Is opportunity equal for all kids? IFPRI Food Consumption and Nutrition Division Working Paper Series. Zietz, J. and P. Joshi (2005). "Academic choice behavior of high school students: economic rationale and empirical evidence." Economics of Education Review 24(3): 297-308.