You are here
Topic Areas
Teacher quality
Full Abstract
The onset of the pandemic in spring 2020 substantially disrupted routes into teaching and offered a unique opportunity to study this process with different requirements for initial entry into the classroom. We examine the impacts of the Temporary Certificate of Eligibility (Temporary CE), which allowed teacher candidates in New Jersey to enter the workforce before completing assessment and performance requirements. Relative to the novice teacher workforce before the pandemic, Temporary CE teachers were substantially more diverse without any significant effects on teacher performance or student test scores. However, Temporary CE holders were less likely to remain in the same school or in the New Jersey teaching workforce between 2020–21 and 2021–22. Although Temporary CE holders disproportionately entered through alternate routes into teaching, these patterns hold for both traditional- and alternate-route entrants.
Citation: Benjamin Backes, Dan Goldhaber (2023). The Relationship Between Pandemic-Era Teacher Licensure Waivers and Teacher Demographics, Retention, and Effectiveness in New Jersey. CALDER Working Paper No. 286-0623
Full Abstract
This paper examines how different measures of teacher quality are related to students’ long-run educational trajectories. We estimate teachers’ test-based and nontest value-added (the latter based on contributions to student absences, suspensions, grade progression, and grades) and assess how these predict various student postsecondary outcomes. We find that both types of value-added have positive effects on student outcomes. Test-based teacher quality measures have more explanatory power for outcomes relevant for students at the top of the achievement distribution, such as attending a more selective college, while nontest measures have more explanatory power for whether students enroll in college at all.
This is an updated version of the paper originally titled "Teachers and Students’ Postsecondary Outcomes: Testing the Predictive Power of Test and Nontest Teacher Quality Measures" (WP 270-1022), released in October 2022.
Citation: Benjamin Backes, James Cowan, Dan Goldhaber, Roddy Theobald (2023). How to Measure a Teacher: The Influence of Test and Nontest Value-Added on Long-Run Student Outcomes. CALDER Working Paper No. 270-0423-2
Full Abstract
We use publicly available, longitudinal data from Washington state to study the extent to which three interrelated processes—teacher attrition from the state teaching workforce, teacher mobility between teaching positions, and teacher hiring for open positions—contribute to “teacher quality gaps” (TQGs) between students of color and other students in K–12 public schools. Specifically, we develop and implement an agent-based model simulation of decisions about attrition, mobility, and hiring to assess the extent to which each process contributes to observed TQGs. We find that eliminating inequities in teacher mobility and hiring across different schools would close TQGs within 5 years, while just eliminating inequities in teacher hiring would close gaps within 10 years. On the other hand, eliminating inequities in teacher attrition without addressing mobility and hiring does little to close gaps.
Citation: Dan Goldhaber, Matt Kasman, Vanessa Quince, Roddy Theobald, Malcolm Wolff (2023). How Did It Get This Way? Disentangling the Sources of Teacher Quality Gaps Through Agent-Based Modeling. CALDER Working Paper No. 259-0223
Full Abstract
Student-teacher relationships are at the core of student experiences in schools and, arguably, fundamental to influencing student outcomes. Using a statewide, student-level school climate survey from Massachusetts, we investigate teachers’ contributions to school climate, which we refer to as climate value added (VA), and how it varies by student race/ethnicity. We first show that climate VA contributes to student learning: Teachers whose students report positive feelings about climate also contribute more to student test scores and to an aggregate of nontest student outcomes (student absences, suspensions, and grade progression). And teachers identified by students of color as contributing to better school climate have outsize effects on learning gains for these students. Differences in teachers’ climate effects across racial/ethnic groups are largest on topics aligned with cultural competency, school participation, and comfort with faculty. Lastly, we find that Black students assigned to Black teachers report better school climate than Black students assigned to other teachers.
Citation: Benjamin Backes, James Cowan, Dan Goldhaber, Roddy Theobald (2022). Teachers and School Climate: Effects on Student Outcomes and Academic Disparities. CALDER Working Paper No. 274-1022
Full Abstract
Most research about how to improve the teacher workforce has focused on interventions designed to improve incumbent teachers, far less attention has been directed toward teacher hiring processes and whether districts can make better hiring decisions. Using data from Spokane Public Schools and Washington state, we describe the findings from a study analyzing measures of the predictive validity of teacher applicant quality measures obtained from professional references. We find that professional reference ratings of prospective teachers are significantly predictive of teacher quality as measured by inservice performance evaluations and teacher value added in math. These findings are driven by applicants with at least some teaching experience and vary by rater type (e.g., principal or university supervisor); the magnitude of the relationship between the ratings of applicants and teacher performance is much smaller and not statistically significant for applicants that do not have teaching experience. Overall, the evidence suggests that obtaining explicit ratings of teacher applicants from professional references is a low-cost way to contribute to the applicant information available to hiring officials and has potential for improving hiring outcomes.
Citation: Dan Goldhaber, Cyrus Grout, Malcolm Wolff (2022). How Well Do Professional Reference Ratings Predict Teacher Performance?. CALDER Working Paper No. 272-1022
Full Abstract
We present new estimates of the importance of teachers in early grades for later grade outcomes, but unlike the existing literature that examines teacher “fade-out,” we directly compare the contribution of early-grade teachers to later year outcomes against the contributions of later year teachers to the same later year outcomes. Where the prior literature finds that much of the contribution of early teachers fades away, we find that the contributions of early-year teachers remain important in later grades. The difference in contributions to eighth-grade outcomes between an effective and ineffective fourth-grade teacher is about half the difference among eighth-grade teachers. The effect on eighth-grade outcomes of replacing a fourth-grade teacher who is below the 5th percentile with a median teacher is about half the underrepresented minority (URM)/non-URM achievement gap. Our results reinforce earlier conclusions in the literature that teachers in all grades are important for student achievement.
Citation: Dan Goldhaber, Zeyu Jin, Richard Startz (2022). How Much Do Early Teachers Matter?. CALDER Working Paper No. 264-0422
Full Abstract
Novice teachers’ professional contexts may have important implications for their effectiveness, development, and retention. However, descriptions of these contexts suffer from data limitations, resulting in unidimensional or vague characterizations. Using 10 years of administrative data from the Los Angeles Unified School District, we describe patterns of new teacher sorting using 27 context measures organized along three distinct dimensions - intensity of instructional responsibilities, homophily, and colleague qualifications – and use school-level survey data to measure a fourth dimension (professional culture). Relative to more experienced teachers, novice teachers have placements that are more challenging along the first three dimensions, and composite measures are differentially predictive of teachers’ outcomes. This suggests that policymakers should consider placements to better retain and develop novice teachers.
This paper has been published in American Educational Research Journal and can be found here, October 2019.
Citation: Paul Bruno, Sarah Rabovsky, Katharine O. Strunk (2019). Taking their First Steps: The Distribution of New Teachers into School and Classroom Contexts and Implications for Teacher Effectiveness and Growth. CALDER Working Paper No. 212-0119-1
Full Abstract
We use longitudinal data from North Carolina and Washington to study the extent to which four processes—teacher attrition from each state workforce, teacher mobility within districts, teacher mobility across districts, and teacher hiring—contribute to “teacher quality gaps” (TQGs) between advantaged and disadvantaged schools. We first replicate prior findings documenting inequities in each of these processes using different measures of student disadvantage (race and poverty) and teacher quality (experience, licensure test scores, and value added) and then develop and implement a simulation to assess the extent to which each process contributes to observed TQGs in each state. We find that all four processes contribute to TQGs but also document considerable heterogeneity in the extent to which each process contributes to the different TQG measures. For example, patterns in teacher attrition and mobility contribute more to TQGs measured by teacher experience, while patterns in teacher hiring explain the majority of TQGs measured by teacher licensure test scores and value added.
Citation: Dan Goldhaber, Vanessa Quince, Roddy Theobald (2018). How Did It Get This Way? Disentangling the Sources of Teacher Quality Gaps Across Two States. CALDER Working Paper No. 209-1118-1
Full Abstract
We study the relative performance of two policy relevant value-added models – a one-step fixed effect model and a two-step aggregated residuals model – using a simulated dataset well grounded in the value-added literature. A key feature of our data generating process is that student achievement depends on a continuous measure of economic disadvantage. This is a realistic condition that has implications for model performance because researchers typically have access to only a noisy, binary measure of disadvantage. We find that one- and two-step value-added models perform similarly across a wide range of student and teacher sorting conditions, with the two-step model modestly outperforming the one-step model in conditions that best match observed sorting in real data. A reason for the generally superior performance of the two-step model is that it better handles the use of an error-prone, dichotomous proxy for student disadvantage.
WP 179 was revised in September 2018. It was originally released in June 2017
Citation: Eric Parsons, Cory Koedel, Li Tan (2018). Accounting for Student Disadvantage in Value-Added Models (Update). CALDER Working Paper No. 179
Full Abstract
We use statewide data from Massachusetts to investigate teacher performance evaluations as a measure of teaching effectiveness. Consistent with prior research, we find that assignment to lower achieving classrooms reduces teachers’ performance ratings. But after adjusting for these and other observable differences between classroom assignments, we show that regression-adjusted performance measures can reliably predict future evaluation ratings as teachers move across grades and subjects within the same school. However, we also document substantial unexplained variation in ratings across schools and districts in the state. In particular, districts vary substantially both in the extent to which they differentiate between teachers and in the sensitivity of performance ratings to differences in teacher effectiveness as measured by value added. As a result, even after regression adjustment, teacher evaluation ratings generally provide unreliable predictions of future teacher evaluations after teachers switch schools. These findings suggest that policymakers and researchers should use caution in using performance evaluation ratings to make comparisons between teachers in different contexts.
WP 197-0618-1 was originally released in June 2018. An updated version (WP 197-0618-2) was released in August 2020.
This paper has been published in the Journal of Research on Educational Effectivess and can be found here, March 2022.
Citation: James Cowan , Dan Goldhaber, Roddy Theobald (2018). Performance Evaluations as a Measure of Teacher Effectiveness when Standards Differ: Accounting for Variation across Classrooms, Schools, and Districts. CALDER Working Paper No. 197-0618-2
Full Abstract
Few studies examine employee responses to layoff-induced unemployment risk; none that we know of quantify the impact of job insecurity on individual employee productivity. Using data from the Los Angeles Unified School District and Washington State during the Great Recession, we provide the first evidence about the impact of the layoff process on teacher productivity. In both sites we find that teachers impacted by the layoff process are less productive than those who do not face layoff-induced job threat. LAUSD teachers who are laid off and then rehired to return to the district are less productive in the two years following the layoff. Washington teachers who are given a reduction-in-force (RIF) notice and are then not laid off have reduced effectiveness in the year of the RIF. We argue that these results are likely driven by impacts of the layoff process on teachers’ job commitment and present evidence to rule out alternate explanations.
WP 140 was revised in March 2018. It was originally released in November 2015.
Citation: Katharine O. Strunk, Dan Goldhaber, David S. Knight, Nate Brown (2018). Are There Hidden Costs Associated With Conducting Layoffs? The Impact of RIFs and Layoffs on Teacher Effectiveness. CALDER Working Paper No. 140
Full Abstract
We study how the introduction of a rigorous teacher evaluation system in a large urban school district affects the quality composition of teacher turnovers. With the implementation of the new system, we document increased turnover among the least effective teachers and decreased turnover among the most effective teachers, relative to teachers in the middle of the distribution. Our findings demonstrate that the alignment between personnel decisions and teacher effectiveness can be improved through targeted personnel policies. However, the change in the composition of exiters brought on by the policy we study is too small to meaningfully impact student achievement.
Working Paper 168-0717 was updated March 2019, it was orginally published July 2017.
Citation: Julie Berry Cullen, Cory Koedel, Eric Parsons (2017). The Compositional Effect of Rigorous Teacher Evaluation on Workforce Quality. CALDER Working Paper No. 168
Full Abstract
We investigate the relationship between teacher licensure test scores and student test achievement and high school course-taking. We focus on three subject/grade combinations—middle school math, ninth-grade algebra and geometry, and ninth-grade biology—and find evidence that a teacher’s basic skills test scores are modestly predictive of student achievement in middle and high school math and highly predictive of student achievement in high school biology. A teacher’s subject-specific licensure test scores are a consistent and statistically significant predictor of student achievement only in high school biology. Finally, we find little evidence that students assigned to middle school teachers with higher basic-skills test scores are more likely to take advanced math and science courses in high school.
Citation: Dan Goldhaber, Trevor Gratz, Roddy Theobald (2016). What’s in a Teacher Test? Assessing the Relationship between Teacher Licensure Test Scores and Student Secondary STEM Achievement. CALDER Working Paper No. 158
Full Abstract
This policy brief reviews evidence about the extent to which disadvantaged students are taught by teachers with lower value-added estimates of performance, and seeks to reconcile differences in findings from different studies. We demonstrate that much of the inequity in teacher value added in Washington state is due to differences across different districts, so studies that only investigate inequities within districts likely understate the overall inequity in the distribution of teacher effectiveness because they miss one of the primary sources of this inequity.
Citation: Dan Goldhaber, Vanessa Quince, Roddy Theobald (2016). Reconciling Different Estimates of Teacher Quality Gaps Based on Value Added. CALDER Working Paper No. 141 216
Full Abstract
There is mounting evidence of substantial “teacher quality gaps” (TQGs) between advantaged and disadvantaged students, but practically no empirical evidence about their history. We use longitudinal data on public school students, teachers, and schools from two states—North Carolina and Washington—to provide a descriptive history of the evolution of TQGs in these states. We find that TQGs exist in every year in each state and for all measures we consider of student disadvantage and teacher quality. But there is variation in the magnitudes and sources of TQGs over time, between the two states, and depending on the measure of student disadvantage and teacher quality.
Citation: Dan Goldhaber, Vanessa Quince, Roddy Theobald (2016). Has It Always Been This Way? Tracing the Evolution of Teacher Quality Gaps in U.S. Public Schools. CALDER Working Paper No. 171
Full Abstract
We use longitudinal data from Washington State to provide estimates of the extent to which performance on the edTPA, a performance-based, subject-specific assessment of teacher candidates, is predictive of the likelihood of employment in the teacher workforce and value-added measures of teacher effectiveness. While edTPA scores are highly predictive of employment in the state’s public teaching workforce, evidence on the relationship between edTPA scores and teaching effectiveness is more mixed. Specifically, continuous edTPA scores are a significant predictor of student mathematics achievement in some specifications, but when we consider that the edTPA is a binary screen of teaching effectiveness (i.e., pass/fail), we find that passing the edTPA is significantly predictive of teacher effectiveness in reading but not in mathematics. We also find that Hispanic candidates in Washington were more than three times more likely to fail the edTPA after it became consequential in the state than non-Hispanic White candidates.
Citation: Dan Goldhaber, James Cowan, Roddy Theobald (2016). Evaluating Prospective Teachers: Testing the Predictive Validity of the edTPA (Update). CALDER Working Paper No. 157
Full Abstract
In practice, teacher turnover appears to have negative effects on school quality as measured by student performance. However, some simulations suggest that turnover can instead have large, positive effects under a policy regime in which low-performing teachers can be accurately identified and replaced with more effective teachers. This study examines this question by evaluating the effects of teacher turnover on student achievement under IMPACT, the unique performance-assessment and incentive system in the District of Columbia Public Schools (DCPS). Employing a quasi-experimental design based on data from the first year years of IMPACT, we find that, on average, DCPS replaced teachers who left with teachers who increased student achievement by 0.08 SD in math. When we isolate the effects of lower-performing teachers who were induced to leave DCPS for poor performance, we find that student achievement improves by larger and statistically significant amounts (i.e., 0.14 SD in reading and 0.21 SD in math). In contrast, the effect of exits by teachers not sanctioned under IMPACT is typically negative but not statistically significant.
Citation: Melinda Adnot, Thomas Dee, Veronica Katz, James Wyckoff (2016). Teacher Turnover, Teacher Quality and Student Achievement in DCPS. CALDER Working Paper No. 153
Full Abstract
Using administrative longitudinal data from five states, we study how value-added measures of teacher performance are affected by changes in state standards and assessments. We first document the stability of teachers’ value-added rankings during transitions to new standard and assessment regimes and compare our findings to stability during stable standard and assessment regimes. We also examine the predictive validity of value-added estimates during nontransition years over transition-year student achievement. In most cases we find that measures of teacher value added are similarly stable in transition years and nontransition years. Moreover, there is no evidence that the level of disadvantage of students taught disproportionately influences teacher rankings in transition years relative to stable years. In the states we study, student achievement in math can consistently be forecasted accurately—although not perfectly—using value-added estimates for teachers during stable standards and assessment regimes. There was somewhat less consistency in reading, because we find cases where test transitions significantly reduced forecasting accuracy.
Citation: Benjamin Backes, James Cowan, Dan Goldhaber, Cory Koedel, Luke Miller, Zeyu Xu (2016). The Common Core Conundrum: To What Extent Should We Worry That Changes to Assessments and Standards Will Affect Test-Based Measures of Teacher Performance?. CALDER Working Paper No. 152
Full Abstract
We use data from six Washington State teacher education programs to investigate the relationship between teacher candidates’ student teaching experiences and their later teaching effectiveness and probability of attrition. We find that teachers who student taught in schools with lower teacher turnover are less likely to leave the state’s teaching workforce, and that teachers are more effective when the student demographics of their current school are similar to the student demographics of the school in which they did their student teaching. While descriptive, these findings suggest that the school context in which student teaching occurs has important implications for the later outcomes of teachers and their students.
Citation: Dan Goldhaber, John Krieg, Roddy Theobald (2016). Does the Match Matter? Exploring Whether Student Teaching Experiences Affect Teacher Effectiveness and Attrition. CALDER Working Paper No. 149
Full Abstract
We use rich longitudinally matched administrative data on students and teachers in North Carolina to examine the patterns of differential effectiveness by teachers’ years of experience. The paper contributes to the literature by focusing on middle school teachers and by extending the analysis to student outcomes beyond test scores. Once we control statistically for the quality of individual teachers by the use of teacher fixed effects, we find large returns to experience for middle school teachers in the form both of higher test scores and improvements in student behavior, with the clearest behavioral effects emerging for reductions in student absenteeism. Moreover these returns extend well beyond the first few years of teaching. The paper contributes to policy debates by documenting that teachers can and do continue to learn on the job.
December 2015 Update
Citation: Helen Ladd, Lucy C. Sorensen (2015). Returns to Teacher Experience: Student Achievement and Motivation in Middle School. CALDER Working Paper No. 112