<article_title>Alignment between South African mathematics assessment standards and the TIMSS assessment frameworks  </article

AOSIS Openjournalshttp://www.openjournals.netinfo@openjournals.nethttp://www.pythagoras.org.za1012-23462223-7895<article_title>Alignment between South African mathematics assessment standards and the TIMSS assessment frameworks  </article_title>South Africa’s performance in international benchmark tests is a major cause for concern amongst educators and policymakers, raising questions about the effectiveness of the curriculum reform efforts of the democratic era. The purpose of the study reported in this article was to investigate the degree of alignment between the TIMSS 2003 Grade 8 Mathematics assessment frameworks and the Revised National Curriculum Statements (RNCS) assessment standards for Grade 8 Mathematics, later revised to become the Curriculum and Assessment Policy Statements (CAPS). Such an investigation could help to partly shed light on why South African learners do not perform well and point out discrepancies that need to be attended to. The methodology of document analysis was adopted for the study, with the RNCS and the TIMSS 2003 Grade 8 Mathematics frameworks forming the principal documents. Porter’s moderately complex index of alignment was adopted for its simplicity. The computed index of 0.751 for the alignment between the RNCS assessment standards and the TIMSS assessment objectives was found to be significantly statistically low, at the alpha level of 0.05, according to Fulmer’s critical values for 20 cells and 90 or 120 standard points. The study suggests that inadequate attention has been paid to the alignment of the South African mathematics curriculum to the successive TIMSS assessment frameworks in terms of the cognitive level descriptions. The study recommends that participation in TIMSS should rigorously and critically inform ongoing curriculum reform efforts. Mdutshekelwa Ndlovu1,2 1Institute for Mathematics and Science Teaching (IMSTUS), Department of Curriculum Studies, University of Stellenbosch, South Africa  2Faculty of Humanities, Tshwane University of Technology, South Africa  Andile Mji2 2Faculty of Humanities, Tshwane University of Technology, South Africa  Mdutshekelwa Ndlovu mcn@sun.ac.za Faculty of Education (IMSTUS), University of Stellenbosch, Private Bag X1, Matieland 7602, South Africa 18233310.4102/pythagoras.v33i3.182 28 June 201224 Oct. 201210 Dec. 2012 Ndlovu, M., & Mji, A. (2012). Alignment between South African mathematics assessment standards and the TIMSS assessment frameworks. Pythagoras, 33(3), Art. #182, 9 pages. http://dx.doi.org/10.4102/pythagoras.v33i3.182 © 2012.The Authors. Licensee: AOSIS OpenJournals. This work is licensed under the Creative Commons Attribution License.Introduction Hencke, Rutkowski, Neuschmidt and Gonzalez (2009) make the important remark that the Trends in International Mathematics and Science Study (TIMSS) examines the effectiveness of curriculum and instruction in relation to student achievement. There is increasing global interest in and attention paid to the resultant rankings of participating countries, making the very participation in TIMSS a high-stake local decision. As a consequence of the heightened (political and educational) stakes, the relevance of the tests to local curricula has come under sharp scrutiny, which makes the issue of alignment of the South African (SA) curriculum with TIMSS important for educators, curriculum workers, test developers and policymakers. Hencke et al. (2009) concede upfront that whilst TIMSS assessments were developed to represent an agreed-upon framework with as much in common across countries as possible, it was inevitable that the match between test and curriculum would not be identical in all countries. However, the more aligned a national curriculum is to what is common across countries the greater the chance of that country’s students performing well. In other words, rather than reject the common core assessments as irrelevant it might be beneficial to investigate in depth what discrepancies exist between SA’s curricula and TIMSS, with special focus on the overlapping content.  Mullis, Martin, Ruddock, O’Sullivan and Preuschoff (2009) refer to the TIMSS curriculum model as consisting of an intended curriculum, an implemented curriculum and an attained curriculum, all of which are familiar terms in curriculum theory. For instance, Porter (2004, p. 1) suggests that a curriculum can be divided into four aspects: the intended, enacted, assessed and learned curriculum. The enacted curriculum refers to instructional events in the classroom whereas the assessed curriculum refers to student achievement tests. Mullis et al.’s (2009) attained curriculum refers to student achievement in those tests. For cross-national tests such as TIMSS to be valid, it is critical that their assessed curricula correspond with the intended national curricula. Moreover, assessments aligned with the assessment standards can guide instruction and raise achievement (Martone & Sireci, 2009; Polikoff, Porter & Smithson, 2011). In view of the foregoing it is expected that, in order to be relevant, cross-national studies or tests should provide curriculum information that can help countries to improve the quality of their education systems on the basis of benchmarking performance (Reddy, 2006). This makes curriculum matching analysis a logical starting point. In bemoaning the absence of extensive use of alignment research in the classroom, Martone and Sireci (2009) point out lost opportunities to help policymakers, assessment developers and educators to make refinements so curriculum, assessment and instruction support each other in achieving what is expected of students. In an attempt to bridge this gap the aim of the study was to analyse the alignment between SA’s Grade 8 Mathematics curriculum and TIMSS by means of the Porter (2002) procedure. To achieve this goal the remainder of this article gives the theoretical background to alignment studies in general and shows why the Porter index was chosen. Thereafter we spell out the research questions guiding the study and outline the procedure for determining the index before presenting and discussing the results. The article concludes with summary observations and recommendations. Theoretical framework of alignment studies Definition of alignment and scope of alignment studies For purposes of comparing the Grade 8 Revised National Curriculum Statements (RNCS) for Mathematics and the TIMSS assessment we analyse measures of curricula and assessment alignment based on research that has developed methods for judging the extent and nature of alignment (e.g. Porter, 2002; Porter & Smithson, 2001; Webb, 2005). Alignment can be defined as the degree of agreement, match or measure of consistency between curriculum content (content standards) for a specific subject area and the assessment(s) used to measure student achievement of these standards (Bhola, Impara & Buckendahl, 2003; Näsström, 2008; Näsström & Henricksson, 2008). A major feature of alignment studies is the development of common languages of topics and categories of cognitive demand for describing content in different subject areas such mathematics, reading and science (Berends, Stein & Smithson, 2009, p. 4). The underlying logic is that if standards specify what and how well students should be learning and tests measure what they know and can do, then the two ought to be synchronised (Herman & Webb, 2007, p. 1). In other words, the language of the assessment items must match the language of the outcomes stated in the RNCS or its successor, Curriculum and Assessment Policy Statements (CAPS) (Department of Basic Education, 2011a). Similarly, the content and cognitive domain language of the CAPS should match that of the TIMSS assessment frameworks as closely as possible. Alignment, thus, has both content and consequential validity in terms of the knowledge and skills prescribed and tested (Bhola et al., 2003, p. 21) Although the alignment between standards and assessment has been most commonly studied (e.g. Bhola et al., 2003; Herman & Webb, 2007), the alignments between standards and instruction as well as between instruction and assessment have also been studied (e.g. Porter, 2002). In curriculum theory and practice, standards have lately come to refer to ‘descriptions of what students are expected to know and be able to do’ (Näsström, 2008, p. 16), which makes them synonymous with the intended relationship between educational objectives and subject matter content. In SA, the term ‘outcomes’ has been used widely to frame statements about both subject matter content and anticipated learning behaviours. Porter’s model for evaluating alignment  From the three commonly used primary models of evaluating alignment, that is Webb’s (1997, 2005) Depth of Knowledge Procedure, Rothman, Slattery, Vranek, and Resnick’s (2002) Achieve Procedure model and Porter’s (2002) Surveys of Enacted Curriculum index, we opted for the last one. Unlike the other two approaches, the Surveys of Enacted Curriculum index does not rely on direct comparison of assessments or assessment items with objectives or standards. Instead, content analysts first code the standards and assessments onto a common framework, a content taxonomy, developed by subject matter experts. The taxonomy defines content in terms of two variables: topics or sub-topics and levels of cognitive demand. The two variables compare favourably with Webb’s Categorical Concurrence and Performance Centrality. Analysts place assessment items and objectives from standards documents into the taxonomy and the documents are then represented as matrices of proportions, where the proportion in each cell (topic and cognitive demand) indicates the proportion of total content in the document that emphasises that particular combination of topic and cognitive demand. The matrices for standards and assessments are then compared, cell by cell, and an alignment index is calculated. We believe that the Porter procedure achieves in two dimensions what the Webb and Achieve procedures do in four measures. More importantly, the Porter alignment model ‘can be applied to analyse the match between any two of curriculum, instruction and assessment’ (Liu et al., 2009, p. 795). It was therefore appropriate for our purpose since we wanted to compare two curriculum documents: the TIMSS frameworks and RNCS. The calculated Porter alignment index ranges from 0 to 1 with 0.5 as its centre since it uses absolute differences, a characteristic that has to be taken into account when interpreting the computed values. Fulmer (2011) has recently provided critical values for the strength of the Porter index of alignment based on the number of cells and number of standard points used. Furthermore, the Porter procedure agrees with Bloom’s Taxonomy of educational objectives which has also been used by TIMSS and the RNCS.  Purpose of the study The purpose of this study was to determine the degree of alignment between the Grade 8 RNCS for Mathematics and the 2003 TIMSS assessment frameworks by means of the Porter index. TIMSS regularly assesses learners at the Grade 4 and 8 levels and SA has previously participated (in 1995, 1999, 2003 and 2011). The Grade 12 level has not been consistently assessed. We chose the Grade 8 curriculum because it is a transitional grade between primary and secondary phases. The 2003 results were the latest available of South Africa’s participation in TIMSS because the 2011 results were still pending at the time of this article. The following research questions guided the study: What is the structure of the content and cognitive domain matrices for the components of the 2003 TIMSS Grade 8 Mathematics assessment frameworks? What is the structure of the content and cognitive domain matrix for the RNCS Grade 8 Mathematics assessment standards? What are the computed Porter indices of alignment within and between the components of the 2003 TIMSS assessment frameworks and the RNCS assessment standards for Grade 8 Mathematics? What is the structure of discrepancies in emphasis between the RNCS assessment standards and the 2003 TIMSS assessment objectives? How do the discrepancies assessment objectives compare with SA’s performance in TIMSS 2003? To help answer these questions we adopted the document analysis methodology in this study.  MethodologyThe document analysis methodology The methodology of document analysis was adopted for this study as it entails systematic and critical examination rather than mere description of instructional or curriculum documents (Center for Teaching and Learning, 2007). Document analysis is also referred to as qualitative content analysis (Daymon & Holloway, 2011), an analytical method used in qualitative research to gain an understanding of trends and patterns that emerge from data. The aim of qualitative document analysis is to discover new or emergent patterns, including overlooked categories (Daymon & Holloway, 2011, p. 321). Statistical reports within a qualitative study should reveal ways in which the data and statistics have been organised and presented to convey the key messages and meanings intended. The qualitative document analysis in this study is organised and presented statistically by means of the Porter alignment procedure to convey messages and meanings about the strength of the alignment between the TIMSS (2003) mathematics assessment frameworks and the RNCS for Grade 8 Mathematics. For the empirical work the first author worked with two experienced mathematics in-service facilitators for the Senior Phase and the Further Education and Training phase. The Grade 8 RNCS Mathematics assessment standards were compared with the TIMSS assessment objectives and a common template consisting of 110 standard points or fragments as follows: Number: 32 (8 whole number points, 5 integer points, 10 fractions and decimals points, 2 irrational number and financial mathematics points, and 7 ratio, proportion and percentage points); Algebra: 26 (3 patterns, 7 algebraic expressions, 7 equations and formulas, 9 relationships/functions); Measurement: 16 (3 attributes and units, 13 tools, techniques and formulae); Geometry: 18 (3 lines and angles, 6 two-dimensional and three-dimensional shapes, 3 congruence and similarity, 4 location and spatial relationships, 3 symmetry and transformations); and Data: 17 (4 data collection and organisation, 4 data representation, 5 data interpretation, 4 uncertainty and probability). In the common template, 88% of the RNCS assessment standards were covered whilst 89% of the TIMSS assessment objectives were covered. The facilitators were introduced to the mathematics cognitive domain categories used in TIMSS 2003 (Mullis et al., 2003, pp. 27−33) and asked to code the cognitive domain levels elicited by each standard point in the template according to the verbs used (see Table 1). Following the level descriptors, the two facilitators independently coded the standards and the author allocated marks according to their coding, totalling 1 score point per standard point. Table 1 shows the content and cognitive domain categories, the weightings and the verbs or descriptors that characterise the cognitive levels.  Table 2 shows an example of five selected RNCS standard points coded following Airasian and Miranda’s (2002, pp. 251−253) procedure of coding objectives according to the Revised Bloom’s Taxonomy of educational objectives. Assessment standard point 1.1, for example, uses three verbs, the first of which is in the ‘knowing’ category and the other two are both in the ‘using concepts’ category. Standard point 1.2 uses the same three verbs but for a different content sub-topic (fractions instead of integers); standard point 2.1 solves problems and standard point 4.1 investigates and extends at the reasoning level. Reliability was assured by the independent coding of the experts who were given copies of the relevant pages for the classification of TIMSS assessment objectives as they appear in Mullis et al. (2003, pp. 27−33) and implored to adhere to these as closely as possible. (The inter-rater kappa reliability index could not be computed because it applies to items falling in mutually exclusive categories.) Computation of the Porter index As already noted, the Porter procedure analyses the extent of alignment between two matrices or matrices of frequencies (Fulmer, 2011, p. 384). It produces a single alignment index, ranging from 0 to 1 to indicate how closely the distribution of points in the first matrix (of standards) aligns with the second matrix (of assessment). The alignment index P is arrived at in four steps as shown in Figure 1:  Create matrices of frequencies for the two documents being compared and label these as X and Y. For each cell in matrices X and Y, compute the ratio of points in the cell with the total number of points in the respective matrix. Label the matrices of ratios as x and y. For every row j and column k in matrices X and Y (the matrices of ratios), calculate the absolute value of the discrepancy between the ratios in cells xjk and yjk. Compute the alignment index using the formula EQN_1.JPG , where J is the number of rows, K is the number of columns in each of matrices X and Y, and xjk and yjk are ratios of points in the cells at row j and column k for each of ratio matrices x and y respectively. Critical values for the strength of the alignment index  A greater number of cells in the matrices will yield a range of likely values that is lower than for matrices with fewer cells. Hence the total number of cells in the A and B matrices can have an effect on the significance of the alignment index. When we also consider that the centre of the distribution of indices is not zero, as noted earlier, we need to assess how far an observed alignment index is from 0.5. Fulmer (2011) generated a matrix of means and critical values for alignment indices with results also demonstrating the expected (mean) distribution of pattern of alignment indices (see sample entries in Table 3). In addition to matrix-size dependence, the alignment index also depends on the number of curriculum or standards statements or test items being coded. If the total number of cells in the matrix is N (= J × K) then for matrices A and B, J = 2 and K = 2 yields N = 4. In this study we used Fulmer’s (2011) estimates of the critical values as determined by the number of cells and standard points. Table 4 shows sample reference (or critical) value estimates from the corresponding number of cells and standards points. From results presented by Porter (2002), for instance, the alignment between the standards of four US states (and the NCTM) and their own assessments ranged from 0.30 to 0.47 for 30 standards points. Six content areas and five cognitive levels were used, which meant 30 squares made up matrices A and B. Table 4 gives a critical value of 0.7372 for  the lower quantile (  EQN_2.JPG  ) if a two-tailed test is used  at the alpha level of 0.05 (i.e. lower than might be expected by chance). Therefore one can conclude that alignment amongst assessment and standards was very low. Liu et al. (2009) used a coding structure with five content categories and six cognitive levels (hence 30 squares again) to compare the alignment of physics curriculum and assessments for China, Singapore and New York state, China and Singapore had alignments of 0.67, which were significantly lower than the  meanEQN_3.JPG ) at the 0.05 level (below the critical value of 0.7372); New York’s alignment index of 0.80 was equivalent to the mean. EQN_4.JPGResults and discussion  The structure of the content and cognitive domain matrices for the components of the 2003 Grade 8 TIMSS Mathematics assessment frameworks Three matrices were derived in respect of the three components of the TIMSS assessment frameworks: the TIMSS 2003 assessment objectives, the TIMSS 2003 target percentages and the released TIMSS 2003 test items. The content and cognitive domain matrix for the TIMSS 2003 assessment objectives was derived from the list of objectives given in the TIMSS assessment frameworks document (Mullis et al., 2003, pp. 27−33). Table 5 shows the results of the coding of the 98 (fine-grained) TIMSS objectives. The numerical values form the required matrix. The 98 fine-grained objectives were accorded equal weight guided by the estimated time to be devoted to each of them. Of the 110 fine-grained standard points (assessment objectives or standards) in the template, 12 were not amongst the TIMSS assessment objectives. The content and cognitive domain matrix for the TIMSS 2003 target percentages (Table 6) was computed by extrapolation from Exhibit 2 (Mullis et al., 2003, p. 9) showing the target percentages of TIMSS 2003 mathematics assessment time devoted to content and cognitive domain for the Grade 8 level. Time devoted was assumed to be equivalent to the importance attached to the respective categories in the frameworks as underpinned by the respective objectives. The content and cognitive domain matrix (Table 7) for the released TIMSS 2003 test items was derived from tallying the coding of all the released TIMSS 2003 test items. The assumption was that the items were accurately coded and accurately reported on. In the coding process multiple-choice items were allocated one score point each whilst constructed response items were allocated marks, depending on the amount of work to be done, so that at least one third of the assessment came from constructed response items (Martin, Mullis & Chrostowski, 2004, p. 35). The results of the tallying (Table 7) agreed with Exhibit 2.24 in Martin et al. (2004, p. 60) and were converted to proportions. The structure of the content and cognitive domain matrix for the Grade 8 RNCS for Mathematics  Table 8 shows the content and cognitive domain matrix obtained for SA’s Grade 8 Mathematics curriculum. All of the 98 RNCS assessment standards whose content was covered by TIMSS were coded. The resultant score points were totalled for each content and cognitive domain category and converted to the proportions shown.  Computed Porter indices of alignment for this study Table 9 shows the calculated raw cell-by-cell differences between RNCS assessment standards and TIMSS assessment objectives. These raw differences were converted to absolute differences, from which the Porter index of alignment was computed using the formula   EQN_5.JPG  The computed index was 0.735. The mean-simulated alignment index for a 5 × 4 comparison with 20 cells is 0.9635 (see Table 3). Using a two-tailed test, at the 0.05 alpha level, we looked to the 0.025 and 0.975 quantiles in Table 4. Close to 100 standards points and matrices of 20 squares each were used, so the critical values for 90 standard points are 0.9302 and 0.9758 respectively whilst those for 120 standard points are 0.9333 and 0.9843 respectively. The computed alignment value is well below 0.9302 and 0.9333 in the 0.025 quantiles ( ). The alignment was therefore significantly lower than would be expected by chance at the 0.05 level.  Further iterations were conducted to determine the pair-wise alignment amongst the RNCS assessment standards and the three components of the TIMSS 2003 assessment frameworks. The indices obtained are as shown in Table 10. Surprisingly the pair-wise alignment is significantly lower in all instances, without exception, suggesting a low internal consistency even amongst the TIMSS components themselves. The structure of discrepancies between TIMSS 2003 framework components and RNCS assessment standards  The first structure of discrepancies investigated was between RNCS assessments standards and the TIMSS assessment objectives. Figure 2 shows the structure of (mis)alignment by content and cognitive domain. From the graph it is evident that the RNCS assessment standards were stronger than TIMSS assessment objectives in all cases where the bars extend upwards above zero but weaker in those cases where the bars extend downwards below zero. Whilst there is a common criticism of teachers concentrating on knowledge of facts and procedures, the tables shows that the RNCS was weaker than TIMSS objectives in this cognitive level in four of the content domains, namely Number, Algebra, Geometry and Data. The RNCS was stronger with respect to routine problem solving in Number and Data but weaker in Measurement and Geometry. A similarly mixed picture emerged in respect of Reasoning.  The second structure to be investigated was between the RNCS objectives and the TIMSS target percentages. Figure 3 summarises the structure. Surprisingly, the RNCS was stronger on knowledge of facts and procedures in Number, Measurement and Geometry but weaker in Algebra and Data. Taken together, these discrepancies were significant.  A third and final structure to be investigated was that between the RNCS and TIMSS test items. Figure 4 summarises the discrepancies. A marked shift in this comparison is that the RNCS was stronger with respect to knowledge of facts and procedures in Number, Geometry and Data. The RNCS was, however, weaker in respect of Reasoning in all content categories. Routine problem solving was almost evenly split, above in Algebra, Measurement and Data, but below in Number and Geometry.  The next logical question is whether there was any relationship between the structure of discrepancies and South African students’ performance in TIMSS 2003. Given the discrepancies within the TIMSS components themselves, the ultimate question is whether the discrepancies between the RNCS and the TIMSS 2003 test items had any correlation with student performance as they were partially in force at the time of the 2003 TIMSS assessments.  A comparison of the RNCS-TIMSS test discrepancies with South African students’ performance in TIMSS 2003 Figure 5 was compiled after extracting South African students’ performance in each of the released test categories by content and cognitive domain relative to the international average. It is already well known that South African students performed below the international average across the board (e.g. Reddy, 2006).  Beyond that, however, the intention in this study was to additionally investigate if the pattern of discrepancies between RNCS assessment standards and TIMSS assessment objectives was in any way related to South African students’ performance (i.e. the achieved TIMSS curriculum).  Table 11 and Figure 6 attempt to answer that question. It is evident that student performance correlated negatively with discrepancies in Number but positively with discrepancies in Algebra, Measurement, Geometry and Data. That is, the narrower (or positive) the discrepancy was, the closer the performance was to the international average in all content domains except Number.  In Number, SA students performed worst in items on Using Concepts even though this was not the weakest cognitive domain representation in the RNCS assessment standards. In Algebra, SA students performed worst in the Knowledge of Facts and Procedures and this was the weakest category. In Measurement they performed the worst in Routine Problem Solving which was the weakest category of RNCS. In Geometry they performed worst in the Routine Problem Solving category which was the second weakest in the RNCS curriculum. In Data Handling they performed worst in Using Concepts, which was also the weakest point in the RNCS. Overall, SA learners performed worst in Using Concepts, suggesting little conceptual understanding being achieved by the curriculum. Routine Problem Solving was second worst. This pattern has implications for the intended curriculum which determines what curriculum materials should emphasise and ultimately what teachers should teach in the classroom. Brijlall (2008) notes that the lack of problem-solving skills in SA may be a result of the way it has been taught in schools: individual solution by learners, presentation of abstract problems foreign to learners. There is little doubt that the ultimate answer lies in the implemented curriculum but what feeds into the implemented curriculum is the intended curriculum.  ConclusionsThe study reported in this article set out to investigate the alignment of South Africa’s RNCS for Grade 8 Mathematics with the TIMSS 2003 Grade 8 Mathematics assessment frameworks. From the results we conclude that the computed Porter index of 0.751 suggests that the misalignment was low enough to warrant urgent attention, from curriculum designers, assessment practitioners, educators, teacher educators and policymakers alike, in order to enhance prospects of improved performance in future participations. In particular there is need to pay attention to the observed discrepancies between the content and cognitive domain emphases. The fact that, even where the RNCS curriculum was stronger than TIMSS, performance was still generally poor suggests the likelihood of a gap between the intended curriculum and the implemented curriculum. Such a gap further suggests a possible mismatch in emphasis between the intended curriculum and the curriculum support materials that actualise it. However, this conjecture requires further investigation. The study also points to the likelihood of a consequential gap between the implemented (SA) curriculum and the attained (TIMSS) curriculum reported by Reddy (2006, p. xiv). From a developing country perspective, what is even more disconcerting is that the three components of TIMSS do not appear to be aligned. That the misalignment is statistically significant calls into question the value-neutrality of TIMSS which currently appears to be a constantly shifting target that only well-resourced, developed countries can cope with.  Finally, participation in TIMSS should not be another bureaucratic ritual. Rather, it should rigorously and reflexively inform curriculum reform and innovation. In an increasingly globalised knowledge economy the school system needs to be globally competitive in the gateway fields of mathematics and science education. A simple illustration of the current disconnect is that, despite SA’s participation in previous TIMSS studies, the recently published Curriculum and Assessment Policy Statements (CAPS), which are largely a refinement of the RNCS, proclaim to have been influenced by the cognitive domain levels used in TIMSS 1999 (Department of Basic Education, 2011a, p. 55, 2011b, p. 59). This is so in spite of changes in 2003 (when the country participated in TIMSS for the third time) and further changes in 2007 (when the country did not participate), which were carried over to 2011. Accordingly, the influence of the Revised Bloom’s Taxonomy on TIMSS as evident in the 2007 and 2011 frameworks was apparently not taken into account in the latest curriculum revisions. This suggests that even the newly introduced Annual National Assessments (ANA) for Grades 1−6 and 9, together with the Senior National Certificate examinations, will continue to be guided by out-of-sync domain categories. By implication, curriculum and assessment will continue to be out of step with international trends resulting in mixed messages for teaching and learning. Bansilal (2011), for example, calls for a closer alignment of curriculum implementation plans with classroom realities. It is ironic that although SA teachers and educationists have complained of rapid curricula changes, the National Curriculum Statement has not changed at the same pace as TIMSS. Accordingly, educators, teacher educators and education researchers should be engaged more constructively in the curriculum and assessment reform processes for sustainable curricula coherence to be achieved. Reddy (2006, p. xiv) reports that during the period of TIMSS 2003, SA teachers consulted disparate curricula documents to determine what and how they taught. As affirmed by Airasian and Miranda (2002, p. 253), severe misalignment of assessment, standards and instruction can cause numerous difficulties. Given the extent to which the misalignment of the SA curriculum has gone relatively unchecked, the school system will continue to buckle for some time to come when subjected to international scrutiny. The latest of such scrutiny is the Global Competitiveness report (Schwab, 2012) and the Southern and East African Consortium for Monitoring Educational Quality report (Spaull, 2011), in which SA ranks very unfavourably.  Acknowledgements This research was funded by the Tshwane University of Technology through its postdoctoral fellowship placement of the first author. However, the opinions expressed do not necessarily reflect the views of the university. The authors are also grateful to Jeram Ramesh and Cerenus Pfeiffer for their time and expertise in the coding of the RNCS Grade 8 Mathematics assessment standards and the TIMSS assessment objectives. We are further grateful to Iben Christiansen’s comments on an earlier draft of this manuscript. Authors’ contributions M.N. (University of Stellenbosch) was the researcher responsible for the empirical study as well as the writing of the manuscript. A.M. (Tshwane University of Technology), as postdoctoral supervisor, shared his expertise during the main study and the development of the manuscript. Competing interests We declare that we have no financial or personal relationship(s) which might have inappropriately influenced our writing of this article. 1.Airasian, P.W., & Miranda, H. (2002). The role of assessment in the revised taxonomy. Theory into Practice, 41(4), 249−254. http://dx.doi.org/10.1207/s15430421tip4104_8 2.Bansilal, S. (2011). Assessment reform in South Africa: Opening up or closing spaces for teachers? Educational Studies in Mathematics, 78, 91−107. http://dx.doi.org/10.1007/s10649-011-9311-8 3.Berends, M., Stein, M., & Smithson, J. (2009). Charter public schools and mathematics instruction: How aligned are charters to state standards and assessments? Nashville, TN: National Center on School Choice, Vanderbilt University. 4.Bhola, D., Impara, J.C., & Buckendahl, C.W. (2003). Aligning tests with states’ content standards: Methods and issues. Educational Measurement: Issues and Practice, 22(3), 21−29. http://dx.doi.org/10.1111/j.1745-3992.2003.tb00134.x 5.Brijlall, D. (2008). Collaborative learning in a multilingual class. Pythagoras, 68, 52−61. http://dx.doi.org/10.4102/pythagoras.v0i68.67 6.Center for Teaching and Learning. (2007). Instructional assessment resources: Document analysis. Austin, TX: The University of Texas at Austin. Available from http://www.utexas.edu/academic/ctl/assessment/iar/teaching/plan/method/ 7.Daymon, C., & Holloway, I. (2011). Qualitative research methods in public relations and marketing communications. New York, NY: Routledge. 8.Department of Basic Education. (2011a). Curriculumm and assessment policy statement: Mathematics Grades 7−9. Pretoria: DBE. Available from http://www.education.gov.za/LinkClick.aspx?fileticket=7AByaJ8dUrc%3d&tabid=672&mid=1885 9.Department of Basic Education. (2011b). Curriculum and assessment policy statement: Mathematics Grades 10−12. Pretoria: DBE. Available from http://www.education.gov.za/LinkClick.aspx?fileticket=QPqC7QbX75w%3d&tabid=420&mid=1216 10.Fulmer, G.W. (2011). Estimating critical values for strength of alignment among curriculum, assessments and instruction. Journal of Educational and Behavioural Statistics, 36(3), 381−402. http://dx.doi.org/10.3102/1076998610381397 11.Hencke, J., Rutkowski, L., Neuschmidt, O., & Gonzalez, E.J. (2009). Curriculum coverage and scale correlation on TIMSS 2003. In D. Hastedt, & M. von Davier (Eds.), IERI monograph series: Issues and methodologies in large-scale assessments, Vol. 2 (pp. 85−112). Princeton, NJ: IEA-ETS Research Institute. 12.Herman, J.L., & Webb, N.M. (2007). Alignment methodologies. Applied Measurement in Education, 20, 1−5. 13.Liu, X., Zhang, B.H., Liang, L.L., Fulmer, G.W., Kim, B., & Yuan, H.Q. (2009). Alignment between the physics content standards and standardized tests: A comparison among US-NY, Singapore and China-Jiangsu. Science Educaton, 93, 777−797. http://dx.doi.org/10.1002/sce.20330 14.Martin, M.O., Mullis, I.V., & Chrostowski, S.J. (2004). TIMSS 2003 technical report. Chestnut Hill, MA: International Association for the Evaluation of Educational Achievement. Available from http://timss.bc.edu/PDF/t03_download/T03TECHRPT.pdf  15.Martone, A., & Sireci, S.G. (2009). Evaluating alignment between curriculum, assessment, and instruction. Review of Educational Research, 19(9), 11−16. http://dx.doi.org/10.3102/0034654309341375 16.Mullis, I.V., Martin, M.O., Ruddock, G.J., O’Sullivan, C.Y., & Preuschoff, C. (2009). TIMSS 2011 assessment frameworks. Chestnut Hill, MA: International Association for the Evaluation of Educational Achievement. Available from http://timssandpirls.bc.edu/timss2011/downloads/TIMSS2011_Frameworks.pdf 17.Mullis, I.V., Martin, M.O., Smith, T.A., Garden, R.A., Gregory, K.D., Gonzalez, E.J., et al. (2003). TIMSS assessment frameworks and specifications 2003. (2nd edn.). Chestnust Hill, MA: International Association for the Evaluation of Educational Achievement. Available from http://timss.bc.edu/timss2003i/PDF/t03_af_book.pdf  18.Näsström, G. (2008). Measurement of alignment between standards and assessment. Umeå, Sweden: Umeå universitet. Available from http://umu.diva-portal.org/smash/get/diva2:142244/FULLTEXT01 19.Näsström, G., & Henricksson, W. (2008). Alignment of standards and assessment: A theoretical and empirical study of methods for alignment. Electronic Journal of Research in Educational Psychology, 6(3), 667−690. 20.Polikoff, M.S., Porter, A.C., & Smithson, J. (2011). How well aligned are state assessments of student achievement with state content standards? American Educational Research Journal, 48(4), 965−995. http://dx.doi.org/10.3102/0002831211410684  21.Porter, A.C. (2002). Measuring the content of instruction: Uses in research and practice. Educational Researcher, 31(7), 3−14. http://dx.doi.org/10.3102/ 0013189X031007003 22.Porter, A.C. (2004). Curriculum assessment. Nashville, TN: Vanderbilt University. Available from http://datacenter.spps.org/uploads/curricassess.pdf 23.Porter, A., & Smithson, J. (2001). Defining, developing, and using curriculum indicators. Philadelphia, PA: Consortium for Policy Research in Education, University of Pennsylvania. Available from http://www.cpre.org/sites/default/files/researchreport/788_rr48.pdf 24.Reddy, V. (2006). Mathematics and Science achievement at South African schools in TIMSS 2003. Cape Town: Human Sciences Research Council. 25.Rothman, R., Slattery, J.B., Vranek, J.L., & Resnick, L.B. (2002). Benchmarking and alignment of standards and testing. Los Angeles, CA: Center for the Study of Evaluation. 26.Schwab, K. (2012). The global competitiveness report 2012−2013. Geneva: World Economic Forum. Available from http://www3.weforum.org/docs/WEF_GlobalCompetitivenessReport_2012-13.pdf  27.Spaull, N. (2011). A preliminary analysis of SACMEQ III South Africa. Stellenbosch: Department of Economics, University of Stellenbosch. Available from http://www.ekon.sun.ac.za/wpapers/2011/wp112011/wp-11-2011.pdf 28.  Webb, N.L. (1997). Criteria for alignment of expectations and assessments in mathematics and science education. Madison, WI: National Institue for Science Education. 29.Webb, N.L. (2005). Alignment analysis of mathematics standards and assessments, Michigan high schools. Madison, WI: Author.