In Tyranny of the Textbook Beverlee Jobrack offers many observations that you’ve heard before. Standards alone won’t improve achievement. Testing alone won’t improve achievement. Technology alone won’t improve achievement. What makes the book worth reading is not Jobrack’s thoughts on these topics, because they are, frankly, fairly ordinary. But her thoughts on the textbook industry make the book well worth your time. The kernel of her argument has three pieces: (1) Textbook development: Textbooks are developed based on tradition and based on competitors’ products. No one in the publishing industry worries about whether the materials are effective. As Jobrack notes, publishers are for-profit enterprises. They need decision-makers to adopt their textbooks. Decision-makers do not base adoptions on effectiveness—or at least, publishers believe that they do not. (2) Textbook adoption: What factors drive adoptions? To the extent that teachers have any input, it will be teacher leaders, and they already teach well. They have an existing set of lesson plans that work well. So they are not interested in a textbook that would necessitate rewriting all of those lesson plans. So new textbooks tend to be conservative. Further, just three publishing companies account for 75% of the market. So most of the books look the same. Consequently, relatively trivial features have an outsize influence on adoption decisions. Trivial features like the cover design. Like the font size. Like whether the important features are clearly labeled or a bit more difficult to find. Content matters to adoptions, according to Jobrack, only insofar as the publishers ensure that all of the state standards are “covered.” But she goes on to point out that there is little or no attention paid to ordering and presenting this content in a way to ensure that students learn. Again, effectiveness of learning is simply not on the publishers radar screen. (3) Why textbooks matter: Jobrack argues that textbooks are hugely important because they constitute a de facto curriculum. Beginning teachers are overwhelmed by the prospect of writing lesson plans, and so depend heavily instructional materials provided by publishers. Is Jobrack right about all this? She ought to know whereof she speaks. She was promoted through the editorial ranks until she was the editorial director of SRA/McGraw-Hill. Still, we should bear in mind that these are mostly Jobrack’s impressions, not a systematic study of publishing business practices. I admit that I’m probably more ready to believe Jobrack on publishing because her description so often matches my own experience. Like the beginning teachers she describes, when I first started teaching cognitive psychology, I relied heavily on published materials. I laid out four textbooks on my desk, used the sequence of topics they all shared, and cobbled together lectures by stealing the best stuff from each. I saw the conservatism Jobrack describes much later when I prepared to write my cognitive textbook and told my editor that I wanted to do something really different than what was currently on the market. Her response: “Okay, but don’t make it more than about 20% different or you’ll never get any adoptions.” A point Jobrack makes indirectly but strikes me as more important than she realizes is the role of measurement. Jobrack notes that publishers would be motivated to make textbooks effective if that drove the market. Well, in order to know whether they are effective, we—teachers, administrators, parents, researchers, policymakers—need to agree on what we mean by effective and on a way to measure it. The textbook problem brings fresh urgency to this issue. Whether Jobrack is right or not, I hope this book will prompt greater discussion about textbooks, and greater scrutiny of adoption processes. 5 Comments The latest on intelligence 05/10/2012
You may remember The Bell Curve. The book was published in 1994 by Richard Herrnstein and Charles Murray, and it argued that IQ is largely determined by genetics and little by the environment. It further argued that racial differences in IQ tests scores were likely due to genetic differences among the races. A media firestorm ensued, with most of the commentary issuing from people without the statistical and methodological background to address the core claims of the book. The American Psychological Association created a panel of eminent researchers to write a summary of what was known about intelligence, which would presumably contradict many of these claims. The panel published the article in 1996, a thoughtful rebuttal of many of the inaccurate claims in The Bell Curve, but also a very useful summary of what some of the best researchers in the field could agree on when it came to intelligence. Now there's an update. A group of eminent scientists thought the time was ripe to provide the field with another status-of-the-field statement. They argue that there have been three big changes in the 15 years since the last report: (1) we know much more about the biology underlying intelligence; (2) we have a much better understanding of the impact of the environment on intelligence, and that impact is larger than was suspected; (3) we have a better understanding of how genes and the environment interact. Some of the broad conclusions are listed below (note that these are close paraphrases of the article's abstract).
Neisser, U.; Boodoo, G.; Bouchard, T. J. , J.; Boykin, A. W.; Brody, N.; Ceci, S. J.; Halpern, D. F.; Loehlin, J. C. et al (1996). Intelligence: Knowns and Unknowns. American Psychologist, 51: 77. Nisbett, R. E., Aronson, J., Blair, C., Dickens, W., Flynn, J., Halpern, D. F., & Turkheimer, E. (2012, January 2). Intelligence: New Findings and Theoretical Developments. American Psychologist, 67, 130-159. One finding (from Seymour, Aro & Erskine, 2003) illustrated in one figure (Figure 5.3 from Stan Dehaene's marvelous book,Reading in the Brain.). The figure shows errors in word reading at the end of first grade, by country. Are we to conclude that the differences are due to educational practice? The vaunted Finnish system shows smashing results even at this early age, whereas the degenerate British system can't get it right? Countrywide differences in instruction could play a role, but Dehaene emphasize that the countries in which children make a lot of errors--Portugal, France, Denmark, and especially Britain--just happen to have deeper orthographies. A shallow orthography means that there is a straightforward correspondence between letters and phonemes. English, in contrast, has one of the deepest (most complex) orthographies among the alphabetic languages: for example, the letter combination "gh" if pronounced differently in in "ghost," "eight," and "enough." In short, children learning to read English have a difficult task in front of them--and so too, therefore, do teachers. Is there a lesson to be drawn here? To me, the difficult orthography of English highlights the importance of careful sequencing in the learning of grapheme-phoneme pairs, along with a limited number of sight words--sequencing that exploits the regularities that exist, and bring children as swiftly as possible to the point that they can read texts and so feel a sense of accomplishment. In Italy, for example, the order in which grapheme-phoneme pairs are taught would matter much less because there simply is not that much to learn. Several months of instruction is sufficient for most children to reach a point that they can decode most texts. The deep orthography of English also sheds light on why American schools spends as much time on English-language arts (ELA) as they do: something like two-third of instructional time in the first grade (NICHD Early Child Care Research Network, 2002). One might draw the conclusion that the difficulty of the task in reading requires enormous amounts of time. Another point of view--one I share--is that this practice places too much emphasis on ELA at the expense of other content, and runs a high risk of discouraging kids who might become passionate about science, or history, or geography, but won't because the early elementary years contain so little content beyond ELA and mathematics. I think it would worth our accepting slower progress in reading in exchange for broader subject-matter coverage in early grades--coverage that will actually pay dividends for reading comprehension in later grades. Dehaene, S. (2009). Reading in the Brain. New York: Viking. National Institute of Child Health and Human Development Early Child Care Research Network (2002). The Relation of Global First-Grade Classroom Environment to Structural Classroom Features and Teacher and Student Behaviors. The Elementary School Journal, 102, 367-387 Seymour, P. H. K., maro, M., & Erskine, J. M. (2003). Foundation literacy acquisition in European orthographies. British Journal of Psychology, 94, 143-174. Book porn 05/03/2012
Most educators are avid readers. Most avid readers love books not only for the purpose of reading them, but as physical objects. There is a marvelous section of the website Reddit called BookPorn, in which users submit beautiful and interesting pictures of books: libraries, book stores, unusual book shelves, individual volumes of interest, and the occasional celebrity. Here are a few samples. (Click for larger versions.) ![]() Library, Trinity College, Dublin I was just at a reading conference and gave a talk on reading comprehension strategies. I’ve written about them before (article here). The next paragraph provides just a brief summary of what I’ve written. The figure below shows the strategies themselves, if you’re not familiar with them (click the image for a larger version). The short version of my conclusion is that they don’t really improve the comprehension process per se. Rather, they help kids who have become good decoders to realize that the point of reading is communication. And that if they can successfully say written words aloud but cannot understand what they’ve read, that’s a problem. Evidence for this point of view include data that kids don’t benefit much from reading comprehension instruction after 7th grade, likely because they’ve all drawn this conclusion, and that increased practice with reading comprehension strategies doesn’t bring any improved benefit. It’s a one-time increment. How much time is devoted to reading comprehension strategy instruction? I can’t find good (or poor) data on this question, and I doubt it exists. There is so much variation among districts (and probably even classrooms) on this issue, it’s hard to draw a conclusion with much confidence. Any time I talk about reading, a lot of teachers, coaches, and administrators tell me that enormous amounts of time go to reading comprehension strategy instruction in their district—but I’m sure the people who make sure to mention this to me are not a random sample. Whatever the proportion of time, much of it is wasted, at least if educators think it’s improving comprehension, because the one-time boost to comprehension can be had for perhaps five or ten sessions of 20 or 30 minutes each. Some reading comprehension strategies might be useful for other reasons. For example, a teacher might want her class create a graphic organizer as a way of understanding how an author builds narrative arc The wasted time obviously represents a significant opportunity cost. But has anyone ever considered that implementing these strategies make reading REALLY BORING? Everyone agrees that one of our long-term goals in reading instruction is to get kids to love reading. We hope that more kids will spend more time reading and less time playing video games, watching TV, etc. How can you get lost in a narrative world if you think you’re supposed to be posing questions to yourself all the time? How can a child get really absorbed in a book about ants or meteorology if she thinks that reading means pausing every now and then to anticipate what will happen next, or to question the author’s purpose? To me, reading comprehension strategies seem to take a process that could bring joy, and turn it into work. Should kids be allowed to chew gum in class? If a student said "but it helps me concentrate. . ." should we be convinced? If it provides a boost, it's short-lived. It's pretty well established that a burst of glucose provides a brief cognitive boost (see review here), so the question is whether chewing gum in particular provides any edge over and above that, or whether a benefit would be observed when chewing sugar-free gum. One study (Wilkinson et al., 2002) compared gum-chewing to no-chewing (and "sham chewing" in which subjects were to pretend to chew gum, which seems awkward). Subjects performed about a dozen tasks, including some of vigilance (i.e., sustaining attention), short-term and long term memory. Researchers reported some positive effect of gum-chewing for four of the tests. It's a little hard to tell from the brief write-up, but it appears that the investigators didn't correct their statistics for the multiple tests. This throw-everything-at-the-wall-and-see-what-sticks may be a characteristic of this research. Another study (Smith, 2010) took that same approach and concluded that there were some positive effects of gum chewing for some of the tasks, especially for feelings of alertness. (This study did not use sugar-free gum so it's hard to tell whether the effect is due to the gum or the glucose.) A more recent study (Kozlov, Hughes & Jones, 2012) using a more standard short-term memory paradigm, found no benefit for gum chewing. What are we to make of this grab-bag of results? (And please note this blog does not offer an exhaustive review.) A recent paper (Onyper, Carr, Farrar & Floyd, 2011) offers a plausible resolution. They suggest that the act of mastication offers a brief--perhaps ten or twenty minute--boost to cognitive function due to increased arousal. So we might see benefit (or not) to gum chewing depending on the timing of the chewing relative to the timing of cognitive tasks. The upshot: teachers might allow or disallow gum chewing in their classrooms for a variety of reasons. There is not much evidence to allow it for a significant cognitive advantage. EDIT: Someone emailed to ask if kids with ADHD benefit. The one study I know of reported a cost to vigilance with gum-chewing for kids with ADHD Kozlov, M. D., Hughs, R. W. & Jones, D. M. (2012). Gummed-up memory: chewing gum impairs short-term recall. Quarterly Journal of Experimental Psychology, 65, 501-513. Onyper, S. V., Carr, T. L, Farrar, J. S. & Floyd, B. R. (2011). Cognitive advantages of chewing gum. Now you see them now you don't. Appetite, 57, 321-328. Smith, A. (2010). Effects of chewing gum on cognitive function, mood and physiology in stressed an unstressed volunteers. Nutritional Neuroscience, 13, 7-16. Wilkinson, L., Scholey, A., & Wesnes, K. (2002). Chewing gum selectively improves aspects of memory in healthy volunteers. Appetite, 38, 235-236. The New York Times Magazine has an article on working memory training and the possibility that it boosts on type of intelligence. I think the article is a bit--but only a bit--too optimistic in its presentation. The article correctly points out that a number of labs have replicated the basic finding: training with one or another working memory task leads to increases in standard measures of fluid intelligence, most notably, Raven's Progressive Matrices. Working memory is often trained with a N-back task, shown in the figure at left from the NY Times article. You're presented with a series of stimuli, e.g. you're hearing letters. You press a button if a stimulus is the same as the one before (N=1) or the time before last (N=2) or. the time before that (N=3). You start with N=1 and N increases if you are successful. (Larger N makes the task harder.) To make it much harder, researchers can add a second stream of stimuli (e.g., the colored squares shown at left) and ask you to monitor BOTH streams of stimuli in an N-back task. That is the training task that you are to practice. (And although the figure calls it a "game" it's missing one usual feature of a game; it's no fun at all.) There are two categories of outcome measures taken after training. In a near-transfer task, subjects are given some other measure of working memory to see if their capacity has increased. In a far-transfer task, a task is administered that isn't itself a test of working memory, but of a process that we think depends on working memory capacity. All the excitement has been about far-transfer measures, namely that this training boosts intelligence, about which more in a moment. But it's actually pretty surprising and interesting that labs are reporting near-transfer. That's a novel finding, and contradicts a lot of work that's come before, showing that working memory training tends to benefit only the particular working memory task used during training, and doesn't even transfer to other working memory tasks. The far-transfer claim has been that the working memory training boosts fluid intelligence. Fluid intelligence is one's ability to reason, see patterns, and think logically, independent of specific experience. Crystallized intelligence, in contrast, is stuff that you know, knowledge that comes from prior experience. You can see why working memory capacity might lead to more fluid intelligence--you've got a greater workspace in which to manipulate ideas. A standard measure of fluid intelligence is the Ravens Progressive Matrices task, in which you see a pattern of figures, and you are to say which of a several choices would complete the pattern, as shown below. So, is this finding legit? Should you buy an N-back training program for your kids? I'd say the jury is still out. The Times quotes Randy Engle--a highly regarded working memory researcher--on the subject, and he can hardly conceal his scorn: “May I remind you of ‘cold fusion’?” Engle--who is not one of those scientists who has made a career out of criticizing others--has a lengthy review of the working memory training literature which you can read here. Another recent review (which was invited for the journal Brain & Cognition) concluded "Sparse evidence coupled with lack of scientific rigor, however, leaves claims concerning the impact and duration of such brain training largely unsubstantiated. On the other hand, at least some scientific findings seem to support the effectiveness and sustainability of training for higher brain functions such as attention and working memory." My own take is pretty close to that conclusion. There are enough replications of this basic effect that it seems probable that something is going on. The most telling criticism of this literature is that the outcome measure is often a single task. You can't use a single task like the Ravens and then declare that fluid intelligence has increased because NO task is a pure measure of fluid intelligence. There are always going to be other factors that contribute to task performance. The best measure of an abstract construct like "fluid intelligence" is one that uses several measures of what look to be quite different tasks, but which you have reason to think all call on fluid intelligence. Then you use statistical methods to look for shared variance among the tasks. So what we'd really like to see is better performance after working memory training on a few tasks. The fact is that in many of these studies, researchers have tried to show transfer to more than one task, and the training transfers to one, but not the other. Here's a table from a 2010 review by Torkel Klingberg showing this pattern. (Click the image to see a larger version.) This work is really just getting going, and the inconsistency of the findings means one of two things. Either the training regimens need to be refined, whereupon we'll see the transfer effects more consistently, OR the benefits we've seen thus far were mostly artifactual, a consequence of uninteresting quirks in the designs of studies or the tasks My guess is that the truth lies somewhere between these two--there's something here, but less than many people are hoping. But it's too early to say with much confidence. Should boys have male teachers? 04/18/2012
In primary school, a student's relationship with his or her teacher has a significant impact on the student's academic progress. Students with positive relationships are more engaged and learn more (e.g., Hughes et al, 2008). In addition, teachers are more likely to have negative relationships with boys than with girls (e.g., Hamre & Pianta, 2001). Previous research has not, however, accounted for the gender of the teacher. Perhaps conflict is more likely when teacher and student are of different sexes, and because there are more female than male teachers, we end up concluding that boys tend not to get along with their teachers. A new study (Split, Koomen & Jak, in press) indicates that's not the case. This appears to be the first large-scale study that examined teacher-student relationships in primary school while accounting for the sex of teachers. Teachers completed questionnaires about their relationships with their students. The questionnaires measured three constructs:
For Closeness, female teachers generally felt closer to their students than male teachers. Male teachers did not feel closer to either boys or girls, but female teachers felt closer to girls than they did to boys. For Conflict, female teachers reported less conflict than male teachers did. Both male and female teachers reported less conflict with girls than with boys. For Dependency, female teachers reported less dependency than male teachers did. There were no differences among boys and girls on this measure. This research has been difficult to conduct, simply because most groups of teachers don't have enough male teachers in elementary grades to conduct a meaningful analysis. This is just one study, but the results indicate that all teachers--male and female--have a tougher time with boys. More conflictual relationships are reported with boys than with girls, and female teachers report less close relationships with boys. Hamre, B. K., & Pianta, R. C. (2001). Early teacher–child relationships and the trajectory of children's school outcomes through eighth grade. Child Development, 72, 625–638. Hughes, J. N., Luo, W., Kwok, O. M., & Loyd, L. K. (2008). Teacher–student support, effortful engagement, and achievement: A 3-year longitudinal study. Journal of Educational Psychology, 100, 1–14. Split, J. L., Koomen, H. M. Y., & Jak, S. (in press) Are boys better off with male and girls with female teachers? A multilevel investigation of measurement invariance and gender match in teacher-student relationship quality. Journal of School Psychology. Most colleges have strict polices about student plagiarism, often including stringent penalties for those who violate the rules. (At the University of Virginia, where I teach, the penalty is expulsion.) Yet infractions occur. Why? My own intuition has been that plagiarism is often due to oversight or panic. A student will fall behind and, with a deadline looming, get sloppy in the writing of a paper: a few sentences or even a paragraph makes its way into the student paper without attribution. In the rush to finish the student forgets about it, or decides it doesn't matter. Thomas Dee and Brian Jacob had a different idea. Some data (e.g., Power, 2009) indicate that even college students are not very knowledgeable about what constitutes plagiarism and how to avoid it, and so many instances of plagiarism may actually be accidental. Given the stiff penalties, why don't students bone up on the rules? Dee & Jacob point out that this may be an instance of rational ignorance. That is, it's logical for students not to try to obtain better information about plagiarism; the cost of learning this information is relatively high because the rules seem complex, and the payoff seems small because the odds of punishment for plagiarism are low. Dee and Jacob's idea: reduce plagiarism by reducing the cost of learning about what constitutes plagiarism. Their experiment included 1,256 papers written by 573 students in a total of 28 humanities and social-science courses during a semester a selective liberal arts college. Half of the students were required to complete a "short but detailed interactive tutorial on understanding and avoiding plagiarism." The student papers were analyzed with plagiarism detection software. In the control group, plagiarism was observed in 3.3 percent of papers. (Almost every instance was a matter of using sentences without attribution.) Students who had completed the tutorial had a plagiarism rate of about 1.3% Thus, a relatively simple and quite inexpensive intervention may be highly effective in reducing at least one variety of plagiarism. Replicating this finding in other types of coursework--science and mathematics--would be important, as would replication at other institutions, including less selective colleges, and high schools. Even with those limitations, this is a promising start. This paper was just published as: Dee, T. S. & Jacob, B. A. (2012) Rational ignorance in education: A field experiment in student plagiarism. Journal of Human Resources, 47, 397-434. (I've linked to the NBER publication above because it's freely downloadable.) Power, L. G. (2009). University Students’ Perceptions of Plagiarism. Journal of Higher Education, 80, 643-662. Is the internet killing books? 04/12/2012
A lot of people have been tweeting or posting to Facebook a link to a graph which was posted to the Atlantic Monthly website on April 6, with this provocative headline: And here's the chart you are urged to show, derived from Gallup poll numbers over the years. We're invited to conclude that because the percentage of readers increased after the advent of the Internet, the Internet did not have a negative impact on book reading. All of the postings I've seen have apparently taken this conclusion at face value, so it seems like it's worth going through why this conclusion is not justified. First, lots of stuff happened between 1949 and 2005. For example, household income increased for middle- and high-income families. It could be that the internet has negatively affected book reading, but a number of other factors have increased it, so overall we see an increase. The idea that other factors are having a big impact on reading seems legitimate, given the big increase in reading from 1957 to 1990, a year in which very few people had internet access. So perhaps those factors are continuing to boost reading, despite the negative impact of the internet. The type of analysis we're being asked to perform implicitly is a variety of time series analysis. It's useful in situations where one can't conduct an experiment with a control group. For example, I might track classroom behavior daily for one month using a scale that runs from 1-10. I find that it ranges from 4 to 6 every day. The teacher implements a new classroom management scheme, and from that day forward, classroom behavior ranges from 7 to 9 every day. Interpreting the new classroom management scheme as the cause of the change is still not certain--some outside-of-class-factor could have just happened to have occurred on that same day that prompted the change in classroom behavior. But the fact that the change in behavior happened over such a narrow time window makes us more confident that such a coincidence is unlikely. And of course it helps my confidence that it's the same class. In the chart above, we're looking at events that happened over years, with different people who we hope had similar characteristics. To really get at the effect of the internet on reading habits, you need more finely controlled data. A number of studies were done in the 1950s, examining the effect of television on reading habits. The best of these (see Coffin, 1955) measured reading habits in a city that did not have a television station but was poised to get one, and then measured reading habit again after people in the city had access to television. (The results showed an impact on reading, especially light fiction. Newspaper reading was mostly unaffected, as television news was still in its infancy. Radio listening took a big hit.) Second, we might note that the most recent year on the chart is 2005. According to the Pew Internet and American Life Project, only 55% of Americans had internet access at that point, and only 30% had broadband. So perhaps the negative impact wouldn't be observed until more people had internet access. This brings us to more serious studies of whether use of the Internet displaces other activities. The studies I know of (e.g., Robinson, 2010) conclude that Internet use does not displace other activities, but rather enhances them. The data are actually a bit weird--IT use is associated with more of everything: more leisure reading, more newspaper reading, more visits to museums, playing music, volunteering, and participating in sports. The obvious explanation would be that heavy IT users have higher incomes and more leisure time, but the relationships held after the author controlled for education, age, race, gender, and income--though most of these effects were much attenuated. This research is really just getting going, and I don't think we're very close to understanding problem. In sum, the question of what people do with their leisure time and how one activity influences another is complicated. One chart will not settle it. Coffin, T. E. (1955) Television's impact on society. American Psychologist, 10, 630-641. Robinson, J. P. (2011) Arts and leisure participation among IT users: Further evidence of time enhancement over time displacement. Social Science Computer Review, 29, 470-480. |