The International Research Foundation
for English Language Education
ASSESSING SECOND LANGUAGE LISTENING SKILLS: SELECTED REFERENCES
(Last updated 4 September 2017)
Ableeva, R. (2007). Assessing listening for development. In R. Alanen & S. Poyhonen (Eds.), Language in action: Vygotsky and Leontievian legacy today (pp. 352-379). Newcastle upon Tyne, England: Cambridge Scholars.
Ableeva, R. (2008). The effects of dynamic assessment on L2 listening comprehension. In J. P. Lantolf & M. E. Poehner (Eds.), Sociocultural theory and the teaching of second languages (pp. 57-86). London, UK: Equinox.
Ableeva, R., & Lantolf, J. P. (2011). Mediated dialogue and the microgenesis of second language listening comprehension. Assessment in Education, 18, 133-149.
Arnold, J. (2000). Seeing through listening comprehension exam anxiety. TESOL Quarterly, 34, 777-786.
Aryadoust, V. (2013). Building a validity argument for a listening test of academic proficiency. Cambridge, UK: Cambridge Scholars Publishing.
Bacheller, F. (1980). Communicative effectiveness as predicted by judgments of the severity of learner errors in dictation. In J. W. Oller & K. Perkins (Eds.), Research in language testing (pp. 66-71). Rowley, MA: Newbury House.
Bae, J., & Bachman, L. F. (1998). A latent variable approach to listening and reading: Testing factorial invariance across two groups of children in the Korean/English Two-Way Immersion Program. Language Testing, 15(3), 380-414.
Batty, A. O. (2014). A comparison of video- and audio-mediated listening tests with many-facet Rasch modeling and differential distractor functioning. Language Testing, 32(1), 3-20.
Bejar, I., Douglas, D., Jamieson, J., Nissan, S., & Turner, J. (2000). TOEFL 2000 listening framework: A working paper. TOEFL Monograph Series (No. 19). Princeton: Educational Testing Service.
Berne, J. E. (1995). How does varying pre-listening activities affect second language listening comprehension? Hispania, 78, 316-329.
Blau, E. K. (1990). The effect of syntax, speed, and pauses on listening comprehension. TESOL Quarterly, 24(4), 746-753.
Boyle, J. P. (1983). Problems in testing listening comprehension. World Language English, 3(1), 44-48.
Breeze, R., & Miller, P. (2012). Predictive validity of the IELTS listening test as an indicator of student coping ability in English-medium undergraduate courses in Spain. In L. Taylor & C. Weir (Eds.), Studies in Language Testing 34: Research in reading and listening assessment (pp. 487-518). Cambridge: Cambridge University Press.
Brindley, G. (1997). Investigating second language listening ability: Listening skills and item difficulty. In G. Brindley & G. Wigglesworth (Eds.), Access: Issues in language test design and delivery (pp. 65-85). Sydney, Australia: Macquarie University, National Center for English Language Teaching and Research.
Brindley, G. (1998). Assessing listening abilities. Annual Review of Applied Linguistics, 18, 171-191.
Brindley, G., & Slatyer, H. (2002). Exploring task difficulty in ESL listening assessment. Language Testing, 19(4), 369-394.
Buck, G. (1988). Testing listening comprehension in Japanese university entrance examinations. JALT Journal, 10(1 & 2), 15-42.
Buck, G. (1991). The testing of listening comprehension: An introspective study. Language Testing, 8(1), 67-91.
Buck, G. (2001). Assessing listening. Cambridge: Cambridge University Press.
Buck, G., & Tatsuoka, K. (1998a). Application of rule-space methodology to listening test data. Language Testing, 15, 118-142.
Buck, G., & Tatsuoka, K. (1998b). Application of rule-space procedure to language testing: Examining attributes of a free response listening test. Language Testing, 15(2), 119-157.
Call, M. (1985). Auditory short-term memory, listening comprehension and the input hypothesis. TESOL Quarterly, 19(4), 765-781.
Canale, M. (1984). Considerations in the testing of reading and listening proficiency. Foreign Language Annals, 17(4), 349-357.
Canale, M., Child, J. R., Jones, R. L., Liskin-Gasparro, J. E., & Lowe, P. (1984). The testing of reading and listening proficiency: A synthesis. Foreign Language Annals, 17(4), 389-391.
Carrell, P. L. (2007). Notetaking strategies and the relationship to performance on listening comprehension and communicative assessment tasks (TOEFL Monograph Series No. 35). Princeton, NJ: Educational Testing Service.
Cervantes, R., & Gainer, G. (1992). The effects of syntactic simplification and repetition on listening comprehension. TESOL Quarterly, 26, 767-770.
Chang, A. C.-S., & Read, J. (2006). The effects of listening support on the listening performance of EFL learners. TESOL Quarterly, 40, 375-397.
Child, J. R. (1984). Testing language proficiency in the receptive skills: Native vs. learner performances. Foreign Language Annals, 17(4), 361-364.
Cox, T. L., & Clifford, R. (2014). Empirical validation of listening proficiency guidelines. Foreign Language Annals, 47(3), 379-403.
de Jong, J. (1984). Testing foreign language listening comprehension. Language Testing, 1(1), 97-100.
de Jong, J., & Glas, C. A. W. (1987). Validation of listening comprehension tests using item response theory. Language Testing, 4(2), 170-194.
Douglas, D. (1988). Testing listening comprehension in the context of the ACTFL Proficiency Guidelines. Studies in Second Language Acquisition, 10(2), 245-261.
Douglas, D. (1989). Testing listening comprehension in the context of the ACTFL Proficiency Guidelines. Applied Language Learning, 1(1), 53-73.
Dunkel, P. A. (1988). The content of L1 and L2 students’ lecture notes and its relation to test performance. TESOL Quarterly, 22(2), 259-281.
Dunkel, P. A. (1991). Computerized testing of nonparticipatory L2 listening comprehension proficiency: An ESL prototype development effort. The Modern Language Journal, 75(1), 64-73.
Dunkel, P. A. (1991). The use of PC-generated speech technology in the development of an L2 listening comprehension proficiency test: A prototype design effort. In M. C. Pennington & V. Stevens (Eds.), Computers in applied linguistics: An international perspective (pp. 273-293). Clevedon, England: Multilingual Matters.
Dunkel, P. A., Henning, G., & Chaudron, C. (1993). The assessment of an L2 listening comprehension construct: A tentative model for test specification and development. Modern Language Journal, 77(2), 180-191.
Emery, P. G. (1980). Evaluating spoken English: A new approach to the testing of listening comprehension. English Language Teaching Journal, 34(2), 96-98.
Engelskirchen, A., Cottrell, E., & Oller, J., Jr. (1981). A study of the reliability and validity of the Ilyin Oral Interview. In A. Palmer, R. Groot, & G. Trosper (Eds.), The construct validation of tests of communicative competence (pp. 83-93). Washington, DC: TESOL.
Enright, M. K., Bridgeman, B., Eignor, D., Lee, Y. W., & Powers, D. E. (2008). Prototyping measures of listening, reading, speaking, and writing. In C. A. Chapelle, M. K. Enright, & J. M. Jamieson (Eds.), Building a validity argument for the Test of English as a Foreign Language (pp. 145–186). New York, NY: Routledge.
Feak, C., & Salehzadeh, J. (2001). Challenges and issues in developing an EAP video listening placement assessment: A view from one program. English for Specific Purposes, 20, 477-493.
Field, J. (2013). Good at listening, or good at listening tests? In T. Pattison (Ed.), IATEFL 2012: Glasgow Conference Selections (pp. 64-66). Canterbury, UK: IATEFL.
Flowerdew, J., & Miller, L. (2005). Second language listening: Theory and practice. Cambridge: Cambridge University Press.
Fouly, K., & Cziko, G. A. (1985). Determining the reliability, validity, and scalability of the graduated dictation test. Language Learning, 35, 555-566.
Freedle, R., & Kostin, I. (1999). Does the text matter in a multiple-choice test of comprehension? The case for the construct validity of TOEFL’s minitalks. Language Testing, 16(1), 2-35.
Frost, K., Elder, C., & Wigglesworth, G. (2012). Investigating the validity of an integrated listening-speaking task: A discourse-based analysis of test takers’ oral performances. Language Testing, 29(3), 345-369.
Gales, S., Gradman, H. L., & Spolsky, B. (1977). Toward the measurement of functional proficiency: Contextualization of the noise test. TESOL Quarterly, 11(1), 51-57.
Gefen, R. (1981). Testing listening comprehension in the schools of Israel. English Language Teaching Journal, 35(3), 252-255.
Ginther, A. (2002). Context and content visuals and performance on listening comprehension stimuli. Language Testing, 19(2), 133-167.
Goh, C. (2010). Listening as process: Learning activities for self-appraisal and self-regulation. In N. Harwood (Ed.), English language teaching materials: Theory and practice (pp. 179-206). Cambridge: Cambridge University Press.
Gorsuch, G. J. (2004). Test takers’ experiences with computer-administered listening comprehension tests: Interviewing for qualitative explorations of test validity. CALICO, 21(2), 339-372.
Gruba, P. (1997). The role of video media in listening assessment. System, 25(3), 335-345.
Hale, G. A., & Courtney, R. (1994). The effects of note-taking on listening comprehension in the Test of English as a Foreign Language. Language Testing, 11(1), 29-47.
Harding, L. (2011). Accent and listening assessment. New York, NY: Peter Lang.
Harding, L. (2012). Accent, listening assessment and the potential for a shared-L1 advantage: A DIF perspective. Language Testing, 29(2), 163-180.
Harding, L., & Ryan, K. (2009). Decision making in marking open-ended listening test items: The case of the OET. Spaan Fellow Working Papers in Second or Foreign Language Assessment, 7, 99–114.
Heller, A., Lynch, T., & Wright, L. (1995). A comparison of listening and speaking tests for student placement. Edinburgh Working Papers in Applied Linguistics, 6, 27-40.
Iimura, H. (2007). The listening process: Effects of question types and repetition. Language Education & Technology, 44, 75-85.
In’nami, Y. (2006). The effects of test anxiety on listening performance. System, 34, 317-340. doi:10.1016/j.system.2006.04.005
International Listening Association (1995). An ILA definition of listening. The Listening Post, 53, 1-5.
Irvine, P., Atai, P., & Oller, J. W. (1974). Cloze, dictation, and the Test of English as a Foreign Language. Language Learning, 24, 245-252.
Jensen, C., & Hansen, C. (1995). The effect of prior knowledge on EAP listening-test performance. Language Testing, 12(1), 99-119.
Jones, R. L. (1984). Testing the receptive skills: Some basic considerations. Foreign Language Annals, 17(4), 365-367.
Kaga, M. (1991). Dictation as a measure of Japanese proficiency. Language Testing, 8(2), 112-124.
Kim, Y. M., Yun, J. H., Lee, B. C., & Park, J. S. (2012). Validating 2012 English reading and listening test items for College Scholastic Ability Test. Seoul, Korea: Korea Institute for Curriculum and Evaluation.
Lee, H., & Winke, P. (2013). The differences among three-, four-, and five-option-item formats in the context of a high-stakes English-language listening test. Language Testing, 30(1), 99-123.
Lee, Y.-W., & Sawaki, Y. (2009). Application of three cognitive diagnosis models to ESL reading and listening assessments. Language Assessment Quarterly, 6(3), 239-263.
Lin, N.-H. J. (1982). Developing integrative language testing techniques: The graduate dictation and the copytest. TESL Studies, 5, 108-129.
Liskin-Gasparro, J. E. (1984). Practical considerations in receptive skills testing. Foreign Language Annals, 17(4), 369-373.
Londe, Z. C. (2009). The effects of video media in English as a second language listening comprehension tests. Issues in Applied Linguistics, 17(1), 41-50.
Long, D. R., & Macián, J. L. (1994). Listening skills: Acquisition and assessment. In C. Hancock (Ed.), Teaching, testing, and assessment: Making the connection (Northeast Conference Reports, pp. 111-138). Lincolnwood, IL: National Textbook Company.
Lund, R. J. (1991). A comparison of second language listening and reading comprehension. Modern Language Journal, 75, 196-204.
Lutje Spelberg, H. C., de Boer, P., & van den Bos, K. P. (2000). Item type comparisons of language comprehension tests. Language Testing, 17(3), 311-322.
Major, R. C., Fitzmaurice, S. F., Bunta, F., & Balasubramanian, C. (2002). The effects of nonnative accents on listening comprehension: Implications for ESL assessment. TESOL Quarterly, 36(2), 173-190.
McNamara, T. F. (1991). Test dimensionality: IRT analysis of an ESP listening test. Language Testing, 8(2), 139-159.
Mead, N., & Rubin, D. L. (1985). Assessing listening and speaking skills. Retrieved from
Morris, S. (1983). Dictation: A technique in need of reappraisal. ELT Journal, 37, 121-126.
Natalicio, D. S. (1979). Repetition and dictation as language testing techniques. Modern Language Journal, 63(4), 165-176.
Oakeshott-Taylor, J. (1979). Cloze procedure and foreign language listening skills. International Review of Applied Linguistics, 17(2), 150-158.
Ockey, G. J. (2007). Construct implications of including still image or video in computer-based listening tests. Language Testing, 24(4), 517-537.
Ockey, G. J., & French, R. (2016). From one to multiple accents on a test of L2 listening comprehension. Applied Linguistics, 37(5), 693-715.
Ockey, G. J., Papageorgiou, S., & French, R. (2016). Effects of strength of accent on an L2 interactive lecture listening comprehension test. International Journal of Listening, 30(1-2), 84-98.
Oller, J. W., Jr. (1971). Dictation as a device for testing foreign language proficiency. English Language Teaching, 23, 254-259.
Oller, J. W., Jr., & Streiff, V. (1975). Dictation: A test of grammar-based expectancies. English Language Teaching, 30, 25-36.
Olson, K. (2003). LSAT listening assessment: Theoretical background and specifications. Law School Admission Council (LSAC) Research Report 03-02. Retrieved from
Papageorgiou, S., Stevens, R., & Goodwin, S. (2012). The relative difficulty of dialogic and monologic input in a second-language listening comprehension test. Language Assessment Quarterly, 9(4), 375-397.
Progosh, D. (1996). Using video for listening assessment: Opinions of test-takers. TESL Canada Journal, 14(1), 34-46.
Randall, M. (1997). Orthographic knowledge, phonological awareness and the teaching of English: An analysis of word dictation errors in English of Malaysian secondary school pupils. RELC Journal, 28, 1-21.
Ray, S. (1991). The development and evaluation of a dictation test of English-language proficiency: A case study of the ethics of testing. Dissertation Abstracts International, 53(03), 784A. (UMI No. 9210550)
Romeo, K., & Hubbard, P. (2008). Pervasive CALL learner training for improving listening proficiency. Proceedings of the Third WorldCALL Conference, Fukuoka, Japan, August 2008.
Ross, S., & Langville, J. (1997). Negotiated discourse and interlanguage accent effects on a second language listening test. In G. Brindley & G. Wigglesworth (Eds.), Access: Issues in language test design and delivery (pp. 87-116). Sydney, Australia: Macquarie University, National Center for English Language Teaching and Research.
Rost, M. (2002). Teaching and researching listening. Harlow, England: Pearson Education.
Sakai, H. (2009). Effect of repetition of exposure and proficiency level in L2 listening tests. TESOL Quarterly, 43(2), 360-372.
Savignon, S. (1982). Dictation as a measure of communicative competence in French as a second language. Language Learning, 32(1), 33-51.
Scott, M. L., Stansfield, C. W., & Kenyon, D. M. (1996). Examining validity in a performance test: The listening summary translation exam (LSTE) - Spanish version. Language Testing, 13(1), 83-109.
Sheerin, S. (1987). Listening comprehension: Teaching or testing. ELT Journal, 41(2), 126-131.
Sherman, J. (1997). The effects of question preview in listening comprehension tests. Language Testing, 14, 185-213.
Shin, D. (1998). Using videotaped lectures for testing academic listening proficiency. International Journal of Listening, 12, 57-80.
Shohamy, E., & Inbar, O. (1991). Validation of listening comprehension tests: The effect of text and question type. Language Testing, 8(1), 23-40.
Stansfield, C. W. (1985). A history of dictation in foreign language teaching and testing. The Modern Language Journal, 69(2), 121-128.
Suvorov, R. (2009). Context visuals in L2 listening tests: The effects of photographs and video vs. audio-only format. In C. A. Chapelle, H. G. Jun, & I. Katz (Eds.), Developing and evaluating language learning materials (pp. 53-68). Ames, IA: Iowa State University.
Suvorov, R. (2015). The use of eye tracking in research on video-based second language (L2) listening assessment: A comparison of context videos and content videos. Language Testing, 32(4), 463-483.
Tatsuki, D. H. (1996). The relationship of dictation errors to learner proficiency. Dissertation Abstracts International, 57(09), 3903A. (UMI No. 9707025)
Tonnes-Schnier, F., & Scheibner-Herzig, G. (1988). Measuring communicative effectiveness through dictation. IRAL, XXVI, 35-43.
Vandergrift, L., & Goh, C. (2009). Teaching and testing listening comprehension. In M. H. Long & C. J. Doughty (Eds.), The handbook of language teaching (pp. 395-411). West Sussex, UK: Wiley-Blackwell.
Wagner, E. (2002). Video listening tests: A pilot study. Working Papers in TESOL & Applied Linguistics, Teachers College, Columbia University, 2(1). Retrieved from
Wagner, E. (2007). Are they watching? Test-taker viewing behavior during an L2 video listening test. Language Learning & Technology, 11(1), 67-86.
Wagner, E. (2008). Video listening tests: What are they measuring? Language Assessment Quarterly, 5(3), 218-243.
Wagner, E. (2010). Test-takers’ interaction with an L2 video listening test. System, 38, 280-291.
Wagner, E. (2010). The effect of the use of video texts on ESL listening test-taker performance. Language Testing, 27, 493-513.
Wagner, E. (2013). An investigation of how the channel of input and access to test questions affect L2 listening test performance. Language Assessment Quarterly, 10(2), 178-195.
Wagner, E., & Toth, P. D. (2014). Teaching and testing L2 Spanish listening using scripted vs. unscripted texts. Foreign Language Annals, 47(3), 404-422.
Weir, C. J., & Vidaković, I. (2013). The measurement of listening ability 1913-2012. In C. J. Weir, I. Vidaković, & E. D. Galaczi (Eds.), Measured constructs: A history of English language examinations 1913-2012. Studies in Language Testing 37 (pp. 347-419). Cambridge, UK: Cambridge University Press.
Wilson, M. (2003). Discovery listening – improving perceptual processing. ELT Journal, 57(4), 335-343.
Wu, Y. (1998). What do tests of listening comprehension test? A retrospection study of EFL test-takers performing a multiple-choice task. Language Testing, 15(1), 21-44.
Wyatt, D. H. (1984). Computer-assisted teaching and testing of reading and listening. Foreign Language Annals, 17(4), 393-407.
Yi’an, W. (1998). What do tests of listening comprehension test? A retrospective study of EFL test-takers performing a multiple-choice task. Language Testing, 15(1), 21-44.
Youn, S. J., & Im, S. (2016). Testing measurement invariance of an EAP listening placement test across undergraduate and graduate students. Papers in Language Testing and Assessment, 5(2), 26-42.
Young, D. J. (1987). The relationship between a communicative competence oriented dictation and ACTFL's oral proficiency interview. Hispania, 70, 643-649.
177 Webster St., #220, Monterey, CA 93940 USA
