In corpus-based interpreting studies, two persistent challenges are the time-consuming and labour-intensive transcription of spoken data and the identification of prosodic properties. This paper addresses these challenges by exploring methods for the automatic compilation of multimodal interpreting corpora, with a focus on English/Chinese consecutive interpreting. The results show that: 1) automatic transcription can achieve an accuracy rate of 95.3% in transcribing consecutive interpretations; 2) prosodic properties related to filled pauses, unfilled pauses, articulation rate, and mispronounced words can be extracted automatically using our rule-based programming; 3) mispronounced words can be identified effectively with the Confidence Measure, with any word whose Confidence Measure is lower than 0.321 treated as mispronounced; 4) automatic alignment can be achieved by combining automatic segmentation, sentence embedding, and alignment techniques. This study contributes to interpreting studies by broadening the empirical understanding of orality, enabling multimodal analyses of interpreting products, and providing a new methodological solution for the construction and use of multimodal interpreting corpora. It also has implications for exploring the applicability of new technologies in interpreting studies.
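As an illustration of results 2) and 3), the following minimal sketch shows how word-level recogniser output might be screened for mispronounced words and unfilled pauses. It is not the authors' program. The `Word` structure, the function names, the sample data, and the 0.25-second minimum pause length are assumptions introduced here for illustration; only the 0.321 Confidence Measure threshold comes from the abstract.

```python
from dataclasses import dataclass

CONFIDENCE_THRESHOLD = 0.321  # threshold reported in the abstract
MIN_PAUSE_SECONDS = 0.25      # assumption: not stated in the abstract


@dataclass
class Word:
    """Hypothetical word-level output of a speech recogniser."""
    text: str
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # Confidence Measure between 0 and 1


def flag_mispronounced(words, threshold=CONFIDENCE_THRESHOLD):
    """Return the words whose confidence is strictly below the threshold."""
    return [w.text for w in words if w.confidence < threshold]


def unfilled_pauses(words, min_gap=MIN_PAUSE_SECONDS):
    """Return (start, end) spans of silence between consecutive words."""
    pauses = []
    for prev, nxt in zip(words, words[1:]):
        if nxt.start - prev.end >= min_gap:
            pauses.append((prev.end, nxt.start))
    return pauses


# Invented sample data, for illustration only.
words = [
    Word("the", 0.00, 0.20, 0.95),
    Word("economy", 0.25, 0.80, 0.30),
    Word("grows", 1.50, 1.90, 0.88),
]

print(flag_mispronounced(words))
print(unfilled_pauses(words))
```

Keeping both thresholds as named constants makes it easy to rerun the screening under a different cut-off.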