Advancing Multimodal Collaboration Analytics

A Scoping Review

Authors

DOI:

https://doi.org/10.18608/jla.2025.8625

Keywords:

multimodal collaboration analytics, MMCA, collaborative learning, scoping review, research paper

Abstract

The advent of advanced technology has opened new horizons for studying collaborative learning, although ambiguity remains in the classification and rationale for combining modalities in multimodal collaboration analytics (MMCA). Addressing this gap is crucial for the progression of collaborative learning practices and research. This review critically examines and classifies the modalities employed in MMCA studies while elucidating the rationales for their combined use and their resulting empirical contributions to collaborative learning research. A scoping review of 36 empirical studies informs the development of a framework for classifying modalities used in MMCA. We also review the rationales underlying the use of different combinations of modalities and how MMCA literature contributes to our understanding of collaboration. The review results in a definitional framework comprising five categories: auditory, visual, physiological, kinesthetic, and tactile. The findings reveal diverse arrangements of modalities. We find that the underlying rationales for combining modalities are based on technical, practical/pedagogical, methodological, or theoretical premises, which lead to different empirical contributions. Conducting MMCA research is motivated by the need for a holistic comprehension of learner behaviours, interactions, and cognitive processes during collaboration, transcending the limitations of single modalities in isolation. These findings offer both a theoretical and practical guidepost for enhancing MMCA research and applications.

References

Andriessen, J., & Baker, M. (2020). On collaboration: Personal, educational and societal arenas. Sense-Brill.

Arksey, H., & O’Malley, L. (2005). Scoping studies: Towards a methodological framework. International Journal of Social Research Methodology, 8(1), 19–32. https://doi.org/10.1080/1364557032000119616

Blikstein, P., & Worsley, M. (2016). Multimodal learning analytics and education data mining: Using computational technologies to measure complex learning tasks. Journal of Learning Analytics, 3(2), 220–238. https://doi.org/10.18608/jla.2016.32.11

Booth, A., Sutton, A., Clowes, M., & Martyn-St James, M. (2022). Systematic approaches to a successful literature review (3rd ed.). SAGE Publications.

Buckingham Shum, S., Echeverria, V., & Martinez-Maldonado, R. (2019). The multimodal matrix as a quantitative ethnography methodology. In B. Eagan, M. Misfeldt, & A. Siebert-Evenstone (Eds.), Advances in quantitative ethnography: First international conference, ICQE 2019, Madison, WI, USA, October 20–22, 2019, proceedings (pp. 26–40). Springer Cham. https://doi.org/10.1007/978-3-030-33232-7_3

Bunt, H., Beun, R.-J., & Borghuis, T. (1998). Multimodal human–computer communication: Systems, techniques, and experiments. Springer Berlin, Heidelberg. https://doi.org/10.1007/BFb0052309

Chejara, P., Prieto, L. P., Ruiz-Calleja, A., Rodríguez-Triana, M. J., Shankar, S. K., & Kasepalu, R. (2020). Quantifying collaboration quality in face-to-face classroom settings using MMLA. In A. Nolte, C. Alvarez, R. Hishiyama, I.-A. Chounta, M. J. Rodríguez-Triana, & T. Inoue (Eds.), Collaboration technologies and social computing: 26th international conference, CollabTech 2020, Tartu, Estonia, September 8–11, 2020, proceedings (pp. 159–166). Springer Cham. https://doi.org/10.1007/978-3-030-58157-2_11

Chua, Y. H. V., Dauwels, J., & Tan, S. C. (2019). Technologies for automated analysis of co-located, real-life, physical learning spaces: Where are we now? In C. Brooks, R. Ferguson, & U. Hoppe (Program chairs), Learning Analytics to Promote Inclusion and Success: Proceedings of the 9th International Conference on Learning Analytics & Knowledge (pp. 11–20). ACM Press. https://doi.org/10.1145/3303772.3303811

Echeverria, V., Martinez-Maldonado, R., & Buckingham Shum, S. (2019). Towards collaboration translucence: Giving meaning to multimodal group data. In S. Brewster, G. Fitzpatrick, A. Cox, & V. Kostakos (Chairs), CHI 2019: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Paper 39). ACM Press. https://doi.org/10.1145/3290605.3300269

Fernandez Nieto, G. M., Kitto, K., Buckingham Shum, S., & Martinez-Maldonado, R. (2022). Beyond the learning analytics dashboard: Alternative ways to communicate student data insights combining visualisation, narrative and storytelling. In A. F. Wise, R. Martinez-Maldonado, & I. Hilliger (Program chairs), Learning Analytics for Transition, Disruption and Social Change: The Twelfth International Conference on Learning Analytics & Knowledge (pp. 219–229). ACM Press. https://doi.org/10.1145/3506860.3506895

Fernandez-Nieto, G. M., Echeverria, V., Buckingham Shum, S., Mangaroska, K., Kitto, K., Palominos, E., Axisa, C., & Martinez-Maldonado, R. (2021). Storytelling with learner data: Guiding student reflection on multimodal team data. IEEE Transactions on Learning Technologies, 14(5), 695–708. https://doi.org/10.1109/TLT.2021.3131842

Huang, K., Bryant, T., & Schneider, B. (2019). Identifying collaborative learning states using unsupervised machine learning on eye-tracking, physiological and motion sensor data. In C. F. Lynch, A. Merceron, M. Desmarais, & R. Nkambou (Eds.), Proceedings of the 12th International Conference on Educational Data Mining (pp. 318–323).

Giannakos, M., & Cukurova, M. (2023). The role of learning theory in multimodal learning analytics. British Journal of Educational Technology, 54(5), 1246–1267. https://doi.org/10.1111/bjet.13320

Di Mitri, D., Schneider, J., Specht, M., & Drachsler, H. (2018). From signals to knowledge: A conceptual model for multimodal learning analytics. Journal of Computer Assisted Learning, 34(4), 338–349. https://doi.org/10.1111/jcal.12288

Fritzsch, B. (Ed.). (2021). The senses: A comprehensive reference (2nd Ed.). Elsevier.

Graesser, A. C., Greiff, S., Stadler, M., & Shubeck, K. T. (2020). Collaboration in the 21st century: The theory, assessment, and teaching of collaborative problem solving. Computers in Human Behavior, 104, Article 106134. https://doi.org/10.1016/j.chb.2019.09.010

Kim, Y.-J., & Yoo, J.-H. (2020). The utilization of debriefing for simulation in healthcare: A literature review. Nurse Education in Practice, 43, Article 102698. https://doi.org/10.1016/j.nepr.2020.102698

Kim, T., Chang, A., Holland, L., & Pentland, A. S. (2008). Meeting mediator: Enhancing group collaboration using sociometric feedback. In B. Begole & D. W. McDonald (General chairs), Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work (pp. 457–466). ACM Press. https://doi.org/10.1145/1460563.1460636

Ma, Y., Celepkolu, M., & Boyer, K. E. (2022). Detecting impasse during collaborative problem solving with multimodal learning analytics. In A. F. Wise, R. Martinez-Maldonado, & I. Hilliger (Program chairs), Learning Analytics for Transition, Disruption and Social Change: The Twelfth International Learning Analytics & Knowledge Conference (pp. 45–55). ACM Press. https://doi.org/10.1145/3506860.3506865

Mao, A., Mason, W., Suri, S., & Watts, D. J. (2016). An experimental study of team size and performance on a complex task. PLoS ONE, 11(4), Article e0153048. https://doi.org/10.1371/journal.pone.0153048

Madan, A., Caneel, R., & Pentland, A. S. (2004). GroupMedia: Distributed multi-modal interfaces. Proceedings of the 6th International Conference on Multimodal Interfaces (pp. 309–316). ACM Press. https://doi.org/10.1145/1027933.1027983

Malmberg, J., Järvelä, S., Holappa, J., Haataja, E., Huang, X., & Siipo, A. (2019). Going beyond what is visible: What multichannel data can reveal about interaction in the context of collaborative learning? Computers in Human Behavior, 96, 235–245. https://doi.org/10.1016/j.chb.2018.06.030

Martinez-Maldonado, R., Dimitriadis, Y., Martinez-Monés, A., Kay, J., & Yacef, K. (2013). Capturing and analyzing verbal and physical collaborative learning interactions at an enriched interactive tabletop. International Journal of Computer-Supported Collaborative Learning, 8(4), 455–485. https://doi.org/10.1007/s11412-013-9184-1

Martinez-Maldonado, R., Echeverria, V., Fernandez Nieto, G., & Buckingham Shum, S. (2020). From data to insights: A layered storytelling approach for multimodal learning analytics. In R. Bernhaupt, F. Mueller, D. Verwrij, J. Andres, J. McGrenere, A. Cockburn, I. Avellino, A. Goguey, P. Bjørn, S. Zhao, B. P. Samson, & R. Kocielnik (Chairs), CHI’20: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Paper 21). ACM Press. https://doi.org/10.1145/3313831.3376148

Martinez-Maldonado, R., Gašević, D., Echeverria, V., Fernandez Nieto, G., Swiecki, Z., & Buckingham Shum, S. (2021). What do you mean by collaboration analytics? A conceptual model. Journal of Learning Analytics, 8(1), 126–153. https://doi.org/10.18608/jla.2021.7227

Martinez-Maldonado, R., Kay, J., Buckingham Shum, S., & Yacef, K. (2019). Collocated collaboration analytics: Principles and dilemmas for mining multimodal interaction data. Human–Computer Interaction, 34(1), 1–50. https://doi.org/10.1080/07370024.2017.1338956

Martinez-Maldonado, R., Power, T., Hayes, C., Abdiprano, A., Vo, T., Axisa, C., & Buckingham Shum, S. (2017). Analytics meet patient manikins: Challenges in an authentic small-group healthcare simulation classroom. In A. F. Wise, P. H. Winne, G. Lynch, X. Ochoa, I. Molenaar, S. Dawson, & M. Hatala (Chairs), Understanding, Informing and Improving Learning with Data: The Seventh International Learning Analytics & Knowledge Conference (pp. 90–94). ACM Press. https://doi.org/10.1145/3027385.3027401

Marlow, S., Bisbey, T., Lacerenza, C., & Salas, E. (2018). Performance measures for health care teams: A review. Small Group Research, 49(3), 306–356. https://doi.org/10.1177/1046496417748196

Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G., & The PRISMA Group*. (2009). Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Medicine, 6(7), Article e1000097. https://doi.org/10.1371/journal.pmed.1000097

Muukkonen, H., Lakkala, M., Lahti-Nuuttila, P., Ilomäki, L., Karlgren, K., & Toom, A. (2020). Assessing the development of collaborative knowledge work competence: Scales for higher education course contexts. Scandinavian Journal of Educational Research, 64(7), 1071–1089. https://doi.org/10.1080/00313831.2019.1647284

Munn, Z., Peters, M. J. D., Stern, C., Tufanaru, C., McArthur, A., & Aromataris, E. (2018). Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach. BMC Medical Research Methodology, 18, Article 143. https://doi.org/10.1186/s12874-018-0611-x

Müller, P., Huang, M. X., & Bulling, A. (2018). Detecting low rapport during natural interactions in small groups from non-verbal behaviour. In S. Berkovsky, Y. Hijikata, J. Rekimoto, M. Burnett, M. Billinghurst, & A. Quigley (Chairs), IUI 2018: Proceedings of the 23rd International Conference on Intelligent User Interfaces (pp. 153–164). ACM Press. https://doi.org/10.1145/3172944.3172969

Nakano, Y. I., Nihonyanagi, S., Takase, Y., Hayashi, Y., & Okada, S. (2015). Predicting participation styles using co-occurrence patterns of nonverbal behaviours in collaborative learning. In Z. Zhang, P. Cohen, D. Bohus, R. Horaud, & H. Meng (Eds.), ICMI’15: Proceedings of the 2015 ACM International Conference on Multimodal Interaction (pp. 91–98). ACM Press. https://doi.org/10.1145/2818346.2820764

Neubauer, C., Woolley, J., Khooshabeh, P., & Scherer, S. (2016). Getting to know you: A multimodal investigation of team behavior and resilience to stress. In Y. I. Nakano, E. André, T. Nishida, L.-P. Morency, C. Busso, & C. Pelachaud (Chairs), ICMI’16: Proceedings of the 18th ACM International Conference on Multimodal Interaction (pp. 193–200). ACM Press. https://doi.org/10.1145/2993148.2993195

Noroozi, O., Alikhani, I., Järvelä, S., Kirschner, P. A., Juuso, I., & Seppänen, T. (2019). Multimodal data to design visual learning analytics for understanding regulation of learning. Computers in Human Behavior, 100, 298–304. https://doi.org/10.1016/j.chb.2018.12.019

Noël, R., Miranda, D., Cechinel, C., Riquelme, F., Primo, T. T., & Munoz, R. (2022). Visualizing collaboration in teamwork: A multimodal learning analytics platform for non-verbal communication. Applied Sciences, 12(15), Article 7499. https://doi.org/10.3390/app12157499

Ochoa, X., Chiluiza, K., Granda, R., Falcones, G., Castells, J., & Guamán, B. (2018). Multimodal transcript of face-to-face group-work activity around interactive tabletops. In A. Pardo, K. Bartimote, G. Lynch, S. Buckingham Shum, R. Ferguson, A. Merceron, & X. Ochoa (Eds.), Companion Proceedings of the 8th International Conference on Learning Analytics & Knowledge. SoLAR.

Olsen, J. K., Sharma, K., Rummel, N., & Aleven, V. (2020). Temporal analysis of multimodal data to predict collaborative learning outcomes. British Journal of Educational Technology, 51(5), 1527–1547. https://doi.org/10.1111/bjet.12982

Ouhaichi, H., Spikol, D., & Vogel, B. (2021). MBOX: Designing a flexible IoT multimodal learning analytics system. In M. Chang, N.-S. Chen, D. G. Sampson, & A. Tlili (Eds.), 2021 International Conference on Advanced Learning Technologies (ICALT) (pp. 122–126). IEEE. https://doi.org/10.1109/ICALT52272.2021.00044

Oviatt, S., & Cohen, A. (2014). Written activity, representations and fluency as predictors of domain expertise in mathematics. In A. A. Salah, J. Cohn, B. Schuller, O. Aran, L.-P. Morency, & P. R. Cohen (Chairs), ICMI’14: Proceedings of the 2014 International Conference on Multimodal Interaction (pp. 10–17). ACM Press. https://doi.org/10.1145/2663204.2663245

Praharaj, S., Scheffel, M., Drachsler, H., & Specht, M. (2021). Literature review on co-located collaboration modeling using multimodal learning analytics: Can we go the whole nine yards? IEEE Transactions on Learning Technologies, 14(3), 367–385. https://doi.org/10.1109/TLT.2021.3097766

Peng, S., & Nagao, K. (2021). Recognition of students’ mental states in discussion based on multimodal data and its application to educational support. IEEE Access, 9, 18235–18250. https://doi.org/10.1109/ACCESS.2021.3054176

Praharaj, S., Scheffel, M., Drachsler, H., & Specht, M. (2018). Multimodal analytics for real-time feedback in co-located collaboration. In V. Pammer-Schindler, M. Pérez-Sanagustín, H. Drachsler, R. Elferink, & M. Scheffel (Eds.), Lifelong technology-enhanced learning: 13th European conference on technology enhanced learning, EC-TEL 2018, Leeds, UK, September 3–5, 2018, proceedings (pp. 187–201). Springer Cham. https://doi.org/10.1007/978-3-319-98572-5_15

Quek, F., McNeill, D., Bryll, R., Duncan, S., Ma, X.-F., Kirbas, C., McCullough, K. E., & Ansari, R. (2002). Multimodal human discourse: Gesture and speech. ACM Transactions on Computer–Human Interaction, 9(3), 171–193. https://doi.org/10.1145/568513.568514

Reilly, J. M., & Schneider, B. (2019). Predicting the quality of collaborative problem solving through linguistic analysis of discourse. In C. F. Lynch, A. Merceron, M. Desmarais, & R. Nkambou (Eds.), Proceedings of the 12th International Conference on Educational Data Mining (EDM 2019) (pp. 149–157). International Educational Data Mining Society. https://educationaldatamining.org/edm2019/proceedings/

Reimann, P., Markauskaite, L., & Bannert, M. (2014). e-Research and learning theory: What do sequence and process mining methods contribute? British Journal of Educational Technology, 45(3), 528–540. https://doi.org/10.1111/bjet.12146

Roschelle, J., & Teasley, S. D. (1995). The construction of shared knowledge in collaborative problem solving. In C. O’Malley (Ed.), Computer supported collaborative learning (pp. 69–97). NATO ASI Series, vol 128. Springer. https://doi.org/10.1007/978-3-642-85098-1_5

Scherer, S., Weibel, N., Morency, L.-P., & Oviatt, S. (2012). Multimodal prediction of expertise and leadership in learning groups. MLA’12: Proceedings of the 1st International Workshop on Multimodal Learning Analytics (Article 1). ACM Press. https://doi.org/10.1145/2389268.2389269

Schneider, B., & Pea, R. (2015). Does seeing one another’s gaze affect group dialogue? A computational approach. Journal of Learning Analytics, 2(2), 107–133. https://doi.org/10.18608/jla.2015.22.9

Schneider, B., Sung, G., Chng, E., & Yang, S. (2021). How can high-frequency sensors capture collaboration? A review of the empirical links between multimodal metrics and collaborative constructs. Sensors, 21(24), Article 8185. https://doi.org/10.3390/s21248185

Sinclair, A. J., & Schneider, B. (2021). Linguistic and gestural coordination: Do learners converge in collaborative dialogue? In S. Hsiao, S. Sahebi, F. Bouchet, & J.-J. Vie (Eds.), Proceedings of the 14th International Conference on Educational Data Mining (pp. 431–438). International Educational Data Mining Society. https://files.eric.ed.gov/fulltext/ED615472.pdf

Spikol, D., Ruffaldi, E., & Cukurova, M. (2017). Using multimodal learning analytics to identify aspects of collaboration in project-based learning. In B. K. Smith, M. Borge, E. Mercier, & K. Y. Lim (Eds.), Making a difference: Prioritizing equity and access in CSCL, 12th International Conference on Computer Supported Collaborative Learning (CSCL) 2017 (pp. 263–270). International Society of the Learning Sciences. https://repository.isls.org/handle/1/240

Spikol, D., Ruffaldi, E., Landolfi, L., & Cukurova, M. (2017). Estimation of success in collaborative learning based on multimodal learning analytics features. In M. Chang, N.-S. Chen, R. Huang, Kinshuk, D. G. Sampson, & R. Vasiu (Eds.), The 17th IEEE International Conference on Advanced Learning Technologies (ICALT 2017) (pp. 269–273). IEEE. https://doi.org/10.1109/ICALT.2017.122

Spikol, D., Ruffaldi, E., Dabisias, G., & Cukurova, M. (2018). Supervised machine learning in multimodal learning analytics for estimating success in project-based learning. Journal of Computer Assisted Learning, 34(4), 366–377. https://doi.org/10.1111/jcal.12263

Stewart, A. E. B., Keirn, Z., & D’Mello, S. K. (2021). Multimodal modeling of collaborative problem-solving facets in triads. User Modelling and User-Adapted Interaction, 31(4), 713–751. https://doi.org/10.1007/s11257-021-09290-y

Sturm, J., Herwijnen, O. H., Eyck, A., & Terken, J. (2007). Influencing social dynamics in meetings through a peripheral display. In K. Mase, D. Massaro, K. Takeda, D. Roy, & A. Potamianos (Chairs), ICMI’07: Proceedings of the Ninth International Conference on Multimodal Interfaces (pp. 263–270). ACM Press. https://doi.org/10/b3fcjc

Sun, C., Shute, V. J., Stewart, A., Yonehiro, J., Duran, N., & D’Mello, S. (2020). Towards a generalized competency model of collaborative problem solving, Computers & Education, 143, Article 103672. https://doi.org/10.1016/j.compedu.2019.103672

Theobald, E. J., Eddy, S. L., Grunspan, D. Z., Wiggins, B. L., & Crowe, A. J. (2017). Student perception of group dynamics predicts individual performance: Comfort and equity matter. PLoS ONE, 12(7), Article e0181336. https://doi.org/10.1371/journal.pone.0181336

Turk, M. (2014). Multimodal interaction: A review. Pattern Recognition Letters, 36, 189–195. https://doi.org/10.1016/j.patrec.2013.07.003

Viswanathan, S. A., & VanLehn, K. (2018). Using the tablet gestures and speech of pairs of students to classify their collaboration. IEEE Transactions on Learning Technologies, 11(2), 230–242. https://doi.org/10/gds4fq

Vrzakova, H., Amon, M. J., Stewart, A., Duran, N. D., & D’Mello, S. K. (2020). Focused or stuck together: Multimodal patterns reveal triads’ performance in collaborative problem solving. In C. Rensing, H. Drachsler, V. Kovanović, N. Pinkwart, M. Scheffel, & K. Verbert (Chairs), Celebrating 10 years of LAK: Shaping the Future of the Field: The Tenth International Conference on Learning Analytics & Knowledge (pp. 295–304). ACM Press. https://doi.org/10.1145/3375462.3375467

Worsley, M., & Blikstein, P. (2014). Deciphering the practices and affordances of different reasoning strategies through multimodal learning analytics. In X. Ochoa, M. Worsley, K. Chiluiza, & S. Luz (Chairs), MLA’14: Proceedings of the 2014 ACM Multimodal Learning Analytics Workshop and Grand Challenge (pp. 21–27). ACM Press. https://doi.org/10.1145/2666633.2666637

Wise, A. F., Knight, S., & Buckingham Shum, S. (2021). Collaborative learning analytics. In U. Cress, C. Rosé, A. F. Wise & J. Oshima (Eds.), International handbook of computer-supported collaborative learning (pp. 425–443). Springer Cham. https://doi.org/10.1007/978-3-030-65291-3_23

Zhao, L., Swiecki, Z., Gašević, D., Yan, L., Dix, S., Jaggard, H., Wotherspoon, R., Osborne, A., Li, X., Alfredo, R., & Martinez-Maldonado, R. (2023). METS: Multimodal learning analytics of embodied teamwork learning. In I. Hilliger, H. Khosravi, B. Rienties, & S. Dawson (Chairs), Towards Trustworthy Learning Analytics: The Thirteenth International Conference on Learning Analytics & Knowledge (pp. 186–196). ACM Press. https://doi.org/10.1145/3576050.3576076

Zhao, L., Yan, L., Gašević, D., Dix, S., Jaggard, H., Wotherspoon, R., Alfredo, R., Li, X., & Martinez-Maldonado, R. (2022). Modelling co-located team communication from voice detection and positioning data in healthcare simulation. In A. F. Wise, R. Martinez-Maldonado, & I. Hilliger (Chairs), Learning Analytics for Transition, Disruption and Social Change: The Twelfth International Conference on Learning Analytics & Knowledge (pp. 370–380). ACM Press. https://doi.org/10.1145/3506860.3506935

Downloads

Published

2025-06-10

How to Cite

Esterhazy, R., Kaliisa, R., Sanchez, D., Langford, M., & Damsa, C. (2025). Advancing Multimodal Collaboration Analytics: A Scoping Review. Journal of Learning Analytics, 12(2), 105-124. https://doi.org/10.18608/jla.2025.8625