Towards an integration of text and graph clustering methods as a lens for studying social interaction in MOOCs

Authors

  • Diyi Yang Carnegie Mellon University
  • Miaomiao Wen Carnegie Mellon University
  • Abhimanu Kumar Carnegie Mellon University
  • Eric P. Xing Carnegie Mellon University
  • Carolyn Penstein Rose Carnegie Mellon University

DOI:

https://doi.org/10.19173/irrodl.v15i5.1853

Keywords:

online learning, MOOCs, Learning Analytics

Abstract

In this paper, we describe a novel methodology, grounded in techniques from the field of machine learning, for modeling emerging social structure as it develops in threaded discussion forums, with an eye towards application in the threaded discussions of massive open online courses (MOOCs). This modeling approach integrates two simpler, well established prior techniques, namely one related to social network structure and another related to thematic structure of text. As an illustrative application of the integrated technique’s use and utility, we use it as a lens for exploring student dropout behavior in three different MOOCs. In particular, we use the model to identify twenty emerging subcommunities within the threaded discussions of each of the three MOOCs. We then use a survival model to measure the impact of participation in identified subcommunities on attrition along the way for students who have participated in the course discussion forums of the three courses. In each of three MOOCs we find evidence that participation in two to four subcommunities out of the twenty is associated with significantly higher or lower dropout rates than average. A qualitative post-hoc analysis illustrates how the learned models can be used as a lens for understanding the values and focus of discussions within the subcommunities, and in the illustrative example to think about the association between those and detected higher or lower dropout rates than average in the three courses. Our qualitative analysis demonstrates that the patterns that emerge make sense: It associates evidence of stronger expressed motivation to actively participate in the course as well as evidence of stronger cognitive engagement with the material in subcommunities associated with lower attrition, and the opposite in subcommunities associated with higher attrition. We conclude with a discussion of ways the modeling approach might be applied, along with caveats from limitations, and directions for future work.

Author Biographies

Diyi Yang, Carnegie Mellon University

Diyi Yang is a Master's student in the Master's of Language Technologies program at the Language Technologies Institute in the School of Computer Science at Carnegie Mellon University.  Her research focuses on Learning Analytics and Social Recommendation in MOOCs.

Miaomiao Wen, Carnegie Mellon University

Miaomiao Wen is a PhD student in the Language and Information Technologies PhD program at the Language Technologies Institute in the School of Computer Science at Carnegie Mellon University.  Her research focuses on Learning Analytics and Discrouse Analysis in MOOCs.

Abhimanu Kumar, Carnegie Mellon University

Abhimanyu Kumar is a Master's student in the Master's of Language Technologies program at the Language Technologies Institute in the School of Computer Science at Carnegie Mellon University.  His research focuses on Machine Learning and Probabilistic Graphical Models.

Eric P. Xing, Carnegie Mellon University

Dr. Xing is an Associate Professor with a joint appointment between the Language Technologies Institute and the Machine Learning Department in the School of Computer Science at Carnegie Mellon University.  His research program is focused on development of highly scalable probabilistic graphical modeling approaches.

Carolyn Penstein Rose, Carnegie Mellon University

Dr. Rose is an Associate Professor with a joint appointment between the Language Technologies Institute and the Human-Computer Interaction Institute in the School of Computer Science at Carnegie Mellon University.  Her research program is focused on better understanding the social and pragmatic nature of conversation, and using this understanding to build computational systems that can improve the efficacy of conversation between people, and between people and computers.

Published

2014-10-06

How to Cite

Yang, D., Wen, M., Kumar, A., Xing, E. P., & Rose, C. P. (2014). Towards an integration of text and graph clustering methods as a lens for studying social interaction in MOOCs. The International Review of Research in Open and Distributed Learning, 15(5). https://doi.org/10.19173/irrodl.v15i5.1853