Asee peer logo

Leveraging Machine Learning Techniques to Analyze Computing Persistence in Undergraduate Programs

Download Paper |

Conference

2020 ASEE Virtual Annual Conference Content Access

Location

Virtual On line

Publication Date

June 22, 2020

Start Date

June 22, 2020

End Date

June 26, 2021

Conference Session

Computing and Information Technology Division Technical Session 1

Tagged Division

Computing and Information Technology

Page Count

15

DOI

10.18260/1-2--34921

Permanent URL

https://sftp.asee.org/34921

Download Count

632

Request a correction

Paper Authors

biography

Leila Zahedi Florida International University Orcid 16x16 orcid.org/0000-0002-7325-1025

visit author page

Leila Zahedi is a Ph.D. student in the School of Computing and Information Science (SCIS) at Florida
International University. Her main focus is on educational data science and machine learning. Her current research focuses on broadening participation in computing fields in order to attract minorities.

visit author page

biography

Stephanie J. Lunn Florida International University Orcid 16x16 orcid.org/0000-0003-3840-1822

visit author page

Stephanie J. Lunn is a Ph.D. candidate in the School of Computing and Information Sciences at Florida International University (FIU). Her research interests span the fields of computing education, human computer interaction, data science, and machine learning. Previously, Stephanie received her B.S. and M.S. degrees in Neuroscience from the University of Miami, in addition to a B.S. degree in Computer Science from FIU.

visit author page

biography

Samira Pouyanfar Microsoft

visit author page

Samira Pouyanfar earned her Ph.D. in Computer Science from Florida International University, Miami, USA in 2019. She received a Master degree in Artificial Intelligence from Sharif University of Technology in 2012 and a Bachelor degree in Computer Engineering from University of Isfahan in 2009. Her research interests include Artificial Intelligence, data science, machine learning, deep learning, and big data. She has published over 30 research papers in international journals and conference proceedings. She is currently working as a data scientist at Microsoft Corporation in Seattle, Washington.

visit author page

biography

Monique S. Ross Florida International University Orcid 16x16 orcid.org/0000-0002-6320-636X

visit author page

Monique Ross earned a doctoral degree in Engineering Education from Purdue University. She has a Bachelor’s degree in Computer Engineering from Elizabethtown College, a Master’s degree in Computer Science and Software Engineering from Auburn University, eleven years of experience in industry as a software engineer, and three years as a full-time faculty in the departments of computer science and engineering. Her interests focus on broadening participation in engineering through the exploration of: 1) race, gender, and identity in the engineering workplace; 2) discipline-based education research (with a focus on computer science and computer engineering courses) in order to inform pedagogical practices that garner interest and retain women and minorities in computer-related engineering fields. 

visit author page

biography

Matthew W. Ohland Purdue University, West Lafayette Orcid 16x16 orcid.org/0000-0003-4052-1452

visit author page

Matthew W. Ohland is Associate Head and Professor of Engineering Education at Purdue University. He has degrees from Swarthmore College, Rensselaer Polytechnic Institute, and the University of Florida. His research on the longitudinal study of engineering students, team assignment, peer evaluation, and active and collaborative teaching methods has been supported by the National Science Foundation and the Sloan Foundation and his team received for the best paper published in the Journal of Engineering Education in 2008, 2011, and 2019 and from the IEEE Transactions on Education in 2011 and 2015. Dr. Ohland is an ABET Program Evaluator for ASEE. He was the 2002–2006 President of Tau Beta Pi and is a Fellow of the ASEE, IEEE, and AAAS.

visit author page

Download Paper |

Abstract

Student retention in computing fields remains one of the most significant concerns for science, technology, engineering, or mathematics careers. Looking further at computing fields, it is clear that enrollment in such programs has not kept pace with the industry demands. This is in part due to retention challenges in computer science. Retention refers to the number of students who start their career at a college and will graduate. Many factors may play a role in a students’ decision to drop out. Thus, finding meaningful patterns from historical data can help education researchers reveal the possible reasons for students’ withdrawal from a university and provide guidelines and mechanisms that lead to improving retention rates in such fields. Since pathway patterns for computing students are different from engineering students, it is crucial to specifically explore computing fields in order to find ways to increase the retention rate for such majors. To achieve this goal, this study is conducted on a large longitudinal database - Multiple-Institution Database for Investigating Engineering Longitudinal Development (MIDFIELD) to determine the importance of different factors (features) in the graduation of computing students. The results of this work provides a predictive model of graduation, across multiple U.S. institutions leveraging various machine learning (ML) techniques (including: random forest, support vector machine, naïve bayes, and decision tree classification) to find the most important factors, as well as the most accurate predictive model for graduation. Participants of this study included approximately 40,000 students who enrolled in computing majors and had the opportunity to graduate within six years. The identified students, at some point, were enrolled in one of the following computing disciplines; namely computer engineering, software engineering, computer science, computer programming, computing and information sciences, and information technology. Drawing on Astin’s inputs-environment-outcome (I-E-O) model, we first explored the importance of each feature on students’ graduation. According to the model, student outcomes (e.g., graduation) are dependent on two types of factors, including inputs (e.g., demographic features) and environment (e.g., experiences in college). Results suggest that educational variables, including cumulative GPA, number of terms registered, being a transfer student, and the institution, are the most important features, respectively. Results also showed that random forest produced a more accurate result on this dataset compared to other predictive models. We anticipate findings from this ongoing research will give insight to the computing education community and researchers to better understand the relative success of computing students, in turn providing strategic solutions to attain higher retention rates.

Zahedi, L., & Lunn, S. J., & Pouyanfar, S., & Ross, M. S., & Ohland, M. W. (2020, June), Leveraging Machine Learning Techniques to Analyze Computing Persistence in Undergraduate Programs Paper presented at 2020 ASEE Virtual Annual Conference Content Access, Virtual On line . 10.18260/1-2--34921

ASEE holds the copyright on this document. It may be read by the public free of charge. Authors may archive their work on personal websites or in institutional repositories with the following citation: © 2020 American Society for Engineering Education. Other scholars may excerpt or quote from these materials with the same citation. When excerpting or quoting from Conference Proceedings, authors should, in addition to noting the ASEE copyright, list all the original authors and their institutions and name the host city of the conference. - Last updated April 1, 2015