Hamed Alhoori

I am an Associate Professor in the Computer Science Department at Northern Illinois University and the Director of the Data Analytics Theory and Applications (DATA) Laboratory. Our research bridges artificial intelligence, large language models, machine learning, text mining, and data science to address complex challenges in scientific discovery, societal impact, and evidence-based decision-making. Our work spans AI for Science and the Science of Science, where we develop models that improve research reproducibility, uncover patterns in scholarly data, and predict the impact of scientific work. Our innovative approaches help bridge the gap between academic research and public understanding by analyzing sentiment, engagement, and the spread of scientific information. In the areas of AI Safety and AI in Engineering, we develop fair and reliable systems for real-world deployment. Among our projects, we collaborate with Argonne National Laboratory and Spirit AeroSystems, soon to be part of Boeing, to create AI-powered inspection systems for improved manufacturing safety and efficiency. Our research, supported by federal agencies and industry partners, appears in leading academic venues and has earned multiple awards, reflecting our dedication to advancing responsible AI applications that benefit science, industry, and society.

Acknowledgments: My research has been supported by NSF, DOE, ANL, NIU, TAMU, UOB, QNRF, and ADHO. Thank you!

Interests

  • Generative AI (genAI)
  • Large language models (LLMs)
  • Data Science
  • Science of Science
  • Machine Learning
  • Text Mining
  • Social Media Mining
  • Computational Social Science

Education

  • PhD in Computer Science

    Texas A&M University, College Station, Texas

  • M.S. in Computer Science

    Texas A&M University, College Station, Texas

  • BSc in Computer Science

    University of Bahrain

News

Research Assistant positions

I am looking for highly motivated and hard-working undergraduate and graduate (MS/PhD) students to work on exciting projects in large language models (LLMs), generative AI, machine learning, and computational social science.

Experience

Visiting Scientist

Argonne National Laboratory

2016, Illinois

Research Associate

Qatar University

2013 – 2014, Qatar

Research Assistant

Texas A&M University

2011 – 2015, Texas

Projects

Identifying Reproducible Research Using Human-in-the-loop Machine Learning

Create datasets, reproducibility metrics, and machine learning models that estimate a confidence level in the reproducibility of a published work.

Early indicators of scientific impact: Predicting citations with altmetrics

Identifying important scholarly literature at an early stage is vital to the academic research community and other stakeholders such as …

Evaluating the Effects of Acid Fracture Etching Patterns on Conductivity Estimation Using Machine Learning Techniques

The successful design of an acid fracture job requires accurate prediction of fractured well productivity. Productivity estimation …

Analyzing Twitter Bot Activity on Academic Articles

Given its ascendancy as a way to make connections worldwide, social media is affecting all areas of people’s lives. This paper focuses …

Data-Driven Acid Fracture Conductivity Correlations Honoring Different Mineralogy and Etching Patterns

Acid-fracturing operations are mainly applied in tight carbonate formations to create a highly conductive path. Estimating the …

Measuring the Diversity of Facebook Reactions to Research

Online and in the real world, communities are bonded together by emotional consensus around core issues. Emotional responses to …

Students

Ph.D. Students (Current and Former)

  1. Akhil Pandey Akella. (2024) Reproducibility, AI4Science, LLM. Research Scientist, CSSI, Kellogg School of Management, Northwestern University.
  2. Abdul Rahman Shaikh. Machine Learning, Computer Vision, Visual Analytics
  3. Harish Varma Siravuri. Knowledge Graph, LLM
  4. Murtuza Shahzad Syed. Concept Drift
  5. Miftahul Jannat Mokarrama. LLM, Public Policy, Social Data Science
  6. Venkata Devesh Reddy Seethi. Machine Learning, Computer Vision
  7. Ashiqur Rahman. Machine Learning, Visualization
  8. Dalia Khaizaran. LLM for science
  9. Adiba Ibnat Hossain. AI for science

M.S. Thesis Supervisor

  1. Rami Lake (Spring 2024) Machine Learning. AI Engineer at IT360
  2. Miftahul Jannat Mokarrama (Fall 2023) Public Policy, Reproducibility, Social Data Science
  3. Ashiqur Rahman (Summer 2022) Detecting COVID-19 Misinformation and Public Opinion on Covid-19 Vaccine
  4. Abdul Rahman Shaikh (Spring 2022) Modeling the Broader Impact of Science and Health Using Social Media
  5. Murtuza Shahzad Syed (Fall 2020) Development of Machine Learning Models to Predict the Online Impact of Research
  6. Cole Freeman (Spring 2020) The Emotions of Science: Using Social Media to Gauge Public Emotions Toward Research Topics
  7. Akhil Pandey Akella (Fall 2019) Using Machine Learning Models to Discover Promising Research
  8. Harish Varma Siravuri (Spring 2018) Assessment of Societal Impact of Research. Data Scientist at Nielsen

M.S. Thesis Committee Member

  1. Venkata Devesh Reddy Seethi (Fall 2020)
  2. Manohar Sai Jasti (Fall 2019) Data Scientist at Kaizen Analytix
  3. Mrinal Kanti Roy (Fall 2019) Software Engineer at Coyote Logistics
  4. Vishrant Krishna Gupta (Fall 2018) Software Engineer III at Groupon
  5. Ashli Fain (Fall 2018) Software Engineer at American Express
  6. Eric Lavin (Spring 2018) Data Scientist at Allstate
  7. Bharat Kale (Spring 2018) PhD student in Data Visualization

M.S. Students Advising (Semester-Long Research Project)

  1. Ashiqur Rahman (Spring 2020 – Summer 2020) Research Assistant
  2. Srikanth Nagidi (Summer 2019 – Fall 2019)
  3. Pavan Kondamudi (Spring 2017 – Spring 2018) Data Scientist at Oracle
  4. Pradeep Maddipatla (Fall 2017 – Spring 2018)
  5. Vishal Panguru (Spring 2017) Data Engineer III at Anthem, Inc.
  6. Yaswanth Vayalpati (Fall 2017) Developer at Mastech Digital
  7. Brian Homerding (Spring 2017) Engineer at Argonne National Laboratory
  8. Jagadeesh Vinnakota (Fall 2016) AI Consultant at Home Depot
  9. Saiteja Yagni (Fall 2016) Data Engineer at Cloudwick
  10. Reajeswari Gundu (Fall 2016) Full stack developer at UCLA Health
  11. Kartheek Chintalapati (Fall 2016) Developer at K-Rise Systems
  12. Sai Krishna Vemuri (Fall 2016) Engineer at HERE Technologies
  13. Aparajita Kamath (Fall 2016) Senior Software Developer at Westpac
  14. Wesam Alruwaili (Summer 2016 – Fall 2016) Instructor at Jouf University
  15. Himanshu Verma (Summer 2016 – Fall 2016) AI Engineer at Ford Motor Company
  16. Kavya Devarapally (Summer 2016) Engineer at Cerner Corporation
  17. Avinash Chirumamilla (Summer 2016) Software Engineer at Microsoft

Undergraduate

  1. Enrique Nueve (Fall 2018 – Spring 2020) Machine Learning Researcher at Argonne National Laboratory
  2. Ethan Pitre (Summer 2019) Software Developer at Epic
  3. Sahithi Challapalli (Fall 2018 – Spring 2019) Research rookie
  4. Aleena Ahmed (Fall 2018) Business Intelligence Intern at Zebra
  5. Luis Arredondo (Fall 2017 – Spring 2018) Honors capstone project. “A Study of Altmetrics Using Sentiment Analysis,” MS in CS at UIC
  6. Joseph McDade (Fall 2017 – Spring 2018) Honors capstone project. “Can We Predict Reproducible Scholarly Research?” Consultant at Red Hat
  7. Olsi Shehu (Fall 2017) Research rookie. Consultant at NxT Team
  8. Shawn Dust (Summer 2017 – Fall 2017) Software Engineer at TransUnion
  9. Justin Bradley (Summer 2017)
  10. James Bonasera (Summer 2017) Campus Innovator at Discover
  11. Eric Youngberg (Spring 2017) Honors capstone project, “Improving Speech and Speaker Recognition For Multi-Speaker Conversations,” Software Developer at Sasaki
  12. Bradley Protano (Spring 2017) Software Engineer at Discover
  13. Jamieson Walker (Fall 2016 – Spring 2017) Software Engineer at Broadcom Inc.
  14. Jonathan Gaff (Spring 2016 – Summer 2016) Engineer at University of Chicago
  15. Christian Bailey (Summer 2016 – Summer 2017)
  16. Alexandre Sopha (Summer 2016) Software Engineer at Capital One

Teaching

  • CSCI 680: Large Language Models (Spring 2025)
  • CSCI 637: Pattern Recognition and Data Mining II (Spring 2024)
  • CSCI 636: Pattern Recognition and Data Mining I (Fall 2024)
  • CSCI 490/642: Information Storage and Retrieval (Spring 2023)
  • CSCI 490/641: Big Data Analytics (Fall 2019, Fall 2018, Spring 2018)
  • CSCI 490/680: Data Science and Analytics (Spring 2017)
  • CSCI 490/680: Mining Massive Datasets (Spring 2016)
  • CSCI 630: Computer Networks (Fall 2017, Fall 2016)
  • CSCI 490/512: Computer Networks (Spring 2020, Fall 2019, Spring 2018)
  • CSCI 340: Data Structures and Algorithm Analysis (Fall 2018, Spring 2018)

Service

Editorial Board

  • International Journal on Digital Libraries (IJDL) (2020 – present)

Program Committee Member

  • The International AAAI Conference on Web and Social Media (ICWSM) 2018, 2019, 2020, 2021, 2022, 2023, 2024
  • ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023, 2024
  • The International ACM Conference on Web Science 2020, 2021, 2022, 2023, 2024, 2025
  • Empirical Methods in Natural Language Processing (EMNLP) 2021
  • IEEE International Conference on Multimedia Big Data 2021
  • International Conference on Theory and Practice of Digital Libraries (TPDL) 2017, 2018, 2019, 2020, 2021
  • The annual conference of the Asia-Pacific chapter of the Association for Computational Linguistics (AACL) 2020
  • International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (KMIS) 2020
  • Second Workshop on Scholarly Document Processing (SDP) at NAACL 2021
  • ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR) 2018 Doctoral Consortium
  • The International Conference on Social Informatics (SocInfo) 2019, 2020
  • The International Conference on Advanced Collaborative Networks, Systems and Applications (COLLA) 2018, 2019
  • The ACM International Conference on Information and Knowledge Management (CIKM) 2018
  • Southern Data Science Conference 2019
  • The Eleventh International Conference on Creative Content Technologies (CONTENT) 2019
  • Workshop on Altmetrics for Research Outputs Measurement and Scholarly Information Management (AROSIM) 2018
  • Workshop “Scholarly Big Data: AI Perspectives, Challenges, and Ideas” at the International Joint Conference on Artificial Intelligence 2016

Reviewer

  • ACL/IJCNLP 2021
  • Computer Supported Cooperative Work (CSCW) 2021
  • BMC Medical Informatics and Decision Making 2021
  • Expert Systems with Applications 2020
  • Journal of Informetrics 2020
  • Journal of Network and Computer Applications 2019
  • Journal of the Association for Information Science and Technology (JASIST) 2017, 2018
  • International Journal on Digital Libraries (IJDL) 2015, 2017, 2018, 2019
  • Social Network Analysis and Mining (SNAM) 2017, 2018
  • iConference 2015, 2017, 2018, 2019, 2020, 2021
  • Scientometrics 2018
  • PLOS ONE 2017, 2020

Conference Session Chair

  • “Scholarly Documents,” JCDL 2019, University of Illinois at Urbana-Champaign.
  • “High Performance towards Big Data,” CIKM 2016: The 25th ACM International Conference on Information and Knowledge Management.

Internal Service

  • Member, Personnel Committee (Fall 2021 – present)
  • Member, College Council (Fall 2021 – present)
  • Chair, Colloquium Committee (Fall 2018 – Spring 2020)
  • Member, Undergraduate Studies Committee (Fall 2018 – present)
  • Member, Colloquium Committee (Fall 2015 – Spring 2018)
  • Member, Graduate Studies Committee (Fall 2016 – Spring 2019)
  • Member, Advisory Committee (Fall 2017 – Spring 2018)
  • Member, Grade Review Board Committee (Fall 2017 – Spring 2019)
