Research Interests
- Artificial intelligence: applied machine learning, applied artificial intelligence, large language models
- Data science: applied data science, text mining, innovation measurement, social media analysis
- Data quality: data quality evaluation and improvement in machine learning, data-centric AI
- Natural language processing: information retrieval, recommender systems, legal NLP
- Informatics: health informatics, legal informatics
Education
- Ph.D. in Information Science with a concentration in Data Science, 2022
- University of North Texas, USA
- M.S. in Information Science, 2017
- Wuhan University, China
- B.S. in Information Science & B.S. in English Literature, 2014
- Central China Normal University, China
Funded Research Grants
- External Grants:
- [EG1] 2023-2025. Senior Personnel (PI: Dr. Ting Xiao). NSF REU Site “Beyond Language: Training to Create and Share Vector Embeddings across Applications”, National Science Foundation. Award Amount: $403,547.
- [EG2] 2022-2025. Co-PI (PI: Dr. Junhua Ding). NSF HSI Implementation and Evaluation Project “Developing a High-Quality Academic Environment for Broadening Participation of Hispanic Students in Computing”, National Science Foundation. Award Amount: $499,608.
- Internal Grants:
- [IG1] 2024. PI. UNT Junior Faculty Summer Research Grant, University of North Texas. Award Amount: $5,000.
- [IG2] 2024. PI. UNT COI Seed Funding “Utilizing AI/ML to Enhance Personalized Health Information Services for Hispanic Populations during Disaster Recovery”, College of Information, University of North Texas. Award Amount: $5,000.
- [IG3] 2022. PI. UNT COI Seed Funding “Towards a large-scale and high-quality corpus for legal argument mining”, College of Information, University of North Texas. Award Amount: $9,975.
Selected Publications
- (*) indicates the corresponding author. IF is based on JCR 2023.
- Journal Articles: [J25] Wang, Z., Zhang, H., Chen, J., & Chen, H.* (2024). An Effective Framework for Measuring the Novelty of Scientific Articles through Integrated Topic Modeling and Cloud Model. Journal of Informetrics, 18(4), 101587. IF = 3.4
- Conference Papers: [C20] Ying, D., Yu, F., Chen, H., & Lu, W. (2024). Fine-Grained, Accurate Data Generation and Multimodal Layout Analysis for Academic Papers. In Proceedings of the 2024 ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2024), just accepted. ACM.
- For the full list of publications, please refer to my Google Scholar or the publications listed on the UNT IDEA Lab website!
[J24] Wang, Z., Zhang, H., Chen, H.*, Feng, Y., & Ding, J. (2024). Content-based quality evaluation of scientific papers using coarse feature and knowledge entity network. Journal of King Saud University-Computer and Information Sciences, 102119. IF = 5.2
[J23] Zhao, H., Chen, H., Ruggles, T. A., Feng, Y., Singh, D., & Yoon, H. J. (2024). Improving Text Classification with Large Language Model-Based Data Augmentation. Electronics, 13(13), 2535. IF = 2.6
[J22] Tu, F., Wu, L., Kinshuk, Ding, J., & Chen, H.* (2024). Exploring the Influence of Regulated Learning Processes on Learners' Prestige in Project-Based Collaborative Learning. Education and Information Technologies, just accepted. IF = 5.5
[J21] Chen, H.*, Kim, J., Chen, J., & Sakata, A. (2024). Demystifying oral history with natural language processing and data analytics: A case study of the Densho Digital Collection. The Electronic Library, ahead-of-print. IF = 2.42
[J20] Huang, J., Chen, H., Yu, F., & Lu, W. (2024). From Detection to Application: Recent Advances in Understanding Scientific Tables and Figures. ACM Computing Surveys, ahead-of-print. IF = 16.6
[J19] Wang, Z., Qiao, X., Chen, J., Li, L., Zhang, H., Ding, J., & Chen, H.* (2024). Exploring and evaluating the index for interdisciplinary breakthrough innovation detection. The Electronic Library, ahead-of-print. IF = 2.42
[J18] Nguyen, H., Chen, H., Chen, J., Kargozari, K., & Ding, J. (2023). Construction and Evaluation of a Domain-Specific Knowledge Graph for Knowledge Discovery. Information Discovery & Delivery. IF = 2.245
[J17] Wang, Z., Chen, J., Chen, J., & Chen, H.* (2023). Identifying interdisciplinary topics and their evolution based on BERTopic. Scientometrics, 1-26. IF = 4.1
[J16] Wang, Z., Peng, S., Chen, J., Kapasule, A. G., & Chen, H.* (2023). Detecting interdisciplinary semantic drift for knowledge organization based on normal cloud model. Journal of King Saud University-Computer and Information Sciences, 35(6), 101569. IF = 6.9
[J15] Wang, Z., Peng, S., Chen, J., Zhang, X., & Chen, H.* (2023). ICAD-MI: Interdisciplinary concept association discovery from the perspective of metaphor interpretation. Knowledge-Based Systems, 275, 110695. IF = 8.8
[J14] Zhang, L., Lu, W., Chen, H., Huang, Y., & Cheng, Q. (2022). A comparative evaluation of biomedical similar article recommendation. Journal of Biomedical Informatics, 131, 104106. IF = 8.00
[J13] Chen, H., Pieptea, L. F., & Ding, J. (2022). Construction and Evaluation of a High-Quality Corpus for Legal Intelligence Using Semiautomated Approaches. IEEE Transactions on Reliability, 71(2), 657-673. IF = 5.9
[J12] Chen, H., Wu, L., Chen, J., Lu, W., & Ding, J. (2022). A comparative study of automated legal text classification using random forests and deep learning. Information Processing & Management, 59(2), 102798. IF = 8.6
[J11] Chen, H.*, Nguyen, H., & Alghamdi, A. (2022). Constructing a high-quality dataset for automated creation of summaries of fundamental contributions of research articles. Scientometrics, 127(12), 7061-7075. IF = 4.1
[J10] Wang, Z., Wang, K., Liu, J., Huang, J., & Chen, H.* (2022). Measuring the innovation of method knowledge elements in scientific literature. Scientometrics, 127(5), 2803-2827. IF = 4.1
[J9] Zhang, Y., Zhao, R., Wang, Y., Chen, H., Mahmood, A., Zaib, M., ... & Sheng, Q. Z. (2022). Towards employing native information in citation function classification. Scientometrics, 1-21. IF = 4.1
[J8] Chen, H., Chen, J., & Ding, J. (2021). Data evaluation and enhancement for quality improvement of machine learning. IEEE Transactions on Reliability, 70(2), 831-847. IF = 5.9
[J7] Tran, N., Chen, H., Jiang, J., Bhuyan, J., & Ding, J. (2021). Effect of Class Imbalance on the Performance of Machine Learning-based Network Intrusion. International Journal of Performability Engineering, 17(9), 741-755. IF = 1.14
[J6] Chen, H.* (2021). A New Citation Recommendation Strategy Based on Term Functions in Related Studies Section. Journal of Data and Information Science, 6(3), 75-98. IF = 1.889
[J5] Chen, H., Chen, J., & Nguyen, H. (2021). Demystifying COVID-19 publications: institutions, journals, concepts, and topics. Journal of the Medical Library Association: JMLA, 109(3), 395. IF = 3.18
[J4] Chen, H., Yang, Y., Lu, W., & Chen, J. (2020). Exploring multiple diversification strategies for academic citation contexts recommendation. The Electronic Library, 38(4), 821-842. IF = 2.42
[J3] Tang, M., Chen, J., Chen, H., Xu, Z., Wang, Y., Xie, M., & Lin, J. (2020). An ontology-improved vector space model for semantic retrieval. The Electronic Library, 38(5/6), 919-942. IF = 2.42
[J2] Lu, W., Luo, M., Zhang, Z., Zhang, G., Ding, H., Chen, H., & Chen, J. (2019). Result diversification in image retrieval based on semantic distance. Information Sciences, 502, 59-75. IF = 8.1
[J1] Zhang, Z., Chen, H.*, & Xiao, B. (2019). Understanding eWOM of Chinese Governments information service: a perceived value-based perspective. Information Discovery & Delivery, 47(4), 251-258. IF = 2.245
[C19] Ying, D., Yu, F., Chen, H., & Lu, W. (2024). DIG: Complex Layout Document Image Generation with Authentic-looking Text for Enhancing Layout Analysis. In Proceedings of the 32nd ACM International Conference on Multimedia (ACM Multimedia 2024), just accepted. ACM.
[C18] Zhou, Y., Tu, F., Sha, K., Ding, J., & Chen, H.* (2024). A Survey on Data Quality Dimensions and Tools for Machine Learning. In Proceedings of the 2024 IEEE International Conference On Artificial Intelligence Testing (AITest 2024) (pp. 120-131). IEEE.
[C17] Ding, J., Nguyen, H., & Chen, H. (2024). Evaluation of Question-Answering Based Text Summarization using LLM. In Proceedings of the 2024 IEEE International Conference On Artificial Intelligence Testing (AITest 2024) (pp. 142-149). IEEE.
[C16] Chen, H.*, Cherukuri, K., Zhu, X., & Yang, S. (2024). Are Prompts All You Need?: Chatting with ChatGPT on Disinformation Policy Understanding. In Proceedings of the 87th Annual Meeting of the Association for Information Science & Technology (ASIS&T 2024), just accepted.
[C15] Zhang, X., Chong, M., Hagen, L., & Chen, H.* (2024). A Framework for Assessing Country Reputation: Case Study of China during the COVID-19 Pandemic. In Proceedings of the 25th Annual International Conference on Digital Government Research (dg.o 2024) (pp. 1008-1010). ACM.
[C14] Kargozari, K., Ding, J., & Chen, H. (2023). Evaluating the Impact of Incentive/Non-incentive Reviews on Customer Decision-making. In Proceedings of the 2023 IEEE International Conference On Artificial Intelligence Testing (AITest 2023) (pp. 160-168). IEEE. (Best Paper Award)
[C13] Nguyen, H., Chen, H.*, Maganti, R., Hossain, K. T., & Ding, J. (2023). Measurement and Identification of Informative Reviews for Automated Summarization. In Proceedings of the 2023 IEEE International Conference On Artificial Intelligence Testing (AITest 2023) (pp. 146-151). IEEE.
[C12] Zhao, H., Chen, H., & Yoon, H. J. (2023). Enhancing Text Classification Models with Generative AI-aided Data Augmentation. In Proceedings of the 2023 IEEE International Conference On Artificial Intelligence Testing (AITest 2023) (pp. 138-145). IEEE. (Best Student Paper Award)
[C11] Ding, J., Chen, H., Kolapudi, S., Pobbathi, L., & Nguyen, H. (2023). Quality Evaluation of Summarization Models for Patent Documents. In Proceedings of the 2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security (QRS 2023) (pp. 250-259). IEEE.
[C10] Feng, Y., Vanam, S., Cherukupally, M., Zheng, W., Qiu, M., & Chen, H. (2023). Investigating Code Generation Performance of Chat-GPT with Crowdsourcing Social Data. In Proceedings of the 47th IEEE International Conference on Computers, Software, and Applications (COMPSAC 2023) (pp. 1-10). IEEE. (Best Paper Award Nominee)
[C9] Nguyen, H., Oladapo, L., Ali, I., Chen, H., & Chen, J. (2023). Fighting Misinformation: Where Are We and Where to Go?. In Proceedings of the International Conference on Information (iConference 2023) (pp. 371-394). Springer, Cham.
[C8] Chen, H.*, & Kanuboddu, B. N. (2021). A fine-grained annotation scheme for research contribution in academic literature. In Proceedings of the 18th International Conference on Scientometrics and Informetrics (pp. 241-248).
[C7] Chong, M., & Chen, H.* (2021). Racist Framing through Stigmatized Naming: A Topical and Geo‐locational Analysis of Chinavirus and Chinesevirus on Twitter. In Proceedings of the Association for Information Science and Technology, 58(1), 70-79.
[C6] Chen, H., Chen, J., & Ding, J. (2020). Data Evaluation and Enhancement for Quality Improvement of Machine Learning. In Proceedings of the 2020 IEEE 20th International Conference on Software Quality, Reliability and Security (QRS 2020), 13-13. IEEE. (Best Paper Award Nominee)
[C5] Chen, H., Cao, G., Chen, J., & Ding, J. (2019). A practical framework for evaluating the quality of knowledge graph. In Proceedings of the China Conference on Knowledge Graph and Semantic Computing (CCKS 2019) (pp. 111-122). Springer, Singapore.
[C4] Ding, J., Jin, W., & Chen, H. (2018, October). Regression-Based Documents Reranking for Precision Medicine. In Proceedings of the 2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE) (pp. 283-286). IEEE.
[C3] Chen, H., Ding, J., Chen, J., & Cao, G. (2018). Designing a novel framework for precision medicine information retrieval. In Proceedings of the International Conference on Smart Health (ICSH 2018) (pp. 167-178). Springer, Cham.
[C2] Chen, J., Chen, M., Qu, J., Chen, H., & Ding, J. (2018). Smart and connected health projects: Characteristics and research challenges. In Proceedings of the International Conference on Smart Health (ICSH 2018) (pp. 154-164). Springer, Cham.
[C1] Zhang, Q., Lu, W., Yang, Y., Chen, H., & Chen, J. (2017). Automatic identification of research articles containing data usage statements. In Knowledge Discovery and Data Design Innovation: Proceedings of the International Conference on Knowledge Management (ICKM 2017) (pp. 67-87).
Teaching
- Fall 2024: INFO 5731: Computational Methods for Information Systems (Face to face)
- Summer 2024: INFO 5810: Data Analysis and Knowledge Discovery (Online)
- Spring 2024: INFO 5731: Computational Methods for Information Systems (Face to face)
- Spring 2024: INFO 5506: Applications of Artificial Intelligence in Health (Face to face)
- For the full list of courses I have taught and their evaluations, please refer to my Previous Scheduled Teaching!
Invited Talks and Presentations
- [2024-07-11]: Deep Learning and Large Language Models for Software Engineering and Testing, Erik Jonsson School of Engineering and Computer Science, The University of Texas at Dallas.
- [2024-04-19]: Data Evaluation and Improvement for Machine Learning Systems, Erik Jonsson School of Engineering and Computer Science, The University of Texas at Dallas.
- [2022-11-16]: Informatics applications in multiple fields: A bibliometrics analysis, Department of Information Science, University of North Texas.
- [2022-10-22]: Measuring the novelty of scientific literature through contribution sentences analysis, 2022 Information Processing & Management Conference.
- [2022-05-31]: Fine-grained semantic analysis for academic data mining and evaluation, National Science Library, University of Chinese Academy of Sciences.
- [2022-03-03]: The applications of machine learning and natural language processing in healthcare and medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong.
- [2021-11-17]: Construction and evaluation of high-quality corpus for legal intelligence using semi-automated approaches, Department of Information Science, University of North Texas.
- [2021-10-24]: Building high-performance machine learning systems for domain-specific applications, School of Information Management, Sun Yat-sen University.
- Please feel free to contact me if you would like to invite me to give a research talk at your institute or group!
Professional Service
- Leadership:
- Chair, ASIS&T SIG-STI (Special Interest Group – Scientific and Technical Information), 2024-present.
- Journal Editor:
- Co-editor of The Electronic Library, 2022-present.
- Guest editor of Computer Standards & Interfaces special issue on "Applications of Generative AI", ongoing.
- Guest editor of Electronics special issue on "Intelligent Data and Information Processing", ongoing.
- Guest editor of Electronics special issue on "Applications of Deep Learning Techniques", 2023-2024.
- Guest editor of The Electronic Library special issue on "Innovation Measurement for Scientific Communication in the Era of Big Data", 2023-2024.
- Guest editor of Information Discovery & Delivery special issue on "Information and Data Quality for Intelligent Systems", 2022-2023.
- Guest editor of Frontiers in Big Data special issue on "Data Quality for Big Data and Machine Learning", 2022-2023.
- Editorial Board Member:
- Data Intelligence, 2023-present
- Frontiers in Research Metrics and Analytics, 2023-present
- Heliyon Information Science, 2022-present
- Conference Organizing Committee:
- Workshop and Tutorial Co-chair of the 24th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2024).
- Program Co-chair of the 6th IEEE International Conference on Artificial Intelligence Testing (IEEE AITest 2024).
- Local Chair of the 2nd IEEE Conference on Mobility: Operations, Services, and Technologies (IEEE MOST 2024).
- Workshops and Round Table Chair of the 18th International ISKO Conference (ISKO International Conference 2024).
- Website Chair of the 5th IEEE International Conference on Artificial Intelligence Testing (IEEE AITest 2023).
- Publicity Chair of the 18th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2018).
- Co-chair of Workshops:
- DQIS workshop @ IEEE QRS 2021, 2022, 2023, 2024.
- EEKE+AI Informatics workshop @ JCDL 2023, iConference 2024.
- IMSC workshop @ JCDL 2023.
- Journal reviewer:
- The Electronic Library (2017-present)
- IEEE Access (2018-present)
- Information Discovery and Delivery (2018-present)
- Scientometrics (2021-present)
- Computer Standards & Interfaces (2021-present)
- JAMIA Open (2021-present)
- Journal of Computational Social Science (2022-present)
- Aslib Journal of Information Management (2022-present)
- Applied Intelligence (2022-present)
- ACM Transactions on Asian and Low-Resource Language Information Processing (2022-present)
- Artificial Intelligence Review (2022-present)
- Information Processing and Management (2022-present)
- Journal of Informetrics (2022-present)
- Information Resources Management Journal (2022-present)
- Frontiers in Public Health (2022-present)
- Neural Processing Letters (2022-present)
- Machine Learning (2022-present)
- Sensors (2022-present)
- Computational Intelligence and Neuroscience (2022-present)
- PeerJ Computer Science (2022-present)
- Mathematics (2022-present)
- Computer Animation and Virtual Worlds (2023-present)
- Complex & Intelligent Systems (2023-present)
- Knowledge and Information Systems (2023-present)
- Expert Systems with Applications (2023-present)
- PLOS One (2023-present)
- Journal of the Association for Information Science and Technology (2023-present)
- Knowledge-Based Systems (2023-present)
- IEEE Transactions on Reliability (2023-present)
- IEEE Transactions on Big Data (2023-present)
- IEEE Transactions on Artificial Intelligence (2024-present)
- IEEE Transactions on Computational Social Systems (2024-present)
- IEEE Transactions on Emerging Topics in Computational Intelligence (2024-present)
- IEEE Transactions on Knowledge and Data Engineering (2024-present)
- Education and Information Technologies (2024-present)
- PC member and reviewer for Conferences and Workshops:
- IEEE COMPSAC (2020, 2021, 2022, 2023, 2024)
- IEEE ICSC (2020, 2021, 2022, 2023, 2024)
- IEEE Big Data (2019, 2020, 2021, 2022, 2023, 2024)
- IEEE QRS (2020, 2021, 2022, 2023, 2024)
- IEEE ETCEA (2022)
- iConference (2021, 2023, 2024, 2025)
- JCDL (2021, 2022, 2023, 2024)
- ASIS&T (2021, 2023, 2024)
- ALLDATA (2022, 2023, 2024)
- IEEE AITest (2023, 2024)
- IEEE DSA (2023, 2024)
Awards
- 2022. Great Grads Spring 2022, College of Information, University of North Texas
- 2021 - 2022. UNT COVID-19 Student Success Award (Spring 2022, Summer 2021, Fall 2021), University of North Texas
- 2021. UNT Toulouse Graduate School Summer Award, University of North Texas
- 2021. Linda Schamber Writing Award for paper “Topic sentiments toward the COVID-19 vaccine on Twitter”, College of Information, University of North Texas
- 2021. The Melba S. Harvill Endowed Scholarship, Department of Information Science, University of North Texas
- 2020. The Dewey E. Carroll Graduate Fellowship Award, Department of Information Science, University of North Texas
- 2020. The Donald B. and Ana D. Cleveland Medical Informatics Endowed Scholarship, Department of Information Science, University of North Texas
- 2019. The Mark E. Rorvig Endowed Graduate Fellowship Award, Department of Information Science, University of North Texas
- 2019. Outstanding Reviewer for The Electronic Library, Emerald
- 2019. The Donald B. and Ana D. Cleveland Medical Informatics Endowed Scholarship, Department of Information Science, University of North Texas
- 2018. The Howard Griesdorf Interdisciplinary Ph.D. Award, Department of Information Science, University of North Texas