Research Areas

My research is related to natural language processing, text mining, and data mining. The specific areas include
R1: Relation Extraction
Extract semantic relations from unstructured data.
R2: Document Classification and Annotation
Annotate texts automatically by keywords with deep learning.
R3: Knowledge Organisation
Organise information & knowledge to support access and retrieval.
R4: Information Behaviour
Searching, online self-archiving, health information behaviour.


(Marked by research areas R1-R4)
  • H. Dong, V. Suárez-Paniagua, W. Whiteley, H. Wu, Explainable Automated Coding of Clinical Notes using Hierarchical Label-wise Attention Networks and Label Embedding Initialisation. arXiv:2010.15728, accepted to Journal of Biomedical Informatics, 2021, pp. 1-21 (suppl. pp. 1-4). (R2) [preprint on ArXiv] [github]
  • A. Casey, E. Davidson, M.T.C. Poon, H. Dong, D. Duma, A. Grivas, ..., B. Alex, A Systematic Review of Natural Language Processing Applied to Radiology Reports. arXiv:2102.09553, 2021, pp. 1-38. [preprint on ArXiv]
  • H. Dong, Learning and Leveraging Structured Knowledge from User-Generated Social Media Data. Doctoral Thesis. University of Liverpool. Apr 2020. (R1,R2,R3) [pdf on Bibsonomy]
  • H. Dong, W. Wang, K. Huang, F. Coenen, Automated Social Text Annotation with Joint Multi-Label Attention Networks. IEEE Transactions on Neural Networks and Learning Systems, 2020, pp. 1-15. (R2) [pdf on Bibsonomy] [github]
  • H. Dong, W. Wang, F. Coenen, K. Huang, Knowledge Base Enrichment by Relation Learning from Social Tagging Data. Information Sciences, vol.526, 2020, pp. 203-220. (R1, R3) [pdf on Bibsonomy] [github]
  • H. Dong, W. Wang, K. Huang, F. Coenen, Joint Multi-Label Attention Networks for Social Text Annotation. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), Volume 1 (Long and Short Papers), pp. 1348-1354. (R2) [pdf] [github] [data] [poster]
  • J. Lee, S. Oh, H. Dong, F. Wang, G. Burnett. Motivations for Self-Archiving on an Academic Social Networking Site: A Study on ResearchGate. Journal of the Association for Information Science and Technology (JASIST), vol. 70, 2019, pp. 563-574. (R4) [pdf]
  • H. Dong. Matching Linked Data for Cross-Lingual Genealogical Services: Learn Chinese Surnames in Shanghai Library Open Data Application Development Contest. Journal of Academic Libraries [in Chinese], vol. 36, no. 4, 2018, pp. 50-57+103. (R3) [abstract in English] [pdf in Chinese] [github] [android 4.0-5.x apk] [android 9+ apk]
  • Y. Chen, H. Dong, W. Wang, Topic-Graph Based Recommendation on Social Tagging Systems: a study on ResearchGate, in Proceedings of the 2018 International Conference on Data Science and Information Technology (DSIT 2018), Singapore, July 20-22, 2018, pp. 138-143. (R3) [pdf on ResearchGate]
  • H. Dong, W. Wang, F. Coenen, Learning Relations from Social Tagging Data, PRICAI 2018: Trends in Artificial Intelligence, 15th Pacific Rim International Conference on Artificial Intelligence, Nanjing, China, August 28–31, Proceedings, Part I. Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence, vol 11012. Springer, Cham, 2018, pp. 29-41. (R1) [pdf on ResearchGate] [slides]
  • H. Dong, W. Wang, F. Coenen. Rule for Inducing Hierarchies from Social Tagging Data, in Chowdhury G., McLeod J., Gillet V., Willett P. (eds) Transforming Digital Worlds. iConference 2018, Sheffield, UK, 25-28 March. Lecture Notes in Computer Science, vol 10766. Springer, Cham, 2018, pp. 345-355. (R1) [pdf on ResearchGate] [slides]
  • H. Dong. Enrichment of Cross-Lingual Information on Chinese Genealogical Linked Data, in iConference 2017 Proceedings, Vol. 2, Wuhan, China, 22-25 March, 2017, pp. 31-42. (R3) [pdf] [slides] [github] [android 4.0-5.x apk] [android 9+ apk]
  • H. Dong, W. Wang, F. Coenen. Deriving Dynamic Knowledge from Academic Social Tagging Data: A Novel Research Direction, in iConference 2017 Proceedings, Wuhan, China, 22-25 March, 2017, pp. 661-666. (R1) [pdf] [github] [poster] [data]
  • J. Lee, S. Oh, H. Dong, F. Wang, G. Burnett, A Framework for Studying Motivations for Self-archiving on Academic Social Network Sites, in iConference 2017 Proceedings, Wuhan, China, 22-25 March, 2017, pp. 684-687. (R4) [pdf] [poster]
  • B. Yu, J. Lee, H. Dong. Health Information Seeking Behavior of Individuals with Hearing Loss in an Online Community, in iConference 2016 Proceedings, Philadelphia, Pennsylvania, USA, 20-23 March, 2016, pp 1-4. (R4) [pdf] [poster]
  • H. Dong, W. Wang, H.-N. Liang, Learning Structured Knowledge from Social Tagging Data: A Critical Review of Methods and Techniques, in 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity), 8th IEEE International Conference on Social Computing and Networking (IEEE SocialCom 2015), Chengdu, China, 19-21 December, 2015, pp. 307-314. (R1) [pdf on ResearchGate] [slides]
  • H. Dong, "Cognitive Invisibility" of Postgraduate Students’ Information Seeking on the Web: a Test of Mansourian’s Model," Master Dissertation, Information School, University of Sheffield, 2014, pp. 1-51. (R4) [pdf revised on ResearchGate] [pdf original]


  • H. Dong, H. Wu, C. Sudlow. Initialising Label Embedding for Automated Medical Coding of Clinical Notes. Presented virtually at Healthcare Text Analytics Conference (HealTAC 2020). [programme] [slides] [github]
  • H. Dong. Automatic Annotation of Documents Shared by Users: a Deep Learning Model Integrating Semantic Knowledge and Reading Behaviour (给用户分享的文本自动"贴标签"— 一个融合语义知识和阅读行为的深度学习模型). Presented at The 3rd San Ren Xing Semantic Salon (第三届三人行语义沙龙) in Information Technology for Libraries (IT4L), Topic: "Artificial Intelligence in Libraries" (AI在图情——2019图书馆前沿技术论坛) [in Chinese], Shanghai, China, 14 Aug 2019. (R1) [slides]
  • H. Dong. Introduction to BERT and Transformer: Pre-trained Self-attention Models to Leverage Unlabeled Corpus Data. Presented at PremiLab @ XJTLU, Suzhou, China, 4 Apr 2019. (R2) [slides]
  • H. Dong. iConference Doctoral Colloquium. iConference 2018, Sheffield, UK, 25 Mar 2018. [online brochure]
  • H. Dong. Understanding Social Tags: Relation Extraction and Tag Annotation. Presented at NLP@UoL, Liverpool, UK, 23 Mar 2018. (R1, R2) [slides]
  • H. Dong. How to Automatically Derive Semantic Relations from Social Tags? (如何自动建构社会标签中的语义关系?). Presented at The 1st San Ren Xing Semantic Salon (第一届三人行语义沙龙) [in Chinese], Shanghai, China, 19 Aug 2017. (R1) [slides]
  • H. Dong. Linked Data Consumption and Application Development: a case-study on the Genealogical Open Data from Shanghai Library (关联数据的消费与应用构建: 以上海图书馆家谱开放数据为例). Presented at The 13th Academic Digital Library Symposium (ADLS 2016, 第十三届数字图书馆前沿问题高级研讨班) [in Chinese], Shanghai Library, Shanghai, P.R. China, 5 Dec 2016. [slides] [github] [android 4.0-5.x apk] [android 9+ apk]
  • H. Dong, W. Wang. Learning Structured Knowledge from Social Media Data. Presented at Research talk with visitors from Sungkyul University (Korea). Suzhou, China, 14 Jan 2016. (R1)
  • H. Dong, B. Yu. Modeling Health-Related Topics in an Online Forum Designed for the Deaf & Hard of Hearing. Presented at 1st XJTLU Research Symposium on Healthy Ageing & Society, Suzhou, China, 14 Dec 2015. (R4) [slides]