Short Bio

I am a research scientist in Google Research working on data mining and natural language processing. My research aims to develop automated methods for mining knowledge from text data without excessive human annotations.

I completed my Ph.D. at University of Illinois, Urbana-Champaign (UIUC), advised by Jiawei Han in the Data Mining Group. Prior to UIUC, I received my bachelor degree from Shanghai Jiao Tong University IEEE Honored Class, under the supervision of Prof. Xinbing Wang.

What's new!

May 2022 - Collaborated with Yunyi Zhang, our work on unsupervised key event discovery is accepted in KDD 2022.

Apr. 2022 - Gave a talk at Brandeis University.

Apr. 2022 - Gave a guest lecture at Virginia Tech CS 5824 Advanced Machine Learning.

Feb. 2022 - Two papers on document-level relation extraction and unsupervised constituency parsing are accepted in ACL 2022.

Feb. 2022 - Gave a guest lecture at Emory University CS 570 Data Mining.

Jan. 2022 - Collaborated with Dongha Lee, our work on Topic Taxonomy Completion has been accepted into WWW 2022.

Sept. 2021 - I have joined Google Research as a research scientist.

Aug. 2021 - Together with Xiaotao Gu, Yu Meng, our tutorial on Automated Taxonomy Discovery and Exploration is accepted into ICDM 2021.

Aug. 2021 - One paper on Open-domain Event Type Induction is accepted into EMNLP 2021 with its implementation in Github.

May 2021 - One paper on Efficient Text Encoder Pre-training is accepted into the Finding of ACL 2021.

March 2021 - One paper on Weakly-supervised Hierarchical Multi-Label Text Classification is accepted into NAACL 2021.

Dec. 2020 - One paper on Self-Supervised Taxonomy Completion is accepted into AAAI 2021.

Sept. 2020 - Two papers on Neural Linguistic Steganography and Joint Entity Set Expansion and Synonym Discovery are accepted into EMNLP 2020.

Area of Interests

My primary areas of interests in research include:

  • Data Mining
  • Natural Language Processing
  • Information Retrieval
  • Applied Machine Learning

I have also worked on:

  • Interactive Data Visualization
  • Data Wrangling
  • Web Development


Email jmshen1994[at]