NLP

Recent News

2020

Oct. covidAsk, a real-time COVID-19 domain QA system, accepted to EMNLP NLP-COVID Workshop 2020.
Oct. Our team KU has won the eighth BioASQ challenge in 8b Phase B (Results - KU Team, News).
Sep. BioBERT was included in the Best Papers for the NLP Section of the IMIA 2020 Yearbook (link).
Sep. 2 papers accepted to EMNLP 2020.
Apr. 2 papers accepted to ACL 2020.
Jan. Wonjin Yoon received the NAVER Ph.D. Fellowship Award. Congratulations!

2019

Oct. 1 paper accepted to Nucleic Acids Research.
Sep. Our team KU has won the seventh BioASQ challenge in 7b Phase B (Results - KU Team).
Aug. 1 paper accepted to Bioinformatics.
May. 1 paper accepted to ACL 2019.

Publications

2020

Answering Questions on COVID-19 in Real-Time
Jinhyuk Lee, Sean S. Yi, Minbyul Jeong, Mujeen Sung, Wonjin Yoon, Yonghwa Choi, Miyoung Ko, Jaewoo Kang*
NLP-COVID Workshop (EMNLP 2020) (Long)
[Paper] [Code] [Web Service]

Look at the First Sentence: Position Bias in Question Answering
Miyoung Ko, Jinhyuk Lee*, Hyunjae Kim, Gangwoo Kim, Jaewoo Kang*
Conference on Empirical Methods in Natural Language Processing (EMNLP 2020) (Long)
[Paper] [Code]

Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park, Mujeen Sung, Jinhyuk Lee*, Jaewoo Kang*
Conference on Empirical Methods in Natural Language Processing (EMNLP 2020/Findings) (Short)
[Paper] [Code]

Transferability of Natural Language Inference to Biomedical Question Answering
Minbyul Jeong, Mujeen Sung, Gangwoo Kim, Donghyeon Kim, Wonjin Yoon, Jaehyo Yoo, Jaewoo Kang*
BioASQ Workshop (CLEF 2020)
[Paper] [Code]

Biomedical Entity Representations with Synonym Marginalization
Mujeen Sung, Hwisang Jeon, Jinhyuk Lee*, Jaewoo Kang*
Annual Conference of the Association for Computational Linguistics (ACL 2020) (Long)
[Paper] [Code]

Contextualized Sparse Representations for Real-Time Open-Domain Question Answering
Jinhyuk Lee, Minjoon Seo, Hannaneh Hajishirzi, Jaewoo Kang
Annual Conference of the Association for Computational Linguistics (ACL 2020) (Short)
[Paper] [Code]

Building a PubMed knowledge graph
Jian Xu, Sunkyu Kim, Min Song, Minbyul Jeong, Donghyeon Kim, Jaewoo Kang*, Justin F. Rousseau, Xin Li, Weijia Xu, Vetle I. Torvik, Yi Bu, Chongyan Chen, Islam Akef Ebeid, Daifeng Li & Ying Ding
Scientific Data 2020
[Paper]

BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Jinhyuk Lee†, Wonjin Yoon†, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, Jaewoo Kang*
Bioinformatics 2020
[Paper] [Code]

2019

Pre-trained Language Models for Biomedical Question Answering
Wonjin Yoon, Jinhyuk Lee, Donghyeon Kim, Minbyul Jeong, Jaewoo Kang*
BioASQ Workshop (ECML PKDD 2019)
[Paper] [Code]

CollaboNet: collaboration of deep neural networks for biomedical named entity recognition
Wonjin Yoon†, Chan Ho So†, Jinhyuk Lee, and Jaewoo Kang*
Bioinformatics 2019
[Paper] [Code]

A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining
Donghyeon Kim, Jinhyuk Lee, Chan Ho So, Hwisang Jeon, Minbyul Jeong, Yonghwa Choi, Wonjin Yoon, Mujeen Sung, Jaewoo Kang*
IEEE Access 2019
[Paper] [Code] [Demo]

Can Machines Learn to Comprehend Scientific Literature?
Donghyeon Park, Yonghwa Choi, Daehan Kim, Minhwan Yu, Seongsoon Kim and Jaewoo Kang*
IEEE Access 2019
[Paper]

Real-Time Open-Domain Question Answering on Wikipedia with Dense-Sparse Phrase Index
Minjoon Seo†, Jinhyuk Lee†, Tom Kwiatkowski, Ankur Parikh, Ali Farhadi, Hannaneh Hajishirzi
Annual Conference of the Association for Computational Linguistics (ACL 2019) (Long)
[Paper] [Code]

ChimerDB 4.0: an updated and expanded database of fusion genes
Ye Eun Jang†, Insu Jang†, Sunkyu Kim†, Subin Cho†, Daehan Kim, Keonwoo Kim, Jaewon Kim, Jimin Hwang, Sangok Kim, Jaesang Kim, Jaewoo Kang, Byungwook Lee*, Sanghyuk Lee*
Nucleic Acids Research 2019
[Paper] [Demo]

2018

Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering
Jinhyuk Lee, Seongjun Yun, Hyunjae Kim, Miyoung Ko, and Jaewoo Kang*
Conference on Empirical Methods in Natural Language Processing (EMNLP 2018) (Short)
[Paper] [Code]

Learning User Preferences and Understanding Calendar Contexts for Event Scheduling
Donghyeon Kim†, Jinhyuk Lee†, Donghee Choi, Jaehoon Choi and Jaewoo Kang*
International Conference on Information and Knowledge Management (CIKM 2018)
[Paper] [Code]

A Deep Neural Spoiler Detection Model using a Genre-Aware Attention Mechanism
Buru Chang, Hyunjae Kim, Raehyun Kim, Daehan Kim, and Jaewoo Kang*
The 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018)
[Paper] [Code]

A Pilot Study of Biomedical Text Comprehension using an Attention-Based Deep Neural Reader: Design and Experimental Analysis
Seongsoon Kim, Donghyeon Park, Yonghwa Choi, Kyubum Lee, Byounggun Kim, Minji Jeon, Jihye Kim, Aik Choon Tan, Jaewoo Kang*
JMIR Medical Informatics 2018
[Paper]

Deep learning of mutation-gene-drug relations from the literature
Kyubum Lee, Byounggun Kim, Yonghwa Choi, Sunkyu Kim, Wonho Shin, Sunwon Lee, Sungjoon Park, Seongsoon Kim, Aik Choon Tan, Jaewoo Kang*
Bioinformatics 2018
[Paper]

Drug drug interaction extraction from the literature using a recursive neural network
Sangrak Lim, Kyubum Lee, Jaewoo Kang*
PLOS ONE 2018
[Paper] [Code]

Chemical-gene relation extraction using recursive neural network
Sangrak Lim, Jaewoo Kang*
Database 2018
[Paper] [Code]

2017

Name Nationality Classification with Recurrent Neural Networks
Jinhyuk Lee, Seongjun Yun, Hyunjae Kim, Miyoung Ko, and Jaewoo Kang*
International Joint Conference on Artificial Intelligence (IJCAI 2017)
[Paper] [Code]

Constructing and Evaluating a Novel Crowdsourcing-based Paraphrased Opinion Spam Dataset
Seongsoon Kim†, Seongwoon Lee†, Donghyeon Park, and Jaewoo Kang*
International World Wide Web Conference (WWW 2017)
[Paper] [Code]

ChimerDB 3.0: an enhanced database for fusion genes from cancer transcriptome and literature data mining
Myunggyo Lee†, Kyubum Lee†, Namhee Yu†, Insu Jang†, Ikjung Choi, Pora Kim, Ye Eun Jang, Byounggun Kim, Sunkyu Kim, Byungwook Lee, Jaewoo Kang*, and Sanghyuk Lee*
Nucleic Acids Research 2017
[Paper] [Code]

2016

SEMO: Searching Majority Opinions on Movies using SNS and QA Threads
Jukyong Lee, Yonghwa Choi, Suhkyung Kim, Seongsoon Kim, Jaewoo Kang*
The 25th International World Wide Web Conference (WWW 2016) (Demo)
[Paper]

BEST: Next-Generation Biomedical Entity Search Tool for Knowledge Discovery from Biomedical Literature
Sunwon Lee, Donghyeon Kim, Kyubum Lee, Jaehoon Choi, Seongsoon Kim, Minji Jeon, Sangrak Lim, Donghee Choi, Sunkyu Kim, Aik Choon Tan, Jaewoo Kang*
PLOS ONE 2016
[Paper] [Code] [Demo]

HiPub: translating PubMed and PMC texts to networks for knowledge discovery
Kyubum Lee, Wonho Shin, Byounggun Kim, Sunwon Lee, Yonghwa Choi, Sunkyu Kim, Minji Jeon, Aik Choon Tan, Jaewoo Kang*
Bioinformatics 2016 (Applications Note)
[Paper]

BRONCO: Biomedical entity Relation ONcology COrpus for extracting gene-variant-disease-drug relations
Kyubum Lee, Sunwon Lee, Sungjoon Park, Sunkyu Kim, Suhkyung Kim, Kwanghun Choi, Aik Choon Tan*, Jaewoo Kang*
Database 2016
[Paper] [Demo]

2015

Deep Semantic Frame-based Deceptive Opinion Spam Analysis
Seongsoon Kim, Hyeokyoon Chang, Seongwoon Lee, Minhwan Yu, Jaewoo Kang*
In Proceedings of ACM International Conference on Information and Knowledge Management (CIKM 2015)
[Paper]

Smith Search: Opinion-Based Restaurant Search Engine
Jaehoon Choi, Donghyeon Kim, Donghee Choi, Seongsoon Kim, Sangrak Lim, Youngjae Choi and Jaewoo Kang*
Proceedings of the 24th International Conference on World Wide Web (WWW 2015) (Demo)
[Paper]

~2014

BOSS: context-enhanced search for biomedical objects
Jaehoon Choi, Donghyeon Kim, Seongsoon Kim, Sunwon Lee, Kyubum Lee and Jaewoo Kang*
BMC Medical Informatics and Decision Making 2012
[Paper]

Consento: A New Framework for Opinion Based Entity Search and Summarization
Jaehoon Choi, Donghyeon Kim, Seongsoon Kim, Junkyu Lee, Sunwon Lee and Jaewoo Kang*
21st ACM International Conference on Information and Knowledge Management (CIKM 2012) (Short)
[Paper]

Consento: A Consensus Search Engine for Answering Subjective Queries
Jaehoon Choi, Donghyeon Kim, Seongsoon Kim, Junkyu Lee, Sangrak Lim, Sunwon Lee and Jaewoo Kang*
Proceedings of the 21st International Conference on World Wide Web (WWW 2012) (Poster)
[Paper]

BOSS: A Biomedical Object Search System
Jaehoon Choi, Donghyeon Kim, Seongsoon Kim, Sunwon Lee, Kyubum Lee and Jaewoo Kang*
ACM Fifth International Workshop on Data and Text Mining in Biomedical Informatics (DTMBIO 2011)
[Paper]

A Scalable Method for Detecting Multiple Loci Associated with Traits using TF-IDF Weighting and Association Rule Mining
Sunwon Lee, Jaewoo Kang and Junho Oh
IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW 2010)
[Paper]