BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//NJBDA - New Jersey Big Data Alliance - ECPv6.15.20//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:NJBDA - New Jersey Big Data Alliance
X-ORIGINAL-URL:https://njbda.org
X-WR-CALDESC:Events for NJBDA - New Jersey Big Data Alliance
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20240206T143000
DTEND;TZID=America/New_York:20240206T153000
DTSTAMP:20260420T033107
CREATED:20240202T165810Z
LAST-MODIFIED:20240202T170109Z
UID:2317-1707229800-1707233400@njbda.org
SUMMARY:NJIT Data Science Seminar Series
DESCRIPTION:Data Science Seminar Series in collaboration with the Department of Data Science\n\n“Structure-Enhanced Text Mining for Understanding and Augmenting Scientific Discovery”\n\nYu Zhang\, University of Illinois Urbana-Champaign\n\nLocation: Guttenberg Information Technologies Center (GITC) Building\, Room 4402 (4th floor lecture hall) (Coffee served at 2:15 PM)\n\nZoom Meeting Link\n\nHosted by Shuai Zhang\n\nLanguage models pre-trained on large-scale text corpora have achieved remarkable success in building text mining systems. Meanwhile\, text is usually accompanied by various types of structural signals\, such as document metadata\, concept ontologies\, and citation networks\, that can potentially benefit the understanding of text. To enhance the effectiveness of text mining methods\, my research focuses on teaching language models to exploit structural information for both fundamental tasks and advanced domain-specific applications\, with an emphasis on understanding and augmenting scientific discovery. In the first part of the talk\, I will present structure-aware classification algorithms that can predict relevant categories of a scientific paper from hundreds of thousands of candidate classes. These methods have been adapted into the Microsoft Academic Graph production pipeline. The second part of the talk will introduce seed-guided topic mining approaches that find category-indicative entities and structural signals. In the third part\, I will discuss how to leverage multi-task language model pre-training techniques to facilitate advanced applications in the scientific domain\, such as patient-to-article retrieval and paper-reviewer matching. Finally\, I will outline future research directions\, including structure-aware usage of large language models\, flexible translation between different types of scientific data\, and data mining for accelerating science and innovation.\n\nYu Zhang is a Ph.D. candidate in the Department of Computer Science at the University of Illinois Urbana-Champaign\, advised by Prof. Jiawei Han. Prior to UIUC\, he received his B.Sc. degree in Computer Science from Peking University. Yu’s research focuses on structure-enhanced text mining and its applications in scientific literature understanding. His first-authored papers have been published in top-tier venues in the fields of data mining\, natural language processing\, and information retrieval. Yu has been awarded the UIUC Dissertation Completion Fellowship and the Yunni & Maxine Pao Memorial Fellowship.
URL:https://njbda.org/event/njit-data-science-seminar-series/
LOCATION:Guttenberg Information Technologies Center (GITC)\, 218 Central Ave\, Newark\, New Jersey\, 07102\, United States
CATEGORIES:lectures/talks
ATTACH;FMTTYPE=image/jpeg:https://njbda.org/wp-content/uploads/2024/02/GITC-update-2-Large.jpeg
END:VEVENT
END:VCALENDAR