I'm a PhD candidate at the University of Illinois at Urbana-Champaign (UIUC) advised by Prof. Mani Golparvar-Fard and Julia Hockenmaier. I'm currently working at Document Crunch AI R&D team, for analyzing information in construction contracts and specifications and developing Retrieval-Augmented Generation (RAG) in an LLM-based question answering system.
My research focuses on Natural Language Processing (NLP) and Computer Vision, including:
Simultaneously, my contributions extend to efficiency and effectiveness in various general NLP tasks, where I proposed novel approaches to resolve general problems, yielding a paper accepted to EMNLP 2023 Findings and preparing three papers toward top-tier AI/ML conferences.
University of Illinois at Urbana-Champaign
Aug 2019 - Current
Hanyang University, Seoul, South Korea
Mar 2015 - Feb 2019
Machine Learning Intern, Document Crunch
May 2024 - Current
Research and Teaching Assistant, University of Illinois at Urbana-Champaign
Aug 2019 - Current
William E. O'Neil Award, UIUC
2024 - 2025
Scholarship, Korean-American CEPM Association and HanmiGlobal
2024
Grand prize, 4th Industrial Revolution Imagination Contest, Hanyang Univ
2017
Encouragement Prize, Design Construction Competition, KSCE
2016
4-Year Full Scholarship, POSCO
2015 - 2018
I'm interested in natural language processing, computer vision, multimodal learning, deep learning, generative AI, and human-computer interaction. Most of my research is about inferring the knowledge (i.e., best practices) of construction planning and controls from textual (e.g., schedules, daily reports, change orders, RFI, etc.) and visual (e.g., images, point clouds, etc.) information. Representative papers are highlighted.
A Vision Transformer-based image captioning model, VisualSiteDiary, which creates human-readable captions for daily progress and work activity log, and enhances image retrieval tasks. Present a new image-caption pair dataset (VSD dataset) and Demo toward a real-time construction site daily log reporting. Superior-quality captions are generated compared to the state-of-the-art image captioning models.
A new NLP transformer model, UniformatBridge, to automatically map schedule activities to ASTM UniFormat classes, embedding construction schedule sequencing knowledge. UniformatBridge serves as a universal identifier enabling automated creation of 4D BIMs and streamlines mapping between schedule, cost and payment application data.
A NLP transformer method for deciphering implicit construction dependencies with the role and flexibility of activity relationships. A method for revising lookahead plans based on the flexibility of activity dependencies, mitigating the risk of delays.
This paper presents the first attempt to automate linking look-ahead planning tasks to master-schedule activities following an NLP-based multi-stage ranking formulation. Our model employs distance-based matching for candidate generation and a Transformer architecture for final matching.
An LSTM-RNN-based forecasting model is presented for investigation of PV sites. Time series data of spatial and meteorological conditions are considered and this work allows to search and evaluate suitable locations for PV plants in a wide area.
This paper presents a Voice-activated Artificial Intelligence (AI) assistant, CONSTRUCTVOICEBOT. Our solution leverages the first domain-specific Speech-to-Text Transformer model, called CON-WHISPER, with a synthetic voice-text dataset from 35 commercial building projects. We delve into practical applications through use cases such as Time and Material reporting, daily construction reporting, quality assurance, and curation of construction workflows.
Built on UniformatBridge, ASTM Uniformat classification is utilized to map color-coded 3D point clouds aligned with schedule activities without relying on BIM as a baseline. Exemplary results on tied new transformer-based models with few-shot learning are shown.
AIConstruct system is presented to demonstrate, for the first time, how the integration of text and image can create seamless data synchronization for construction progress monitoring and automated schedule generation, unlocking a new research paradigm.
Building on the predefined rules and heuristics formulated in the Defense Contract Management Agency (DCMA)’s 14 Point Schedule Quality Assessment, this paper explores the feasibility of heuristic-based and deep learning methods to assess a project schedule health from qualitative and quantitative perspectives.
A close examination on the problems underpinning construction scheduling theory and practice such as sequencing logic and activity description by offering a systematic review on: 1) the way in which BIM-driven schedules are formalized; and 2) the challenges of tying in Building Information Modeling (BIM) with project schedules and/or BIM-driven schedule creation techniques.
We present a simple, but effective method to incorporate syntactic dependency information directly into transformer-based language models (e.g. RoBERTa) for Aspect-Based Sentiment Classification (ABSC). Yet, SIR-ABSC outperforms these more complex models, yielding new state-of-the-art results on ABSC.
We introduce PTMoE-Cap, a simple yet effective approach for generating stylized image captions, leveraging the synergy of Mixture-of-Experts (MoE) and prompt learning techniques as a effective routing source.
We leverage meticulously crafted adversarial noise to generate a parameter mask, effectively resetting certain parameters and rendering them unlearnable. A novel approach called Attack-and-Reset for Unlearning (ARU) outperforms current state-of-the-art results on two facial machine-unlearning benchmark datasets.
Vice President | Korean Women’s Grad. Stdnt. Assoc. of Civil and Env. Eng., UIUC | 2024 – Present
Student President | Korean-American CEPM Association | 2024 – Present
Reviewer | ASCE Journal of Construction Engineering and Management | 2024 – Present
Reviewer | Elsevier Automation in Construction | 2024 – Present
Reviewer | ASCE Journal of Computing in Civil Engineering | 2023 – Present
Member | Association for Computational Linguistics | 2023 – Present
Student member | ASCE Data Sensing and Analysis Committee | 2023 – Present
Student member | ASCE Visualization, Information Modeling, and Simulation | 2023 – Present
Student member | American Society of Civil Engineers (ASCE) 2022 – Present
Student member | Korean Society of Civil Engineers (KSCE) | 2018 – 2019
Director of Students’ Association Marketing Depart. | Hanyang Univ. | Mar. 2015 – Dec 2016