Yoonhwa (Yuna) Jung

I'm a PhD candidate at the University of Illinois at Urbana-Champaign (UIUC) advised by Prof. Mani Golparvar-Fard and Julia Hockenmaier. I'm currently working at Document Crunch AI R&D team, for analyzing information in construction contracts and specifications and developing Retrieval-Augmented Generation (RAG) in an LLM-based question answering system.

My research focuses on Natural Language Processing (NLP) and Computer Vision, including:

  • analyzing construction best practices to generate actionable insights for automated project planning and controls;
  • leveraging synergies between lean construction, BIM and Reality modeling for construction production management;
  • enhancing the accuracy and reliability of generative AI models and creating multimodal models for proactive construction management systems

Simultaneously, my contributions extend to efficiency and effectiveness in various general NLP tasks, where I proposed novel approaches to resolve general problems, yielding a paper accepted to EMNLP 2023 Findings and preparing three papers toward top-tier AI/ML conferences.

Email  /  LinkedIn  /  Scholar  / 

Education

University of Illinois at Urbana-Champaign

  • Ph.D. Candidate in Civil Engineering — AI in Construction
  • Master of Computer Science
  • Master of Science in Civil Engineering, Construction Eng. & Mgmt.

Aug 2019 - Current

Hanyang University, Seoul, South Korea

  • B.S. in Civil Engineering
  • High Honors for Academic Achievement (Sep 2015)

Mar 2015 - Feb 2019

Work Experience

Machine Learning Intern, Document Crunch

  • Recognized as an Inspired Team Memeber, Aug 2024
  • Improving reliability of AI chat service and analysis for consturction specifications and drawings

May 2024 - Current

Research and Teaching Assistant, University of Illinois at Urbana-Champaign

  • NLP-based Schedule Analytics to Generate Actionable Insights for Project Planning and Controls
  • Taught guest lectures for CEE320 Construction Engineering and Management, Leader TA

Aug 2019 - Current

Awards & Scholarships

William E. O'Neil Award, UIUC

2024 - 2025

2024

Grand prize, 4th Industrial Revolution Imagination Contest, Hanyang Univ

2017

Encouragement Prize, Design Construction Competition, KSCE

2016

4-Year Full Scholarship, POSCO

2015 - 2018

Research

I'm interested in natural language processing, computer vision, multimodal learning, deep learning, generative AI, and human-computer interaction. Most of my research is about inferring the knowledge (i.e., best practices) of construction planning and controls from textual (e.g., schedules, daily reports, change orders, RFI, etc.) and visual (e.g., images, point clouds, etc.) information. Representative papers are highlighted.

Journal Papers

VisualSiteDiary: A Detector-Free Vision-Language Transformer Model for Captioning Photologs for Daily Construction Reporting and Image Retrievals

Yoonhwa Jung, Ikhyun Cho, Shun-Hsiang Hsu, Mani Golparvar-Fard
Automation in Construction, 2024
project page / video / Journal paper

A Vision Transformer-based image captioning model, VisualSiteDiary, which creates human-readable captions for daily progress and work activity log, and enhances image retrieval tasks. Present a new image-caption pair dataset (VSD dataset) and Demo toward a real-time construction site daily log reporting. Superior-quality captions are generated compared to the state-of-the-art image captioning models.

UniformatBridge: Transformer language model for mapping construction schedule activities to uniformat categories

Yoonhwa Jung, Julia Hochenmaier, Mani Golparvar-Fard
Automation in Construction, 2024
Journal paper / Conference paper

A new NLP transformer model, UniformatBridge, to automatically map schedule activities to ASTM UniFormat classes, embedding construction schedule sequencing knowledge. UniformatBridge serves as a universal identifier enabling automated creation of 4D BIMs and streamlines mapping between schedule, cost and payment application data.

Construction schedule augmentation with implicit dependency constraints and automated generation of lookahead plan revisions

Fouad Amer, Yoonhwa Jung, Mani Golparvar-Fard
Automation in Construction, 2023
Journal paper

A NLP transformer method for deciphering implicit construction dependencies with the role and flexibility of activity relationships. A method for revising lookahead plans based on the flexibility of activity dependencies, mitigating the risk of delays.

Transformer machine learning language model for auto-alignment of long-term and short-term plans in construction

Fouad Amer, Yoonhwa Jung, Mani Golparvar-Fard
Automation in Construction, 2021
Journal paper

This paper presents the first attempt to automate linking look-ahead planning tasks to master-schedule activities following an NLP-based multi-stage ranking formulation. Our model employs distance-based matching for candidate generation and a Transformer architecture for final matching.

Long short-term memory recurrent neural network for modeling temporal patterns in long-term power forecasting for solar PV facilities: Case study of South Korea

Yoonhwa Jung, Jaehoon Jung, Byungil Kim, SangUk Han
Journal of Cleaner Production, 2020
Journal paper

An LSTM-RNN-based forecasting model is presented for investigation of PV sites. Time series data of spatial and meteorological conditions are considered and this work allows to search and evaluate suitable locations for PV plants in a wide area.



*Cited over 130

Peer-reviewed Conference Papers

Voice-assisted AI-Chatbot for Construction: Speech-to-Text Deep Learning Transformer trained with Synthetic Datasets

Yoonhwa Jung, Mani Golparvar-Fard
Computing in Civil Engineering, 2024
Conference paper

This paper presents a Voice-activated Artificial Intelligence (AI) assistant, CONSTRUCTVOICEBOT. Our solution leverages the first domain-specific Speech-to-Text Transformer model, called CON-WHISPER, with a synthetic voice-text dataset from 35 commercial building projects. We delve into practical applications through use cases such as Time and Material reporting, daily construction reporting, quality assurance, and curation of construction workflows.

Evaluation of Mapping Computer Vision Segmentation from Reality Capture to Schedule Activities for Construction Monitoring in the Absence of Detailed BIM

Juan D. Núñez-Morales, Yoonhwa Jung, Mani Golparvar-Fard
Proceedings of the 41th ISARC, 2024
Conference paper (Equal contribution)

Built on UniformatBridge, ASTM Uniformat classification is utilized to map color-coded 3D point clouds aligned with schedule activities without relying on BIM as a baseline. Exemplary results on tied new transformer-based models with few-shot learning are shown.

Bi-Directional Image-to-Text Mapping for NLP-Based Schedule Generation and Computer Vision Progress Monitoring

Juan D. Núñez-Morales, Yoonhwa Jung, Mani Golparvar-Fard
Construction Research Congress, 2024
Conference paper

AIConstruct system is presented to demonstrate, for the first time, how the integration of text and image can create seamless data synchronization for construction progress monitoring and automated schedule generation, unlocking a new research paradigm.

Integrated Heuristic and Machine Learning Approach for Schedule Health Monitoring in Construction

Yoonhwa Jung, Fouad Amer, Mani Golparvar-Fard
Construction Research Congress, 2022
Journal paper (in preparation)/ Conference paper

Building on the predefined rules and heuristics formulated in the Defense Contract Management Agency (DCMA)’s 14 Point Schedule Quality Assessment, this paper explores the feasibility of heuristic-based and deep learning methods to assess a project schedule health from qualitative and quantitative perspectives.

Review
Paper

A systematic review on the requirements on BIM maturity and formal representation of sequencing knowledge for automated construction scheduling

Yoonhwa Jung, Fouad Amer, Mani Golparvar-Fard
Proceedings of the 38th International Conference of CIB W78, Luxembourg, 2021
Conference paper

A close examination on the problems underpinning construction scheduling theory and practice such as sequencing logic and activity description by offering a systematic review on: 1) the way in which BIM-driven schedules are formalized; and 2) the challenges of tying in Building Information Modeling (BIM) with project schedules and/or BIM-driven schedule creation techniques.


Computer Science domain papers

SIR-ABSC: Incorporating Syntax into RoBERTa-based Sentiment Analysis Models with a Special Aggregator Token

Ikhyun Cho, Yoonhwa Jung, Julia Hockenmaier
EMNLP Findings, 2023
project page/ Paper

We present a simple, but effective method to incorporate syntactic dependency information directly into transformer-based language models (e.g. RoBERTa) for Aspect-Based Sentiment Classification (ABSC). Yet, SIR-ABSC outperforms these more complex models, yielding new state-of-the-art results on ABSC.

Prompting for Mixture-of-Experts: A Prompt-based Mixture-of-Experts framework for Stylized Image Captioning

Ikhyun Cho, Yoonhwa Jung, Julia Hockenmaier
Preparing a submission, 2025

We introduce PTMoE-Cap, a simple yet effective approach for generating stylized image captions, leveraging the synergy of Mixture-of-Experts (MoE) and prompt learning techniques as a effective routing source.

Machine
Unlearning

Attack and Reset for Unlearning: Exploiting Adversarial Noise toward Machine Unlearning through Parameter Re-initialization

Yoonhwa Jung, Ikhyun Cho, Shun-Hsiang Hsu, Julia Hockenmaier
Preparing a submission, 2025
project page/ arXiv

We leverage meticulously crafted adversarial noise to generate a parameter mask, effectively resetting certain parameters and rendering them unlearnable. A novel approach called Attack-and-Reset for Unlearning (ARU) outperforms current state-of-the-art results on two facial machine-unlearning benchmark datasets.

Teaching

UIUC CEE 320 - Construction Engineering
Teaching Assistant (Leader TA), Fall 2023
ㄴ (selected one of the best courses from student evaluation)
Teaching Assistant, Fall 2021, Spring 2022, Fall 2022, Spring 2023

Summer High School Science Program
Teacher, Mirim Girls’ Info. Sci. High School, Seoul, Jul. 2015

Services

Vice President | Korean Women’s Grad. Stdnt. Assoc. of Civil and Env. Eng., UIUC | 2024 – Present

Student President | Korean-American CEPM Association | 2024 – Present

Reviewer | ASCE Journal of Construction Engineering and Management | 2024 – Present

Reviewer | Elsevier Automation in Construction | 2024 – Present

Reviewer | ASCE Journal of Computing in Civil Engineering | 2023 – Present

Member | Association for Computational Linguistics | 2023 – Present

Student member | ASCE Data Sensing and Analysis Committee | 2023 – Present

Student member | ASCE Visualization, Information Modeling, and Simulation | 2023 – Present

Student member | American Society of Civil Engineers (ASCE) 2022 – Present

Student member | Korean Society of Civil Engineers (KSCE) | 2018 – 2019

Director of Students’ Association Marketing Depart. | Hanyang Univ. | Mar. 2015 – Dec 2016

Affiliations

Document Crunch Document Crunch
2024 - present
RAAMAC AI Lab RAAMAC AI Lab
2019 - present
UIUC CS NLP Group UIUC CS NLP Group
2022 - present
UIUC CEE UIUC CEE
2019 - present
Hanyang Univ. Hanyang Univ.
2015 - 2019