David!
I'm an undergraduate student in Engineering Science (Machine Intelligence) at the University of Toronto.
My research experience spans machine learning and data science, with a focus on multimodal dialogue systems, medical image analysis, and generative models.
I also have practical experience with large-scale datasets, experimental design, and visualization pipelines, and occasionally work with frontend frameworks such as React, Vue, and Next.js to support research tools and interfaces.

Selected papers and preprints.
Guo, D.*, Sun, M.*, Jiang, Y.*, Liang, J., & Sanner, S.
Accepted
Liang, J.*, Liu, Y. S.*, Guo, D.*, Sun, M., Jiang, Y., & Sanner, S.
Accepted
A selection of projects I've worked on with public repositories
Modeled healthy spinal geometry with a bi-directional quantile regression framework (XGBoost) that predicts normative vertebral volume and inter-centroid distances from demographic and biomarker features. Built a population Statistical Shape Model via Generalized Procrustes Analysis to capture vertebral shape variance, then integrated both modules into a Python library and dashboard for clinician-facing assessment of metastasis-related deformities.

This project implements multi-task learning to add a classifier head to nnUNetv2's Residual Encoder preset. This classifier is used to augment nnUNetv2's existing segmentation, specifically in the context of lesion subtypes in pancreatic cancers.

In this on-going project, I am investigating the efficacy of using Wasserstein GANs with Gradient Penalty to generate images to balance the WM-811K silicon wafer map dataset. Being able to generate synthetic data will aid in training image detection models later on to improve fabrication efficacy. Preliminary results show improvement for problem classes.

In this project, we developed a model for music generation using autoencoders and transformer mixture distribution models, implemented in TensorFlow. Building on techniques like variational autoencoders (VAEs) and transformers, our approach processes high-dimensional music data to create coherent compositions. By using a sliding window method and training the model on both diverse and classical music datasets, we aimed to capture melodic patterns effectively. While the model performs well at learning tonality, there are still challenges with rhythm and long-term structure. We're continuing to explore ways to enhance the model’s rhythmic coherence and overall musicality.
University Email
davidmy.guo@mail.utoronto.ca
Personal Email
davidguo123456@gmail.com
Phone
(604) 825 9637
Location
Toronto, ON