cv
may not be up-to-date.
Basics
| Name | Tianhao Fu |
| Label | AI Scientist |
| [email protected] | |
| Url | https://atatc.me |
| Summary | 2T9 EngSci at University of Toronto; Research Team Lead at UTMIST; Intern at Vector Institute; Chief of Project Neura |
Work
-
2025.09 - Present Research Team Lead
University of Toronto Machine Intelligence Student Team
The AIP Project.
- Developed the AIP Project (Automated Iterative Pseudo-labeling)
-
2024.08 - Present Research Intern
Vector Institute (Bo Wang Lab)
Competing in medical AI challenges presented by MICCAI.
- Ranked 4th in the PANORAMA Challenge
- Ranked 4th in the SegSTRONG-C Challenge
- Helped with Fast nnU-Net
-
2019.06 - Present Chief
Project Neura
Gathering researchers and developers to bring their ideas into reality. Developed projects like LEADS and MIP Candy.
- Developed MIP Candy (A Candy for Medical Image Processing)
- Developed LEADS (Lightweight Embedded Assisted Driving System)
- Built an infrastructure cloud compute platform with Docker
Education
-
2025.09 - Present Canada
-
2022.09 - 2025.06 Canada
-
2018.09 - 2022.06 China
Awards
- 2026
- 2025
Digital Citizenship Graduation Award
STEM Fellowship, Royal Bank of Canada, LinkedIn Learning, and Villanova College
In recognition of exemplary digital leadership and the promotion of Canadian values.
- 2025
Ontario Scholar
Ontario Ministry of Education
Certificates
| IELTS (8.0 / 9.0) | ||
| British Council, IDP IELTS, and Cambridge University Press & Assessment | 2024-08-19 |
Publications
-
2026.02.24 MIP Candy: A Modular PyTorch Framework for Medical Image Processing
arXiv
Medical image processing demands specialized software that handles high-dimensional volumetric data, heterogeneous file formats, and domain-specific training procedures. Existing frameworks either provide low-level components that require substantial integration effort or impose rigid, monolithic pipelines that resist modification. We present MIP Candy (MIPCandy), a freely available, PyTorch-based framework designed specifically for medical image processing. MIPCandy provides a complete, modular pipeline spanning data loading, training, inference, and evaluation, allowing researchers to obtain a fully functional process workflow by implementing a single method, πππππ_ππππ πππ, while retaining fine-grained control over every component. Central to the design is π»ππ’πππ, a deferred configuration mechanism that enables runtime substitution of convolution, normalization, and activation modules without subclassing. The framework further offers built-in k-fold cross-validation, dataset inspection with automatic region-of-interest detection, deep supervision, exponential moving average, multi-frontend experiment tracking (Weights & Biases, Notion, MLflow), training state recovery, and validation score prediction via quotient regression. An extensible bundle ecosystem provides pre-built model implementations that follow a consistent trainer--predictor pattern and integrate with the core framework without modification. MIPCandy is open-source under the Apache-2.0 license and requires Python~3.12 or later.
-
2025.11.20 Artificial intelligence and radiologists in pancreatic cancer detection using standard of care CT scans (PANORAMA): an international, paired, non-inferiority, confirmatory, observational study
Lancet Oncology
Pancreatic ductal adenocarcinoma (PDAC) has the worst prognosis among major cancer types, primarily due to late diagnosis on contrast-enhanced CT. Artificial intelligence (AI) can improve diagnostic performance, but robust benchmarks and reliable comparison to radiologists' performance are scarce. We established an open-source benchmark with the aim of investigating AI systems for PDAC detection on CT and compared them to radiologists' performance, at scale.
-
2024.10.23 LEADS: Lightweight Embedded Assisted Driving System
arXiv
With the rapid development of electric vehicles, formula races that face high school and university students have become more popular than ever as the threshold for design and manufacturing has been lowered. In many cases, we see teams inspired by or directly using toolkits and technologies inherited from standardized commercial vehicles. These architectures are usually overly complicated for amateur applications like the races. In order to improve the efficiency and simplify the development of instrumentation, control, and analysis systems, we propose LEADS (Lightweight Embedded Assisted Driving System), a dedicated solution for such scenarios.
-
2024.07.16 SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions β An EndoVisβ24 Challenge
arXiv
Surgical data science has seen rapid advancement due to the excellent performance of end-to-end deep neural networks (DNNs) for surgical video analysis. Despite their successes, end-to-end DNNs have been proven susceptible to even minor corruptions, substantially impairing the model's performance. This vulnerability has become a major concern for the translation of cutting-edge technology, especially for high-stakes decision-making in surgical data science. We introduce SegSTRONG-C, a benchmark and challenge in surgical data science dedicated, aiming to better understand model deterioration under unforeseen but plausible non-adversarial corruption and the capabilities of contemporary methods that seek to improve it. Through comprehensive baseline experiments and participating submissions from widespread community engagement, SegSTRONG-C reveals key themes for model failure and identifies promising directions for improving robustness. The performance of challenge winners, achieving an average 0.9394 DSC and 0.9301 NSD across the unreleased test sets with corruption types: bleeding, smoke, and low brightness, shows inspiring improvement of 0.1471 DSC and 0.2584 NSD in average comparing to strongest baseline methods with UNet architecture trained with AutoAugment. In conclusion, the SegSTRONG-C challenge has identified some practical approaches for enhancing model robustness, yet most approaches relied on conventional techniques that have known, and sometimes quite severe, limitations. Looking ahead, we advocate for expanding intellectual diversity and creativity in non-adversarial robustness beyond data augmentation or training scale, calling for new paradigms that enhance universal robustness to corruptions and may enable richer applications in surgical data science.
-
2023.06.14 A deep learning model for accurate and robust internet traffic classification
Applied and Computational Engineering
Network traffic classification is significant due to the fast growth of the number of internet users. The traditional way of classifying the large number of traffic generated by these users is becoming less effective. Therefore, many researchers made a network traffic classifier based on deep learning. However, those classifiers do not provide far better results and perform poorly when dealing with encrypted information. This paper tries to approach highly accurate and robust results in both encrypted and unencrypted networks by using machine learning algorithms. The algorithm used is the convolutional neural network (CNN). The performance of the proposed CNN is compared with that of the classical LeNet-5 network. Experimental results show that the classifier based on the proposed CNN performed better when dealing with both encrypted and unencrypted datasets, achieving a maximum average accuracy of 83.55%. Moreover, it is not sensitive to hyper-parameter choices, indicating its superiority in robustness. Compared with traditional network classifiers, the network classifier based on CNN can improve accuracy and improve stability.
Languages
| English | |
| Fluent |
| Mandarin | |
| First Language |
Projects
- 2023.11 - 2025.09
LEADS
Lightweight Embedded Assisted Driving System
- Developed an onboard instrumentation system that displays data like the wheel speed
- Developed a webpage dashboard that remotely monitors the vehiclesβ status from the Pit crew
- Developed a multi-camera streaming and recording system
- Developed support for saving and replaying vehicle data
- Developed an efficient real-time data link using pure TCP/IP
- Developed a data analysis tool set that teaches the driver how to drive
- Customized the Ubuntu OS
- Developed support for multiple screens
- Developed GPS support
- 2025.08 - Present
MIP Candy
A Candy for Medical Image Processing
- Developed a framework that brings ready-to-use training, inference, and evaluation pipelines together with aesthetics, so users can focus on their experiments, not boilerplate
- Developed a visualization system that helps the user see the complex data
- Accelerated training with a preloading mechanism
- Accelerated sliding window custom CUDA kernels
- 2025.09 - Present
The AIP Project
Automated Iterative Pseudo-labeling
- Proposed an innovative way to make pseudo-labeling more efficient and effective
- 2025.11 - 2025.12
CIV102 Bridge Project
Best Winner Solution for Automated Calculation and Optimization with Extensive Docs
- Top 1 in the cohort and 2nd in the class of 2025
- Developed a complex cross-section composition and solving system
- Developed a bridge solver
- Calculated the optimal cross-section dimensions by turning the design into a COP (Convex Optimization Problem)