CS Events
PhD DefenseControllable and Efficient Generative Models: Methods and Applications in Medical Imaging |
|
||
Monday, December 01, 2025, 03:30pm - 05:30pm |
|||
Speaker: Xiaoxiao He
Location : CoRE 301
Committee:
Prof. Dimitris N. Metaxas (Chair)
Prof. Konstantinos Michmizos
Prof. Hongyi Wang
Event Type: PhD Defense
Abstract: Deep generative models have revolutionized visual computing, achieving unprecedented realism in synthesizing images, videos, and 3D scenes. However, as the quality of generation matures, the critical challenge shifts towards achieving precise, user-driven control and practical, efficient deployment, especially in high-stakes domains like medical imaging. This dissertation addresses two fundamental obstacles hindering this transition. The first is the "controlled generation" in emerging generative architectures, such as discrete diffusion and next-scale visual autoregressive models, whose non-invertible sampling mechanisms prevent the recovery of latent codes necessary for high-fidelity editing. The second is the "practicality gap" in clinical applications, where a confluence of data scarcity (few-shot learning), data heterogeneity (non-IID distributions), stringent privacy constraints (requiring Federated Learning), and communication bottlenecks impede the collaborative development of robust AI models.This dissertation presents a cohesive body of work that bridges these gaps, progressing from foundational algorithmic innovations to their application in real-world medical imaging. The primary contributions are fourfold: (1) DICE, a pioneering inversion framework that enables, for the first time, controllable editing for discrete diffusion and masked generative models; (2) VARIN, the first noise inversion-based editing technique for next-scale visual autoregressive models, which introduces a novel pseudo-inverse for the argmax operator; (3) DMCVR, a morphology-guided diffusion model that solves the clinical problem of 3D cardiac volume reconstruction from sparse MRI by leveraging explicit anatomical conditioning; and (4) a comprehensive suite of Federated Learning frameworks that integrate few-shot learning, dual knowledge distillation, and parameter-efficient fine-tuning (LoRA) to enable robust, private, and communication-efficient medical image analysis on decentralized data. Collectively, this research provides a validated blueprint for building the next generation of visual computing systems that are not only powerful but also controllable, efficient, and trustworthy.
:
Contact Professor Dimitris Metaxas (Chair)
Zoom Link: https://rutgers.zoom.us/j/91697056257?pwd=clvcAbVYdZ7aLEKspNRtAuV9BZ7RNA.1
Subscribe to RSS Feed