UP Lab

members/xiatian.jpeg

Xiatian Zhu, a Senior Lecturer affiliated with the Surrey Institute for People-Centred Artificial Intelligence and the Centre for Vision, Speech and Signal Processing (CVSSP) at the University of Surrey in Guildford, UK, leads the Universal Perception (UP) Lab. Previously a research scientist at Samsung AI Centre, Cambridge, UK, Dr. Zhu holds a Ph.D. from the Queen Mary University of London.

The UP Lab is dedicated to advancing Multimodal Generative AI (GenAI) by modelling diverse data and observations, such as images, videos, audio, text, 3D points, LiDAR, radar, and remote sensing etc. Our applications span a wide range of domains, such as creative media, fashion design, healthcare, climate science, finance, cybersecurity, chemical engineering, sports, gaming, and social issues. Guided by our principle of putting people at the heart of AI, we prioritize the ethical integration of AI capacities with real-world application norms, emphasizing sustainable and responsible AI practices. This ensures our AI initiatives are grounded in community needs and values, aiming to develop transformative technologies that positively impact society and enhance the well-being of individuals and communities.

News

Oct 9, 2024 Area chair for IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
Oct 1, 2024 Welcome Heng Rao on board
Sep 26, 2024 5 NeurIPS papers (2 spotlight, 3 posters) and 1 workshop paper accepted
Jul 8, 2024 2 ECCV papers and 1 workshop paper accepted
Jun 4, 2024 Welcome Zhe Zhang visiting the UP lab
May 6, 2024 Welcome Yuchao Li visiting the UP lab
Mar 11, 2024 Welcome Tian Zhang visiting the UP lab
Mar 8, 2024 1 CVPR paper accepted
Jan 16, 2024 1 ICLR paper on high-resolution 3D object generation accepted
Dec 20, 2023 A paper on high dynamic range (HDR) video reconstruction accepted to IEEE TPAMI
Dec 11, 2023 1 AAAI paper on diffusion generation for sound event detection accepted for oral presentation
Oct 10, 2023 Welcome Wenqing Wang on board
Sep 22, 2023 1 NeurIPS paper and 2 workshop papers accepted
Sep 12, 2023 1 BMVC (oral) paper accepted
Jul 17, 2023 4 ICCV papers and 3 workshop papers accepted

Selected publications

  1. NeurIPS
    AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
    Bhosale, Swapnil, Yang, Haosen, Kanojia, Diptesh, Deng, Jiankang, and Zhu, Xiatian
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems 2024
  2. NeurIPS (spotlight)
    Tetrahedron Splatting for 3D Generation
    Gu, Chun, Yang, Zeyu, Pan, Zijie,  Zhu, Xiatian, and Zhang, Li
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems 2024
  3. NeurIPS
    Recognize Any Regions
    Yang, Haosen, Ma, Chuofan, Wen, Bin, Jiang, Yi, Yuan, Zehuan, and Zhu, Xiatian
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems 2024
  4. NeurIPS (spotlight)
    Motion Forecasting in Continuous Driving
    Song, Nan, Zhang, Bozhou,  Zhu, Xiatian, and Zhang, Li
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems 2024
  5. NeurIPS
    Cloud Object Detector Adaptation by Integrating Different Source Knowledge
    Li, Shuaifeng, Ye, Mao, Zhou, Lihua, Li, Nianxin, Xiao, Siying, Tang, Song, and Zhu, Xiatian
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems 2024
  6. ECCV
    PartCraft: Crafting Creative Objects by Parts
    Ng, Kam Woh,  Zhu, Xiatian, Song, Yi-Zhe, and Xiang, Tao
    In European Conference on Computer Vision 2024
  7. ECCV
    Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
    Tan, Zhi Qin, Isupova, Olga, Carneiro, Gustavo,  Zhu, Xiatian, and Li, Yunpeng
    In European Conference on Computer Vision 2024
  8. ICLR
    Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
    Pan, Zijie, Lu, Jiachen,  Zhu, Xiatian, and Zhang, Li
    In International Conference on Learning Representations 2024
  9. CVPR
    Source-Free Domain Adaptation with Frozen Multimodal Foundation Model
    Tang, Song, Su, Wenxin, Ye, Mao, and Zhu, Xiatian
    In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2024
  10. AAAI (oral)
    DiffSED: Sound Event Detection with Denoising Diffusion
    Bhosale, Swapnil, Nag, Sauradip, Kanojia, Diptesh, Deng, Jiankang, and Zhu, Xiatian
    In AAAI Conference on Artificial Intelligence 2024
  11. TPAMI
    Compressed-SDR to HDR Video Reconstruction
    Wang, Hu, Ye, Mao,  Zhu, Xiatian, Li, Shuai, Li, Xue, and Zhu, Ce
    IEEE Transactions on Pattern Analysis and Machine Intelligence 2024