Mohan Zhou

Ph.D. studentmhzhou99[at]outlook[dot]com
[Google Scholar][GitHub][Kaggle][DBLP]

About

Mohan Zhou is currently a Ph.D. student at Harbin Institute of Technology, China, under the supervision of Prof. Tiejun Zhao. Before that, he received his B.Eng. degree also from Harbin Institute of Technology in 2021. His current research interest focuses on video generation and multimodality models.

Publications

  • Learning and Evaluating Human Preferences for Conversational Head Generation
    Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei
    Proceedings of the 31th ACM International Conference on Multimedia (ACM MM), 2023
  • Visual-Aware Text-to-Speech Synthesis
    Top 3% Paper Recognition
    Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Oral, 2023
  • Augmentation Pathways Network for Visual Recognition
    Yalong Bai, Mohan Zhou, Yuxiang Chen, Wei Zhang, Bowen Zhou, Tao Mei
    Transactions on Pattern Analysis and Machine Intelligence (T-PAMI)
  • Responsive Listening Head Generation: A Benchmark Dataset and Baseline
    Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei
    European Conference on Computer Vision (ECCV), 2022
  • Look-into-object: Self-supervised structure modeling for object recognition
    Mohan Zhou, Yalong Bai, Wei Zhang, Tiejun Zhao, Tao Mei
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020

Preprints

  • STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
    Xiaoxiao Ma*, Mohan Zhou*, Tao Liang, Yalong Bai, Tiejun Zhao, Huaian Chen, Yi Jin
  • StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
    Mohan Zhou, Yalong Bai, Qing Yang, Tiejun Zhao
  • Interactive Conversational Head Generation
    Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao

Honors

  • First Place in AliProducts Challenge: Large-scale Product Recognition at CVPR 2020
  • Second Place in iMet: Fine-grained Attributes Recognition Challenge at CVPR 2020
  • First Place in iMaterialist Challenge on Product Recognition at CVPR 2019
  • First Place in Fieldguide Challenge: Moths and Butterflies at CVPR 2019
  • Second Place in iFood Challenge at CVPR 2019
  • Second-class People's Scholarship x 2
  • Outstanding Graduates

Services

Area Chairs

  • ACM Multimedia 2023 Grand Challenges
  • ACM Multimedia 2022 Grand Challenges

Organizing Committees

  • 1st Conversational Head Generation Challenge on ACM Multimedia 2022
  • 2nd Conversational Head Generation Challenge on ACM Multimedia 2023

Conference Reviewers

  • CAAI International Conference on Artificial Intelligence (CICAI), 2022
  • ACM International Conference on Multimedia (ACM MM), 2021

Journal Reviewers

  • IEEE Transactions on Image Processing (TIP)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

Skills

  • Programming Language: Python, C
  • Deep Learning Framework: PyTorch
  • Development & Operations: Kubernetes, Docker, Git
  • Others: basic frontend & backend development