Publications


Books


[Springer Nature, 2020 Jun 29] “Deep Reinforcement Learning: Fundamentals, Research and Applications”, Springer Nature, 2020 Jun 29, (Electronic Edition 250,000 downloads; selectd to Annual High-Impact Publications in Computer Science by Chinese researchers).
H. Dong, Z. Ding, S. Zhangs, eds.



Journals & Conferences


[ICLR 2025, Spotlight (Top 5.1%)] Co3Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Xingqun Qi, Yatian Wang, Hengyuan Zhang, Jiahao Pan, Wei Xue, Shanghang Zhang, Wenhan Luo, Qifeng Liu, Yike Guo

[ICLR 2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li

[ICLR 2025] MAVIS: Mathematical Visual Instruction Tuning
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Shanghang Zhang, Peng Gao, Hongsheng Li

[ICRA 2025] SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Jianing Li, Hao Wang, Gu Chenyang, Ming Lu, Wenzhao Zheng, LI DU, Shanghang Zhang

[ICRA 2025] High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior
Nan Huang, Ting Zhang, Yuhui Yuan, Dong Chen, Shanghang Zhang

[ICRA 2025] DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
Ma, Ji, Dai, Ryan, Mu, Yao, Wu, Pengying, Wang, Hao, Chi, Xiaowei, Fei, Yang, Zhang, Shanghang, Liu, Chang

[AAAI 2025] 7

[AAAI 2025] Subgraph Aggregation for Out-of-Distribution Generalization on Graphs
Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang

[AAAI 2025] LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding
Senqiao Yang, Jiaming Liu, Renrui Zhang, Mingjie Pan, Ziyu Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Hongsheng Li, Yandong Guo, Shanghang Zhang

[AAAI 2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
Yueru Jia, Yuhui Yuan, Aosong Cheng, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang

[Light: Science & Applications (Nature 子刊)] Electromagnetic metamaterial agent
Lianlin Li, Shengguo Hu, Mingyi Li, Jiawen Xu, Hongrui Zhang, Shanghang Zhang, Tie Jun Cui, and Philipp del Hougne

[NeurIPS 2024] Unveiling the Tapestry of Consistency in Large Vision-Language Models
Yuan Zhang, Fei xiao, Tao Huang, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang*, Haoyuan Guo*

[Advances in Neural Information Processing Systems (NeurIPS), 2024] RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation
Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Pengju An, Xiaoqi Li, Kaichen Zhou, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang*

[NeurIPS 2024] Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wei Xue, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo

[EMNLP 2024] Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao

[EMNLP 2024] Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen, Jiaxin Ge, Tianjun Zhang, Jiaming Liu, Shanghang Zhang

[ECCV 2024] LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model
Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang

[ECCV 2024] I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
Xiaobao Wei, Jiajun Cao, Yizhu Jin, Ming Lu, Guangyu Wang, Shanghang Zhang

[ACM MM 2024] VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang*

[Nature Methods (2024)] Multimodal Large Language Model for Biological Image Analysis.
Shanghang Zhang*, Gaole Dai, Tiejun Huang, and Jianxu Chen*.

[ICML 2024] Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting
Anthony Chen, Huanrui Yang, Yulu Gan, Denis A Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang*

[ICML 2024] Compositional Few-Shot Class-Incremental Learning
Yixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li and Ruixuan Li

[ICML 2024] VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model
Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang*, Chang Liu*

[Nature Scientific Data (2024)] A multimodal physiological dataset for driving behaviour analysis
Xiaoming Tao, Dingcheng Gao, Wenqi Zhang, Tianqi Liu, Shanghang Zhang, Bing Du, Yanjun Qin

[IEEE Journal of Biomedical and Health Informatics (JBHI) (2024)] Exploring generalizable distillation for efficient medical image segmentation
Qi, Xingqun, Zhuojie Wu, Wenxuan Zou, Min Ren, Yifan Gao, Muyi Sun, Shanghang Zhang, Caifeng Shan, and Zhenan Sun

[CVPR 2024] FreeKD: Knowledge Distillation via Semantic Frequency Prompt
Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang*

[CVPR 2024] PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought
Junyi Yao, Yijiang Liu, Zhen Dong, Mingfei Guo, Helan Hu, Kurt Keutzer, Li Du, Daquan Zhou, Shanghang Zhang*

[CVPR 2024] Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang*

[CVPR 2024] NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang*

[CVPR 2024] Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo

[CVPR 2024] Cloud-Device Collaborative Learning for Multimodal Large Language Models
Guanqun Wang, Jiaming Liu, Chenxuan Li, Yuan Zhang, Ma Junpeng, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang*

[CVPR 2024] Gradient-based Parameter Selection for Efficient Fine-Tuning
Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang*

[IEEE International Conference on Multimedia & Expo (ICME), Oral presentation, 2024] Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework
Rui Ma, Mengxi Guo, Peidong Jia, Chenxuan Li, Yi Hou, Yuan Li, Xiaodong Xie, Shanghang Zhang*

[IEEE International Conference on Multimedia & Expo (ICME), 2024] Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu

[IEEE International Conference on Multimedia & Expo (ICME), Oral presentation, 2024] VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification
Dongmei Zhang, Shanghang Zhang, Ray Zhang, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie

[Nature Methods (2024)] EfficientBioAI: making bioimaging AI models efficient in energy and latency
Zhou, Yu, Jiajun Cao, Justin Sonneck, Sweta Banerjee, Stefanie Dörr, Anika Grüneboom, Kristina Lorenz, Shanghang Zhang, and Jianxu Chen

[International Conference on Robotics and Automation (ICRA), 2024] Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer
Jiaming Liu, Qizhe Zhang, Xiaoqi Li, Jianing Li, Guanqun Wang, Ming Lu, Tiejun Huang, Shanghang Zhang*

[International Conference on Robotics and Automation (ICRA), 2024] Multi-geometric Space Alignments for Domain Adaptive Multi-view 3D Object Detection
Jiaming Liu, Rongyu Zhang, Xiaoqi Li, Xiaowei Chi, Zehui Chen, Ming Lu, Yandong Guo, Shanghang Zhang*

[International Conference on Robotics and Automation (ICRA), 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision
Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Bing Wang, Hongwei Xie, Li Liu, Shanghang Zhang*

[ICRA 2024] Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation
Jiayi Ni, Senqiao Yang, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Ran Xu, Zehui Chen, Yi Liu, Shanghang Zhang*

[The Twelfth International Conference on Learning Representations (ICLR), 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu, Senqiao Yang, Peidong Jia, Renrui Zhang, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang*

[The Twelfth International Conference on Learning Representations (ICLR), 2024] ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
Chi-Min Chan, Weize Chen, Yusheng Su, Jianxuan Yu, Wei Xue, Shanghang Zhang, Jie Fu, Zhiyuan Liu

[Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024] FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Zhang D, Li C, Zhang R, Xie S, Xue W, Xie X, Zhang S

[Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024] Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
Yang S, Wu J, Liu J, Li X, Zhang Q, Pan M, Gan Y, Chen Z, Zhang S

[Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024] Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-wise Linear Modulation
Zhang R, Luo Y, Liu J, Yang H, Dong Z, Gudovskiy D, Okuno T, Nakata Y, Keutzer K, Du Y, Zhang S

[Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024] Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection
Gao H, Chen Z, Chen Z, Chen L, Liu J, Zhang S, Zhao F

[IEEE Transactions on Neural Networks and Learning Systems (TNNLS IF 10) 2023] Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
Qi X; Sun M; Wang Z; Liu J; Li Q; Zhao F; Zhang S; Shan C

[Artificial Intelligence. 2023 May 1;318:103886] Expanding the prediction capacity in long sequence time-series forecasting
Zhou H, Li J, Zhang S, Zhang S, Yan M, Xiong H

[NeurIPS 2023] PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection
Zhou Q, Li W, Jiang L, Wang G, Zhou G, Zhang S, Zhao H

[WACV 2024] TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning
Li S, Ning X, Zhang S, Guo L, Zhao T, Yang H, Wang Y

[IEEE Transactions on Intelligent Vehicles] BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection
Li J, Lu M, Liu J, Guo Y, Du Y, Du L, Zhang S*

[ICCV 2023] QD-BEV: Quantization-aware View-guided Distillation for Multi-view 3D Object Detection
Zhang Y, Dong Z, Yang H, Lu M, Tseng CC, Du Y, Keutzer K, Du L, Zhang S*

[ICCV 2023] Q-diffusion: Quantizing diffusion models
Li X, Liu Y, Lian L, Yang H, Dong Z, Kang D, Zhang S, Keutzer K

[ICCV 2023] Pointclip v2: Prompting clip and gpt for powerful 3D open-world learning
Zhu X, Zhang R, He B, Guo Z, Zeng Z, Qin Z, Zhang S, Gao P

[2023 60th ACM/IEEE Design Automation Conference (DAC)] Csq: Growing mixed-precision quantization scheme with bi-level continuous sparsification
Xiao L, Yang H, Dong Z, Keutzer K, Du L, Zhang S*

[ICML 2023] Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks
Chu X, Jin Y, Wang X, Zhang S, Wang Y, Zhu W, Mei H.

[MICCAI 2023 Oct 1] DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images
Pan M, Gan Y, Zhou F, Liu J, Zhang Y, Wang A, Zhang S*, Li D

[ICASSP 2023 Jun 4] BadRes: Reveal the Backdoors Through Residual Connection
He M, Chen T, Zhou H, Zhang S, Li J

[Remote Sensing] P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification
Wang G, Chen H, Chen L, Zhuang Y, Zhang S, Zhang T, Dong H, Gao P

[CVPR 2023] BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks
Chi X, Liu J, Lu M, Zhang R, Wang Z, Guo Y, Zhang S*

[CVPR 2023] Improving Generalization of Meta-Learning With Inverted Regularization at Inner-Level
Wang L, Zhou S, Zhang S*, Chu X, Chang H, Zhu W*

[CVPR 2023] Open-vocabulary point-cloud object detection without 3D annotation
Lu Y, Xu C, Wei X, Xie X, Tomizuka M, Keutzer K, Zhang S*

[CVPR 2023] Pimae: Point cloud and image interactive masked autoencoders for 3D object detection
Chen A, Zhang K, Zhang R, Wang Z, Lu Y, Guo Y, Zhang S*

[CVPR 2023] Cloud-device collaborative adaptation to continual changing environments in the real-world
Gan Y, Pan M, Zhang R, Ling Z, Zhao L, Liu J, Zhang S*

[CVPR 2023] NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
Liu Y, Yang H, Dong Z, Keutzer K, Du L, Zhang S*

[CVPR 2023] MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID
Gu J, Wang K, Luo H, Chen C, Jiang W, Fang Y, Zhang S, You Y, Zhao J

[CVPR 2023] Annealing-Based Label-Transfer Learning for Open World Object Detection
Ma Y, Li H, Zhang Z, Guo J, Zhang S, Gong R, Liu X

[NOSSDAV, CCF B 2023] RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery
Zhang R, Du L, Liu J, Song C, Wang F, Li X, Lu M, Guo Y, Zhang S.*

[IEEE Transactions on Circuits and Systems for Video Technology, 2023 Mar 1] Frame-Recurrent Video Crowd Counting
Hou Y, Zhang S*, Ma R, Jia H, Xie X

[NeurIPS 2022] Outlier suppression: Pushing the limit of low-bit transformer language models
Wei X, Zhang Y, Zhang X, Gong R, Zhang S, Zhang Q, Yu F, Liu X

[NeurIPS 2022] Jump Self-attention: Capturing High-order Statistics in Transformers
Zhou H, Xiao S, Zhang S, Peng J, Zhang S, Li J

[NeurIPS 2022] Margin-based few-shot class-incremental learning with class-level overfitting mitigation
Zou Y, Zhang S, Li Y, Li R

[IEEE Transactions on Cognitive and Developmental Systems] Learning deep features for robotic inference from physical interactions
Dehban A, Zhang S, Cauli N, Jamone L, Santos-Victor J

[17th European Conference on Computer Vision (ECCV) 2022] MTTrans: Cross-Domain Object Detection with Mean Teacher Transformer
J. Yu, J. Liu, X.Wei, H. Zhou, Y. Nakata, D. Gudovskiy, T. Okuno, J. Li, K. Keutzer, S. Zhang*

[ECCV 2022] Efficient Meta-Tuning for Content-aware Neural Video Delivery
X. Li, J. Liu, S.Wang, C. Lyu, M. Lu, Y. Chen, A. Yao, Y. Guo, S. Zhang*

[ICML 2022] DNA: Domain generalization with diversified neural averaging
Chu X, Jin Y, Zhu W, Wang Y, Wang X, Zhang S, Mei H

[IJCAI 2022] Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data
Li T, Chen X, Dong Z, Yu W, Yan Y, Keutzer K, Zhang S*

[IEEE Transactions on Multimedia (TMM), 2022] Active Gradual Domain Adaptation: Dataset and Approach
S. Zhou, L. Wang, S. Zhang*, Z.Wang*, W.Zhu*

[CVPR 2022] Delving deep into the generalization of vision transformers under distribution shifts.
C. Zhang#, M. Zhang#, S. Zhang#, et al.

[ICRA 2022] Prototypical Supervised Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation
M. Liu, Q. Zhou, H. Zhao, L. Du, Y. Du, J. Li, K. Keutzer, S. Zhang*

[AISTATS, 2022] Online Continual Adaptation with Active Self-Training
S. Zhou, H. Zhao, S. Zhang*, L.Wang, H. Chang, Z. Wang, W. Zhu*

[WACV 2022] Self-supervised pretraining improves self-supervised pretraining
Reed CJ, Yue X, Nrusimha A, Ebrahimi S, Vijaykumar V, Mao R, Li B, Zhang S, Guillory D, Metzger S, Keutzer K

[ICCV 2021] Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency
Z. Luo, Z. Cai, C. Zhou, G. Zhang, H. Zhao, S. Yi, S. Lu, H. Li, S. Zhang, Z. Liu

[ICCV 2021] Contrastive Multimodal Fusion with TupleInfoNCE
Y. Liu, Q. Fan, S. Zhang, H. Dong, T. Funkhouser, L. Yi

[IEEE Transactions on Multimedia (TMM), 2021] Caching in Dynamic Environments: a Near-optimal Online Learning Approach
S. Zhou, Z. Wang, C. Hu, Y. Mao, H. Yan, C.Wu, S. Zhang*, W. Zhu*

[ACM Multimedia (ACM MM), 2021] Revisiting Mid-Level Patterns for Distant-Domain Few-Shot Recognition
Y. Zou, S. Zhang, J. Yu, Y. Tian, J. Moura

[ACM Multimedia (ACM MM), 2021] Annotation-Efficient Untrimmed Video Action Recognition
Y. Zou, S. Zhang, G. Chen, Y. Tian, K. Keutzer, J. Moura

[ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021] Triplet Attention: Rethinking the similarity in Transformers
H. Zhou, J. Li, J. Peng, S. Zhang, S. Zhang

[CVPR 2021] Learning Invariant Representations and Risks for Semi-supervised Domain Adaptation
B. Li#, Y. Wang#, S. Zhang#, D. Li, T. Darrell, K. Keutzer, H. Zhao

[CVPR 2021] Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation
X. Yue, Z. Zheng, S. Zhang, Y. Gao, T. Darrell, K. Keutzer, AL. Vincentelli

[ICLR 2021] Decoupling Global and Local Representations via Invertible Generative Flows
X Ma, X Kong, S Zhang, E Hovy

[ICASSP, 2021] Cross-Domain Sentiment Classification With Contrastive Learning and Mutual Information Maximization
T. Li, X. Chen, S. Zhang*, Z. Dong*, K. Keutzer

[Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021. AAAI Best Paper Award.] Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting
H. Zhou, S. Zhang, J. Peng, S. Zhang, J. Li, H. Xiong, W. Zhang

[IEEE Transactions on Neural Networks and Learning Systems (TNNLS IF 10) 2020] A Review of Single-Source Deep Unsupervised Visual Domain Adaptation
S. Zhao, X. Yue*, S. Zhang*, B. Li, H. Zhao, B. Wu, R. Krishna, JE. Gonzalez, AL. Vincentelli, SA. Seshia, K. Keutzer

[ACM MM 2020, Oral presentation] Compositional Few-Shot Recognition with Primitive Discovery and Enhancing
Y. Zou, S. Zhang, K. Chen, Y. Wang, J. Moura, Y. Tian

[ECCV 2020, Oral presentation (Top 2%)] TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning
X. Sun, Y. Xu, P. Cao, Y. Kong*, L. Hu, S. Zhang*, Y.Wang

[16th European Conference on Computer Vision (ECCV), 2020] Instance Adaptive Self-Training for Unsupervised Domain Adaptation
K. Mei, C. Zhu, J. Zou, S. Zhang

[IJCAI 2020] Generalized Zero-shot Text Classification for ICD Coding
C. Song, S. Zhang, N. Sadoughi, P. Xie, and E. Xing

[AAAI 2020, Oral presentation (Top 3%)] Multi-source Distilling Domain Adaptation
S. Zhao#, G. Wang#, S. Zhang#, Y. Gu, Y. Li, Z. Song, P. Xu, R. Hu, H. Chai, K. Keutzer

[Knowledge-Based Systems (IF 5.921), 2020] Modeling relation paths for knowledge base completion via joint adversarial training
C. Li, X. Peng, S. Zhang, H. Peng, P. Yu, M. He, L. Du, L. Wang

[NeurIPS 2019] Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning
J. Ni, S. Zhang, H, Xie

[Advances in Neural Information Processing Systems (NeurIPS), 2019] MaCow: Masked Convolutional Generative Flow
X. Ma, X. Kong, S. Zhang, E. Hovy

[NeurIPS 2018] Adversarial Multiple Source Domain Adaptation
H. Zhao#, S. Zhang#, G. Wu, J. Costeira, J. Moura, G. J. Gordon

[CVPR 2018] Learning to Understand Image Blur
S. Zhang, X. Shen, Z. Lin, R. Mech, J. Costeira, J. Moura

[ICLR, invited to workshop, 2018] Multiple Source Domain Adaptation with Adversarial Learning
H. Zhao#, S. Zhang#, G. Wu, J. Costeira, J. Moura, G. J. Gordon

[IEEE International Conference on Communications (ICC), 2018] A Deep Learning Approach to IoT Authentication
R. Das, A. Gadre, S. Zhang, S. Kumar, and J. Moura

[ICCV 2017] FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras
S. Zhang#, G. Wu#, J. Costeira, J. Moura

[CVPR 2017] Understanding Traffic Density from Large-Scale Web Camera Data
S. Zhang, G. Wu, J. Costeira, J. Moura

[Advances in Neural Information Processing Systems (NIPS) Workshop, 2016] Block-Coordinate Frank-Wolfe Optimization for Counting Objects in Images
F. Xia#, S. Zhang#

[IEEE International Conference on Image Processing (ICIP), 2015] Traffic Flow from a Low Frame Rate City Camera
E.Toropov, L. Gui, S. Zhang, S. Kottur, J. M. F. Moura

[International Test Conference (ITC), 2014] Bayesian Model Fusion: Enabling Test Cost Reduction of Analog/RF Circuits via Wafer-level Spatial Variation Modeling
S. Zhang, X. Li, R.D. Blanton, J. Silva, J. M. Carulli, K. M. Butler

[IEEE Transactions on Multimedia (TMM IF 6.051), 15.8 (2013)] On a Highly Efficient RDO-based Mode Decision Pipeline Design
C. Zhu, H. Jia, S. Zhang, X. Huang, X. Xie and W. Gao

[The IEEE International Symposium on Circuits and Systems (ISCAS), 2013] A High-throughput Low-latency Arithmetic Encoder Design for HDTV
Y. Li, S. Zhang, H. Jia, X. Xie, and W. Gao

[Visual Communications and Image Processing (VCIP), 2012 IEEE] An efficient foreground-based surveillance video coding scheme in low bit-rate compression
S Zhang, K Wei, H Jia, X Xie, W Gao

[IEEE International Conference on Multimedia & Expo (ICME), 2012] An Optimized Hardware Video Encoder For AVS With Level C+ Data Reuse Scheme For Motion Estimation
K. Wei, R. Zhou, S. Zhang, H. Jia, D. Xie, and W. Gao

© Copyright 2023 HMI Lab - All rights reserved