Program

CVM 2022 Conference Programme

Thursday, April 7, 2022
08:30 - 18:00 Registration
Jittor Course on Visual Media Learning
15:00 - 16:00 Jittor: Fundamentals and Lastest Progress
Dun Liang (Tsinghua University)
16:00 - 17:00 Attention in Computer Vision
Meng-Hao Guo (Tsinghua University)
17:00 - 18:00 Visual Media Editing and Generation with Neural Rendering
Lin Gao (Institute of Computing Technology, Chinese Academy of Sciences)
 Day One Friday, April 8, 2022
08:00 - 08:30 Registration
08:30 - 09:00 Opening (Chair: Shi-Min Hu)
Keynote talk I (Chair: Seungyong Lee)
09:00 - 09:40 Mind the gap: Learning 3D representation from 2D image collections
Xin Tong (Microsoft Research Asia)
Industrial Session (Chair: Hui Zhang)
09:40 - 10:00 Modern Game Engine Architecture Introduction
Lei Su (CTO of Booming Technology)
10:00 - 10:20 Procedural Content Generation for Borderless Creation
Haozhi Huang (Technical Partner of XVERSE Technology)
10:20 - 10:40 Coffee break
Session 1 Image Synthesis (Chair: Yong-Jin Liu)
10:40 - 10:55 STATE: Learning Structure and Texture Representations for Novel View Synthesis
Xinyi Jing, Qiao Feng, Yu-Kun Lai, Jinsong Zhang, Yuanqiang Yu, Kun Li
10:55 - 11:10 A Comparative Study of CNN- and Transformer-based Neural Style Transfer
Huapeng Wei, Yingying Deng, Fan Tang, Xingjia Pan, Weiming Dong
11:10 - 11:25 StrokeGAN Painter: Learning to Paint Artworks Using Stroke-Style Generative Adversarial Networks
Qian Wang, Cai Guo, Hong-Ning Dai, Ping Li
11:25 - 11:40 Unsupervised Image Translation with Distributional Semantics Awareness
Zhexi Peng, He Wang, Yanlin Weng, Yin Yang, Tianjia Shao
Session 2 Rendering (Chair: Kun Xu)
11:40 - 11:55 A Psychoacoustic Quality Criterion for Path-Traced Sound Propagation
Chunxiao Cao, Zili An, Zhong Ren, Dinesh Manocha, Kun Zhou
11:55 - 12:10 Neural Temporal Denoising for Indirect Illumination
Yan Zeng, Lu Wang, Yanning Xu, Xiangxu Meng
12:10 - 14:00 Lunch
Session 3 Geometry & Point Cloud (Chair: Yang Liu)
14:00 - 14:15 Out-of-core Outlier Removal for Large-scale Indoor Point Clouds
Linlin Ge, Jieqing Feng
14:15 - 14:30 Towards Uniform Point Distribution in Feature-Preserving Point Cloud Filtering
Shuanjun Chen, Jinxi Wang, Wei Pan, Shang Gao, Meili Wang, Xuequan Lu
14:30 - 14:45 Jacobi--PIA Algorithm for Bi-Cubic B-Spline Interpolation Surfaces
Chengzhi Liu, Juncheng Li, Lijuan Hu
Session 4 Image Processing (Chair: Lei Zhang)
14:45 - 15:00 Light Field Super-Resolution Using Complementary-View Feature Attention
Wei Zhang, Wei Ke, Da Yang, Hao Sheng, Zhang Xiong
15:00 - 15:15 Autocomplete Repetitive Stroking with Image Guidance
Yilan Chen, Kin Chung Kwan, Hongbo Fu
15:15 - 15:30 Polygonal Finite Element Based Content-Aware Image Warping
Juan Cao, Xiaoyi Zhang, Jiannan Huang, Yongjie Jessica Zhang
15:30 - 15:45 Coffee break
Session 5 Virtual Reality (Chair: Miao Wang)
15:45 - 16:00 ARSlice: Head-Mounted Display Augmented with Dynamic Tracking and Projection
Yu-Ping Wang, Sen-Wei Xie, Lihui Wang, Hongjin Xu, Satoshi Tabata, Masatoshi Ishikawa
16:00 - 16:15 Adaptive Optimization Algorithm for Resetting Techniques in Obstacle-ridden Environments (invited TVCG paper)
Song-Hai Zhang, Chia-Hao Chen, Zheng Fu, Yongliang Yang, Shi-Min Hu
Session 6 Shape Analysis (Chair: Lin Gao)
16:15 - 16:30 Learning-based Intrinsic Reflectional Symmetry Detection
Yi-Ling Qiao, Gao Lin, Shu-Zhi Liu, Ligang Liu, Yu-Kun Lai, Xilin Chen
16:30 - 16:45 Deep Functional Maps for Simultaneously Computing Direct and Symmetric Correspondences of 3D Shapes
Hui Wang, Bitao Ma, Junjie Cao, Xiuping Liu, Hui Huang
16:45 - 17:00 TAD-Net: tooth axis detection network based on rotation transformation encoding
Yeying Fan, Qian Ma, Guangshun Wei, Zhiming Cui, Yuanfeng Zhou, Wenping Wang
Poster Session
17:00 - 18:00 Fuzzy-based Indoor Scene Modeling with Differentiated Examples
Qiang Fu, Shuhan He, Zhigang Deng, Xueming Li, Hongbo Fu

Deep Unfolding Multi-scale Regularizer Network for Image Denoising
Jingzhao Xu, Mengke Yuan, Dongming Yan, Tieru Wu

High-Quality Unsupervised Image Denoising via Multi-Scale Deep Image Prior
Qing Zhang, Yongwei Nie, Lei Zhu, Wei-Shi Zheng

Shape Embedding and Retrieval in Multi-Flow Deformation
Baiqiang Leng, Jingwei Huang, Guanlin Shen, Bin Wang

Deep Multi-Task Learning based Fingertip Detection
Ruize Han, Jiewen Zhao, Liang Wan

Shape-aware Stroke Segmentation for Calligraphic Characters
Zibo Zhang, Xueting Liu, Chengze Li, Huisi Wu, Zhenkun Wen

Attribute Consistency Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation
Fengjiang Liu, Li Yao

Point Cloud Completion on Structured Feature Map with Feedback Network
Zejia Su, Haibin Huang, Chongyang Ma, Hui Huang, Ruizhen Hu

Multi-foreground objects segmentation based on RGB-D Image
Yan Li, Di Zhu, Hui Chen, Haikun Li, Changhe Tu

Adaptive Content-aware Correction for Wide-angle Portrait Photos
Juan Cao, Binyan Lin, Zhonggui Chen

TransLoc3D: point cloud based large-scale place recognition using adaptive receptive fields
Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang

Sphere Face Model:A 3D Morphable Model with Hypersphere Manifold Latent Space
Diqiong Jiang, Yiwei Jin, Fang-Lue Zhang, Yun Zhang, Zhe Zhu, Ruofeng Tong, Min Tang
18:00 - 20:00 Conference Banquet
Day Two Saturday, April 9, 2022
Keynote talk II (Chair: Shi-Min Hu)
09:00 - 09:40 Data-efficient GAN Training
Jun-Yan Zhu (Carnegie Mellon University)
Session 7 Attention (Chair: Weiming Dong)
09:40 - 09:55 Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo, Tian-Xing Xu, Jiang-Jiang Liu, Zheng-Ning Liu, Peng-Tao Jiang, Tai-Jiang Mu, Song-Hai Zhang, Ralph Martin, Ming-Ming Cheng, Shi-Min Hu
09:55 - 10:10 Self-supervised Coarse-to-fine Monocular Depth Estimation Using Lightweight Attention Module
Yuanzhen Li, Fei Luo, Chunxia Xiao
10:10 - 10:25 Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation
Yang Zhang, Yang Yang, Chenyun Xiong, Guodong Sun, Yanwen Guo
10:25 - 10:40 Coffee break
Session 8 Meshes & 3D Printing (Chair: Lin Gao)
10:40 - 10:55 Patch-based mesh inpainting via low rank recovery
Xiaoqun Wu, Xiaoyun Lin, Nan Li, Haisheng Li
10:55 - 11:10 Untangling All-Hex Meshes via Adaptive Boundary Optimization
Qing Huang, Wen-Xiang Zhang, Qi Wang, Ligang Liu, Xiao-Ming Fu
11:10 - 11:25 3D Printed Hair Modeling from Strand-level Hairstyles
Han Chen, Minghai Chen, Lin Lu
Session 9 Understanding (Chair: Shi-Sheng Huang)
11:25 - 11:40 Element-Arrangement Context Network for Facade Parsing
Yan Tao, Yiteng Zhang, Xuejin Chen
11:40 - 11:55 Probability-based channel pruning for depthwise separable convolutional networks
Hanli Zhao, Kaijie Shi, Xiaogang Jin, Mingliang Xu, Hui Huang, Wanglong Lu, Ying Liu
11:55 - 12:10 Learn Robust Pedestrian Representation within Minimal Modality Discrepancy for Visible-Infrared Person Re-Identification
Yujie Liu, Wenbin Shao, Xiaorui Sun
12:10 - 14:00 Lunch
Keynote talk III (Chair: Hongbo Fu)
14:00 - 14:40 Human-in-the-Loop Preferential Bayesian Optimization for Visual Design
Yuki Koyama (AIST)
Session 10 Face (Chair: Tianjia Shao)
14:40 - 14:55 3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos (invited TVCG paper)
Zipeng Ye, Mengfei Xia, Yanan Sun, Ran Yi, Minjing Yu, Juyong Zhang, Yu-Kun Lai, Yong-Jin Liu
14:55 - 15:10 Towards Harmonized Regional Style Transfer and Manipulation for Facial Images
Cong Wang, Fan Tang, Yong Zhang, Weiming Dong, Tieru Wu
15:10 - 15:25 Learning Physically-based Material and Lighting Decompositionsfor Face Editing
Qian Zhang, Vikas Thamizharasan, James Tompkin
15:25 - 15:40 Coffee break
Session 11 Simulation & Visualization (Chair: Bo Ren)
15:40 - 15:55 Simulating Fractures with Bonded Discrete Element Method (invited TVCG paper)
Jia-Ming Lu, Chenfeng Li, Geng-Chen Cao, Shi-Min Hu
15:55 - 16:10 O3NJ Trees: Optimally Ordered Orthogonal Neighbor Joining Trees for Hierarchical Cluster Analysis
Tong Ge, Yunhai Wang, Michael Sedlmair, Zhanglin Cheng, Ying Zhao, Xin Liu, Baoquan Chen, Oliver Deussen
Session 12 Tracking & SLAM (Chair: Paul Rosin)
16:10 - 16:25 Local Homography Estimation on User-specified Textureless Regions
Zheng Chen, Xiaonan Fang, Songhai Zhang
16:25 - 16:40 CGTracker: Center Graph Network for One-Stage Multi-Object Detection and Tracking
Xin Feng, Haoming Wu, Yihao Yin, Yongbo Li, Libin Lan
16:40 - 16:55 ObjectFusion: Accurate Object-level SLAM with Neural Object Priors
Zi-Xin Zou, Shi-Sheng Huang, Tai-Jiang Mu, Yu-Ping Wang
16:55 - 17:25 Closing Session

Keynote Speakers

Xin Tong, Microsoft Research Asia

Title:

Mind the gap: Learning 3D representation from 2D image collections

Abstract:

3D deep learning has demonstrated its advantage in many 3D graphics applications. However, compared to images and videos that can be easily acquired from real world, modeling or capturing 3D dataset (e.g. shapes and material maps) is still a difficult task, which limits the scale of 3D dataset available in 3D deep learning.
In this talk, I will introduce our explorations in the last several years on how to utilize 2D image collections in 3D deep learning. By bridging the gap between 2D images and 3D representations, we believe that this method will release the power of deep learning and enable new solutions for 3D content creation.

Speaker's Biography:

Dr. Xin Tong now is a partner research manager of Microsoft Research Asia (MSRA) and the leader of graphics group in MSRA. His research interests cover many topics in computer graphics and computer vision, including appearance modeling and rendering, texture synthesis, light transport analysis, 3D deep learning, performance capturing and facial animation, as well as graphics system. Xin has published more than 150 papers in top computer graphics and vision journals and conferences, including 55 SIGGRAPH/TOG papers. He has served as the associate editor of computer graphics journals (ACM TOG, IEEE TVCG, CGF) and paper committee members of ACM SIGGRAPH/SIGGRAPH ASIA, Eurographics, and Pacific Graphics. He is the associate editor of IEEE CG&A, CVMJ, and visual informatics. Xin obtained his Ph.D. degree in Computer Graphics from Tsinghua University in 1999 and his B.S. Degree and Master Degree in Computer Science from Zhejiang University in 1993 and 1996 respectively.


Jun-Yan Zhu, Carnegie Mellon University

Title:

Data-efficient GAN Training

Abstract:

The power and promise of deep generative models such as GANs lie in their ability to synthesize endless realistic, diverse, and novel visual content. Unfortunately, the creation and deployment of these large-scale GANs demand high-performance computing platforms and large-scale annotated datasets. Commonly used datasets such as ImageNet and LSUN require human annotation of millions of images. In this talk, I will present two data-efficient GAN training methods: differentiable data augmentation and ensembling off-the-shelf computer vision models. Collectively, these techniques allow us to learn a high-quality GAN model with as few as one hundred photos. If time permits, I will also discuss the issues of existing GANs evaluation metrics as well as potential fixes.

Speaker's Biography:

Jun-Yan Zhu is an Assistant Professor at the School of Computer Science of Carnegie Mellon University. Prior to joining CMU, he was a Research Scientist at Adobe Research and a postdoctoral researcher at MIT CSAIL. He obtained his Ph.D. from UC Berkeley and his B.E. from Tsinghua University. He studies computer vision, computer graphics, computational photography, and machine learning. He is the recipient of the Facebook Fellowship, ACM SIGGRAPH Outstanding Doctoral Dissertation Award, and UC Berkeley EECS David J. Sakrison Memorial Prize for outstanding doctoral research. His co-authored work has received the NVIDIA Pioneer Research Award, SIGGRAPH 2019 Real-time Live Best of Show Award and Audience Choice Award, and The 100 Greatest Innovations of 2019 by Popular Science.


Yuki Koyama, National Institute of Advanced Industrial Science and Technology (AIST)

Title:

Human-in-the-Loop Preferential Bayesian Optimization for Visual Design

Abstract:

Visual design often involves searching for the best parameter set that yields the best-looking design. However, such tasks are difficult to solve with typical optimization algorithms since the objective function is based on subjective evaluation and thus requires special treatment. This talk will introduce preferential Bayesian optimization (PBO), a powerful technique for handling such subjective tasks. PBO is a human-in-the-loop Bayesian optimization method that runs with relative preferential evaluation (e.g., which design is the best compared among several alternatives) instead of absolute evaluation (e.g., how good the current design is). This technique constructs a predictive model of the latent preference and generates effective preference queries to human evaluators based on the predictive model. Then, I will explain two advanced PBO methods [SIGGRAPH 2017; SIGGRAPH 2020] that achieve even better sample efficiency by combining with tailored user interactions.

Speaker's Biography:

Dr. Yuki Koyama is a Researcher at the National Institute of Advanced Industrial Science and Technology (AIST). He received his Ph.D. from the University of Tokyo in 2017, advised by Prof. Takeo Igarashi. His research fields are computer graphics and human-computer interaction, and he has published his first-authored papers at top venues such as SIGGRAPH, SIGGRAPH Asia, CHI, and UIST. His interest includes computational design and human-in-the-loop design optimization. From 2021, he also started working at Graphinica (a Japanese animation studio), in which he is aiming at bridging art and technology in animation production. He was awarded JSPS Ikushi Prize (2017) and Asiagraphics Young Researcher Award (2021).