Zhi-Yi Chin
Hi, I’m a research assistant at National Yang Ming Chiao Tung University (NYCU) in Taiwan, working with Wei-Chen (Walon) Chiu. I also collaborate closely with Pin-Yu Chen from IBM and Mario Fritz from CISPA.
My research interests lie in the application of multimodal generative models, with a specific focus on ensuring their trustworthiness. Additionally, I am conducting interpretability analyses to understand the mechanisms behind these attacks better, as it is crucial to not only identify vulnerabilities but also to comprehend the underlying causes. Beyond my work in the trustworthy ML/ AI safety domain, I am interested in multimodal/ LLM focus applications and alignment. I aim to design an improved evaluation method for assessing video-text alignment, where standard metrics currently fall short.
Previously, I completed my master’s degree in computer scient under the guidance of Professor Wei-Chen (Walon) Chiu. Prior to that, I earned my bachelor degree in computer science from National Chung Cheng University, where I had the privilege of being advised by Professor Chen-Kuo (Adrian) Chiang.
Please feel free to explore my academic journey and research interests on this website, including my Curriculum Vitae. I’m currently seeking a Ph.D. position in multimodal generative model applications: trustworthy, alignment, interpretability. Should my research resonate with you, I welcome you to reach out at joycenerd.cs09[AT]nycu.edu.tw for any potential collaboration or discussion.
Publications
For a comprehensive list of my publications, please refer to my Curriculum Vitae (CV).
($^\dagger$ indicates equal contribution)
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts Zhi-Yi Chin†, Chieh-Ming Jiang†, Pin-Yu Chen, Ching-Chun Huang, Wei-Chen Chiu In ICML 2024 | |
Realizing Video Summarization from the Path of Language-based Semantic Understanding Kuan-Chen Mu, Zhi-Yi Chin, Wei-Chen Chiu Preprint | |
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where Zhi-Yi Chin†, Chieh-Ming Jiang†, Pin-Yu Chen, Ching-Chun Huang, Wei-Chen Chiu In WACV 2024 | |
Multi-Camera Tracking by Candidate Intersection Ratio Tracklet Matching Yun-Lun Li, Zhi-Yi Chin, Ming-Ching Chang, Chen-Kuo Chiang. In CVPR Workshop 2021 |
Projects
3D Point Cloud Data Augmentation via Scene Representation Network Pei-Tse Chiang, Meng-Hsun Tsai, Zhi-Yi Chin, Chieh-Ming Jiang. 2021 MediaTek Research Project We design a 3D point cloud augmentation based on a novel view synthesis method, scene representation networks, and use PointNet to evaluate our augmented point clouds quality. We replace instance object id with image features from ResNet to apply our method on unseen objects and do interpolation later on. Our method is successful in ModelNet10 and generates the augmented data by intra-class interpolation with ShapeNet in the latent space of SRN encoder. | |
RSNA Pneumonia Detection Zhi-Yi Chin, Chieh-Ming Jiang. Final project in Setected Topics in Visual Recognition Using Deep Learning 2021 Fall We design a two stage method for RSNA Pneumonia detection challenge held on Kaggle. We get the best results by using EfficientNet as classification model with 0.2 classification probability threshold when testing, and YOLOR as detection model. At last, we boost the final accuracy 2% by resizing the predicted bounding box to 87.5% of the original size. | |
Generative Models as a Data Augmentation for Classification Zhi-Yi Chin, Chieh-Ming Jiang. Final project in Deep Learning and Practice 2021 Summer We investigate image transformation by exploring walks in the latent space of GAN, which is called GAN steer. We conclude that GAN steerability is a better data augmentation technique compare to transformation done in the data space. | |
Reimplemenatation Challenge -- Maximum a Posteriori Policy Optimisation Zhi-Yi Chin,Yi-Hsin Chen, Yu-Hsuan Li, Yu-Jie Chen. Reimplementation project in Reinforcement Learning 2021 Spring Apart from replicating the algorithm from the paper, we also apply numerical tricks to stabilize the training process. Moreover, We are considering improving the method by modifying the E-step. | |
Lane Detection Zhi-Yi Chin, Shao-Yu Weng, Bo-Yu Cheng. Final project in Computer Vision 2021 Spring We modify two traditional methods and successfully detect more than 2 lanes with accuracy over 70%. We reach high accuracy by apply hourglass network and double hinge loss. | |
Mango Classification Zhi-Yi Chin, Tzu-Cheng Lin, Kung-Hao Chang, Yu-Chang Chen. Final project in Machine Learning 2020 Spring Achieve accuracy 82.31% on the testing data and rank 8 in the public board in AICUP Mango Image Recognition Challenge: Grade Classification and Defective Classification. | |
Face Morphing and Warping Zhi-Yi Chin. Final project in Introduction to Multimedia Technology in Fall 2019 Face swapping from my face to another person's face smoothly without ghost effect by morphing and warping technique. | |
Calendar Helper Zhi-Yi Chin, Mi Li, Jhong-Yu Huang. Google CodeU project 2019 It is a multifunctional platform for to-do lists and calendars. Its highlight is that we have added a tagging system to calendar events/tasks. |
Honors
- Dean’s list (6 times), Computer Science and Information Engineering Dept. at CCU, Fall ‘17, Spring ‘18, Fall ‘18, Spring ‘19, Fall ‘19, Spring ‘20
- College Student Research Scholarship, get NT$ 48,000 from Ministry of Science and Technology, 2020
- Google Student Travel Scholarship, scholarship to attend Grace Hopper Celebration, 2019
Miscellaneous
Books I enjoy
- The Ride of a Lifetime by Robert Iger
- Becoming by Michelle Obama
- What We Owe the Future by William MacAskill
- Atomic Habits by James Clear
- Make Time by Jake Knapp and John Zeratsky
- Show Your Work by Austin Kleon
TV shows I enjoy
- Grey’s Anatomy
- Lessons in Chemistry
- The Morning Show
- Hospital Playlist