My Curriculum Vitae
This is my formal CV. Please click HERE to review the PDF file.
BIO
I am a master’s student (2021.9 – 2024.6) at the Information Science Lab of École Centrale Pékin (Sino-French Engineering School), Beihang University (Beijing, China), supervised by Prof. Meng Song. I am currently pursuing a master’s degree in Electronics and Information at Beihang University, École Centrale Pékin, and an Engineer degree from Groupe des Écoles Centrale (France). Before that, I received my BSc degree in Mathematics and Applied Mathematics from Beihang University, École Centrale Pékin (2017.9 – 2021.6).
My research interest primarily focuses on privacy-preserving deep learning, particularly differential private (DP) deep learning. The objective of my graduate program is to employ differential privacy techniques to safeguard graph convolutional networks (GCN) against membership inference attacks (MIA) in contexts involving node, edge, and graph classification. Since graph learning benefits from leveraging additional structural information inherent in graph-structured data, my main motivation lies in exploring the potential applications of graph neural networks (GNN) while addressing their vulnerability to privacy breaches. Furthermore, I am also interested in privacy-preserving deep learning in distributed systems, such as utilizing methodologies like federated learning (FL) for privacy-preserving deep learning and applying physically unclonable functions (PUFs) for secure device recognition in real-world scenarios. I am interested in studying privacy attack methodologies as well, including model extraction attacks and membership inference attacks.
EDUCATION
MSc, Electronics and Information (2021.9 - 2024.6)
Beihang University, Sino-French Engineering School
BSc, Mathematics and Applied Mathematics (2017.9 - 2021.6)
Beihang University, Sino-French Engineering School
INTERNSHIPS & RA
Research Assistant
Chinese University of Hong Kong (Shenzhen, China) (2023.11 – 2024.1)
I am participating in Prof. Yue Zheng ’s team as a research assistant at CUHK (Shenzhen), with a research topic regarding AI for Security and Security of AI in IoT context. Currently we focus on using deep learning (CNN+ML) to process DRAM PUFs in order to create secure and lightweight end-to-end device recognition in IoT context.
Security & Governance Algorithm Intern
AI Group, Zhihu.Inc (Beijing, China) (2023.2 – 2023.7)
I participated in a five-month internship as a Security & Governance Algorithm intern at Zhihu.Inc, Beijing, China. My work duty mainly falls on the utilization and improvement of security and governance algorithms used at zhihu.com, the largest Chinese online Q&A forum. Here are my key contributions.
-
Image classification task: crowd-gathering image classification model.
Training a binary image classification model. By finetuning on 3 backbone networks with different neg-sample ratios, the best model after evaluation (90% Acc & 82% Recall) was submitted as a service. -
General Large Language Model Evaluation Task/Dataset Collection.
Researching the existing large model evaluation work and giving the task capability breakdown, then collecting representative Chinese/English datasets under each task and giving the mapping schema, prompt, and evaluation index. -
Variant-text recognition task.
Using Chinese-RoBERTa3 as the backbone network to finetune a variant-text recognition model. To solve the inconsistency between online and offline inference, I solved the problem that in the pre-processing process the tokenizer online and offline were not aligned. The final metric is 95% Acc with 88% Recall. -
Model Inference Consistency Special Task.
To address the problem of DNNs’ catastrophic forgetting when we regularly update our model, it is critical to improve the new model’s recall on the old model within the same task. Initially, I used the new model to infer the training set of the old model and re-add false negative samples (e.g., positive samples with low model scores) in the new model's training set for retraining. After conducting experiments with different thresholds, the new model improved the recall by 53% relatively (29.32% - 44.86%). The next step would be to find more similar samples of low-scoring samples by clustering their embedding vectors.
Machine Learning Intern
Machine Learning Group, ZhiNeng Tech.Inc (Beijing, China) (2022.7 – 2022.11)
I participated in a four-month internship as a Machine Learning Intern at ZhiNeng Tech. Inc Beijing, China. Here are my key contributions.
-
Electricity Trading Project
Building a deep learning model (Resnet50) to make a time-series forecast of daily electricity load data and implementing iterative optimization based on result analysis. Predicting rolling market prices on electricity trading days using machine learning methods (Ensemble). -
Trading Strategy Project
Collating real market data and forecast data at all levels and analyzing and verifying the profit and loss of different trading strategies. -
Air Pollutant Traceability Project
Building and optimizing a Gaussian smoke-cluster model to simulate the emission of air pollutants.
RESEARCH PROJECT
Simulation model for componentized mechanistic of aircraft (2021.9 – 2022.5)
The project sponsor is China Aerospace Science and Industry Corporation Limited (CASIC) II. The task requires static and dynamic system modeling and simulation for componentized aircraft as well as missiles based on mechanical knowledge and classical control theory using Sysml language. Received Outstanding Project of the Year Award by Sino-French Engineering School, Beihang University.
A Study of Differential Privacy in Recommendation Systems (2021.4 – 2022.1)
Collaboration with Orange (Beijing) to study the application of differential privacy and federation learning in recommendation systems. Based on TensorFlow 1., the project aimed to study the performance of NCF and DSSM recommendation algorithms combined with federated learning and differential privacy on the MIND news dataset.
Installation and debugging of materials science software and interface construction (2020.9 – 2021.5)
The whole project was based on the High-Performance Computing Center platform under the Linux system, therefore I obtained a preliminary understanding of the Linux system and software compilation process, and parallel computing using MPI protocol.
TECHNICS & LINGUISTICS
English: IETLS: 7.5 (8.5 in Listening)
French: 6 years of French education experience
Coding: Python, C/C++, Matlab
DL: Pytorch, TensorFlow, Scikit-learn
