Part of Robotics Safety division of ETHRC. Exploring adversarial attacks and defenses of VLA models in humanoid robots.
Andrei Baroian
From pre-training to post-training to attacking LLMs.
I'm a MSc Computer Science - AI student at Leiden University, currently doing my thesis at SPY Lab (ETH Zurich) on prompt injection attacks, supervised by Jie Zhang and Florian Tramèr. I'm also part of the Robotics Safety Division at ETH Robotics Club, working on adversarial attacks against VLA models in humanoid robots.
My past research includes LLM pre-training (exploring architectural variants) and LLM post-training (speeding up GRPO training with Prompt Reuse), with smaller projects in LLM quantization and mechanistic interpretability. I also work as a data engineer at Akida and was a Teaching Assistant at Leiden University.
Excited about autoresearch, scared of cybersecurity capabilities of agents.
Research & Projects
Experience
- Grade assignments and provide feedback.
- Guide students in selecting, understanding, and presenting research papers.
- Develop filtering logic to detect construction projects in public-sector sources using heuristics & GenAI.
- Build the extraction pipeline for summarization and structured information retrieval with LLMs.
- Collaborate on annotation workflows and quality evaluation of LLMs.
- Test, deploy, and monitor Azure applications.
Education
Notable grades: Seminar in Deep Reinforcement Learning (10), Deep Learning (9.0), Seminar in Deep Learning (9.0), Natural Language Processing (9.0).
Adversarial attacks on vision-language models. Supervised by Jie Zhang and Florian Tramèr.