GADE Research
GADE Research is the current strategic priority of GADE Union, focusing on frontier academic research.
We specialize in Multimodal and Agent-based evaluation benchmarks. Unlike conventional datasets, we aim to build high-quality benchmarks that precisely capture model boundaries and critical flaws. We believe that identifying where models "fail" is essential for providing clear direction for the research and iteration of frontier models.
Selected Work
Pix2Fact (2026): An expert-level visual reasoning benchmark project designed to identify factual reasoning flaws in complex Visual Question Answering (VQA) models. Paper Link