TWO SHOTS are Enough! — CGMaker with sparse 3DGS
Published:
Established a pipeline capable of performing 3D reconstruction of a moving scene using only two shots, followed by editing the reconstructed scene.
Published:
Established a pipeline capable of performing 3D reconstruction of a moving scene using only two shots, followed by editing the reconstructed scene.
Published:
A.pl offers a modular SDK enabling blockchain-based autonomous agents to securely generate interaction data, addressing Web2 data scarcity. It uses asynchronous methods to overcome blockchain latency and concurrency issues.
Published:
APT-based Pipeline is an end-to-end insurance analysis system using watt-tool-8B for function orchestration and Mistral-small-24B for detailed output generation.
Published in ACL 2025 (main), 2025
We introduce ACON, a dataset of 1,000 images (500 newly contributed) paired with captions, editing instructions, and Q&A pairs to evaluate cross-modal transfers rigorously.
Published in preprint, 2025
InfoCausalQA—a benchmark of 494 infographic–text pairs with 1,482 human-revised MCQs generated via GPT-4o—tests quantitative trend reasoning and five semantic causal types (cause, effect, intervention, counterfactual, temporal) and shows that current VLMs, far below human performance, struggle with genuinely grounded causal inference from infographics.