Andreas Dengel

DFKI

RPTU Kaiserslautern–Landau

↗ Website

Co-authored Publications: 14

TextTeacher: What Can Language Teach About Images?
Preprint · 2026

Tobias Christian Nauen, Stanislav Frolov, Brian Bernhard Moser, Federico Raue, Ahmed Anwar, Andreas Dengel

We use a frozen text encoder on image captions as a lightweight training-time auxiliary objective for image classifiers. The text components are dropped at inference, leaving a fast, unimodal vision model. Accuracy on ImageNet improves by up to +2.7 p.p. and downstream transfer by +1.0 p.p. on average, outperforming vision knowledge distillation at a fraction of the compute.
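
A minimal PyTorch sketch of the training-time setup, assuming a cosine-alignment auxiliary loss; vision_model, text_encoder, and proj are hypothetical modules, and the paper's exact objective may differ:

    import torch
    import torch.nn.functional as F

    def training_step(vision_model, text_encoder, proj,
                      images, captions, labels, lam=0.5):
        # Hypothetical interfaces: vision_model returns (features, logits),
        # text_encoder embeds caption strings.
        feats, logits = vision_model(images)
        cls_loss = F.cross_entropy(logits, labels)        # standard classification loss
        with torch.no_grad():                             # the text encoder stays frozen
            txt = F.normalize(text_encoder(captions), dim=-1)
        z = F.normalize(proj(feats), dim=-1)              # map image features to text space
        aux_loss = (1.0 - (z * txt).sum(dim=-1)).mean()   # cosine alignment (assumed loss)
        return cls_loss + lam * aux_loss                  # only vision_model runs at inference

Because the text branch contributes only a loss term during training, dropping it at inference leaves the vision model's architecture and latency unchanged.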

When Pretty Isn't Useful: Investigating Why Modern Text-to-Image Models Fail as Reliable Training Data Generators
Accepted to CVPR 2026 · 2026

Krzysztof Adamkiewicz, Brian Bernhard Moser, Stanislav Frolov, Tobias Christian Nauen, Federico Raue, Andreas Dengel

We show that newer text-to-image models are progressively worse as training data generators, despite better visual quality, because they collapse to a narrow aesthetic-centric distribution that diverges from real data.
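
One standard way to quantify such a distribution gap (illustrative; not necessarily the paper's metric) is the Fréchet Inception Distance between generated and real images, e.g. via torchmetrics:

    import torch
    from torchmetrics.image.fid import FrechetInceptionDistance

    fid = FrechetInceptionDistance(feature=2048)   # Inception-v3 pooled features
    # Stand-in uint8 batches; in practice, feed real and generated image tensors.
    real = torch.randint(0, 256, (64, 3, 299, 299), dtype=torch.uint8)
    fake = torch.randint(0, 256, (64, 3, 299, 299), dtype=torch.uint8)
    fid.update(real, real=True)
    fid.update(fake, real=False)
    print(fid.compute())                           # larger value = larger distribution gap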

Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
WACV 2025 · 2025

Tobias Christian Nauen, Sebastian Palacio, Federico Raue, Andreas Dengel

A comprehensive benchmark and analysis of more than 45 transformer models for image classification, evaluating their efficiency across various performance metrics. We identify the optimal architectures and find that scaling the model is more efficient than scaling the input image.
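
A rough sketch of one such efficiency measurement (throughput in images per second); the model name, resolution, and batch size are illustrative, and the paper's benchmark harness is more extensive:

    import time
    import torch
    import timm

    def throughput(name="vit_base_patch16_224", res=224, batch=32, iters=20):
        model = timm.create_model(name, pretrained=False, img_size=res).eval()
        x = torch.randn(batch, 3, res, res)
        with torch.no_grad():
            for _ in range(3):                     # warm-up runs
                model(x)
            t0 = time.perf_counter()
            for _ in range(iters):
                model(x)
        return batch * iters / (time.perf_counter() - t0)

Varying the model name probes model scaling; varying the resolution probes image scaling.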

TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
ICPR 2024 (oral) · 2024

Tobias Christian Nauen, Sebastian Palacio, Andreas Dengel

This paper introduces TaylorShift, a novel reformulation of the attention mechanism using Taylor softmax that enables computing full token-to-token interactions in linear time. We analytically and empirically determine the crossover points where employing TaylorShift becomes more efficient than traditional attention. TaylorShift outperforms the traditional transformer architecture in 4 out of 5 tasks.
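
The enabling observation is that the second-order Taylor expansion exp(x) ≈ 1 + x + x²/2 stays positive (1 + x + x²/2 = ((x + 1)² + 1)/2), so it can replace exp inside the softmax, and the resulting sums can be reordered so the N×N score matrix is never materialized. A minimal NumPy sketch of this reordering (illustrative; not the paper's optimized algorithm):

    import numpy as np

    def taylor_attention_direct(Q, K, V):
        # O(N^2) reference: softmax's exp replaced by 1 + x + x^2/2.
        S = Q @ K.T
        W = 1.0 + S + 0.5 * S**2
        return (W / W.sum(axis=1, keepdims=True)) @ V

    def taylor_attention_linear(Q, K, V):
        # Linear in N: each Taylor term is pre-aggregated over the keys.
        N, d = Q.shape
        num = np.tile(V.sum(axis=0), (N, 1))            # order-0 term
        den = np.full((N, 1), float(N))
        num += Q @ (K.T @ V)                            # order 1: Q(K^T V), O(N d^2)
        den += (Q @ K.sum(axis=0))[:, None]
        QQ = np.einsum('nd,ne->nde', Q, Q)              # order 2 uses (q.k)^2 = <q⊗q, k⊗k>
        KK = np.einsum('nd,ne->nde', K, K)
        num += 0.5 * np.einsum('nde,def->nf', QQ, np.einsum('nde,nf->def', KK, V))
        den += 0.5 * np.einsum('nde,de->n', QQ, KK.sum(axis=0))[:, None]
        return num / den

    Q, K, V = (np.random.randn(128, 8) for _ in range(3))
    assert np.allclose(taylor_attention_direct(Q, K, V),
                       taylor_attention_linear(Q, K, V))

The reordered form costs O(N·d³) rather than O(N²·d), which is why the crossover point between the two regimes depends on the sequence length relative to the head dimension.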