Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning

Nov 18, 2024·
Arundhati S Shanbhag
,
Brian Bernhard Moser
Tobias Christian Nauen Tobias Christian Nauen
,
Stanislav Frolov
,
Federico Raue
,
Andreas Dengel
PDF
Abstract
Diffusion models, known for their generative capabilities, have recently shown unexpected potential in image classification tasks by using Bayes’ theorem. However, most diffusion classifiers require evaluating all class labels for a single classification, leading to significant computational costs that can hinder their application in large-scale scenarios. To address this, we present a Hierarchical Diffusion Classifier (HDC) that exploits the inherent hierarchical label structure of a dataset. By progressively pruning irrelevant high-level categories and refining predictions only within relevant subcategories, i.e., leaf nodes, HDC reduces the total number of class evaluations. As a result, HDC can accelerate inference by up to 60% while maintaining and, in some cases, improving classification accuracy. Our work enables a new control mechanism of the trade-off between speed and precision, making diffusion-based classification more viable for real-world applications, particularly in large-scale image classification tasks.
Type
Publication
arXiv preprint (arXiv)
publications

For more information, see the paper pdf.

Associated Projects: SEmbedAI, SustAInML, Albatross

Citation

If you use this work, please cite our paper:

@misc{shanbhag2024justleafit,
  title         = {Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning},
  author        = {Arundhati S. Shanbhag and Brian B. Moser and Tobias C. Nauen and Stanislav Frolov and Federico Raue and Andreas Dengel},
  year          = {2024},
  eprint        = {2411.12073},
  archiveprefix = {arXiv},
  primaryclass  = {cs.CV}
}
Tobias Christian Nauen
Authors
PhD Student
I’m a researcher of artificial intelligence at DFKI and RPTU Kaiserslautern-Landau. My research interests include efficient deep learning, transformer models, multimodal learning, and computer vision. In my PhD project, my focus lies on the development of efficient transformer models for vision, language, and multimodal tasks.