Efficient and Sustainable AI

deep learning efficiency; sustainable AI; deep learning optimization; AI4Science

What we are interested in

Deep Learning (DL) has proven to be a powerful tool to address complex problems across a wide range of fields. At iol.LEARN, we mainly focus on the intersection of sustainability and AI from two perspectives: a) enhancing the efficiency of DL methods, and b) employing DL techniques to tackle sustainability-related challenges and other public interest issues. For the former, addressing the challenges posed by the high computational, financial, and environmental costs of training and operating large models such as LLMs, we develop techniques to enhance the computational and memory efficiency through methods like pruning, quantization and decentralized training. For the latter, our group develops methods to tackle sustainability-related challenges, exemplified by our AI4Forest project, which involves learning high-resolution global tree height maps from satellite imagery to monitor deforestation and forest disturbances. We are further interested in taking advantage of DL for scientific discovery (AI4Science) and improving DL training towards robustness and interpretability through theoretically grounded techniques rooted in continuous and discrete optimization.

🧑‍🎓 Members

Sebastian Pokutta
Department Head
pokutta (at) zib.de

Sai Ganesh Nagarajan
Research Area Lead
nagarajan (at) zib.de

Max Zimmer
Research Area Lead
zimmer (at) zib.de

Suhrab Asadulla
asadull (at) zib.de

Mihail Birsan
birsan (at) zib.de

Kartikeya Chitranshi
chitranshi (at) zib.de

Dominik Kuzinowicz
kuzinowicz (at) zib.de

Ingo Meise
meise (at) zib.de

Konrad Mundinger
mundinger (at) zib.de

Nico Pelleriti
pelleriti (at) zib.de

Christophe Roux
roux (at) zib.de

Shpresim Sadiku
sadiku (at) zib.de

Malika Tokpanova
tokpanova (at) zib.de

Berkant Turan
turan (at) zib.de

Alonso Urbano
urbano (at) zib.de

Moritz Wagner
wagner (at) zib.de

Former members

Sushanth Malipati (2025)
Christoph Graczyk (2020–2025)
Felix Prause (2020–2025)
Kartikey Sharma (2020–2024)
Megi Andoni (2023–2024)
Muhammad Nawfal Meeran (2024)
Surya Iyer (2022–2023)
Adnan Mahmud (2023)
Maria Bulychev (2021–2023)
Giacomo Salvati (2021–2023)
Elias Schecke (2021–2022)
Stephan Wäldchen (2020–2022)
Sergi Andreu (2020–2021)
Minh Pham (2020–2021)
Max-Georg Schorr (2020–2021)
Margot Cosson (2021)
Huy Do Anh

Visitors

Takeshi Koshizuka (May–December 2025)
Hiroshi Kera (October 2024–September 2025)
Krunal Patel (February 2024)

💬 Talks and posters

Conference and workshop talks

May 2025: Algorithms and Complexity for Two-Team Polymatrix Zero-Sum Games: Revisiting Constrained Bilinear Minmax Optimization by Sai Ganesh Nagarajan
7th DOxML Conference, Kyoto
May 2025: Neural Discovery in Mathematics by Christoph Spiegel
7th DOxML Conference, Kyoto [PDF]
Nov 2024: A Combinatorial Comparison of Boosting and Repeated Boosting by Ingo Meise
Annual Meeting, SPP Theoretical Foundations of Deep Learning, Tutzing
Oct 2024: Extending the Continuum of Six-Colorings by Christoph Spiegel
41st Kolloquium über Kombinatorik (KolKom), Heidelberg [PDF]
Jul 2024: Extending the Continuum of Six-Colorings by Christoph Spiegel
13th Discrete Mathematics Days (DMD), Alcalá de Henares [PDF]
Sep 2022: Wavelet-based Low Frequency Adversarial Attacks by Shpresim Sadiku
3rd BMS-BGSMath Junior Meeting, Barcelona [PDF]
Mar 2021: Neural Network Approximation Theory by Shpresim Sadiku
BMS Conference [PDF]
Dec 2019: Tensor-based Algorithms for Image Classification by Patrick Gelß
Mathematics of Deep Learning

View More / Less

Research seminar talks

May 2025: Coloring the Plane with Neural Networks by Christoph Spiegel
Advanced Topics in Combinatorics II, Taipei [PDF]
Dec 2024: Coloring the Plane with Neural Networks by Christoph Spiegel
Research Seminar Combinatorics, Berlin [PDF]
May 2024: Learning Operators Via Hypernetworks by Konrad Mundinger
Research Seminar Numerical Analysis of Stochastic and Deterministic Partial Differential Equations, Berlin
Aug 2023: Minimally Distorted Explainable Adversarial Attacks by Shpresim Sadiku
AIP Seminar, RIKEN Center for Advanced Intelligence Project, Tokyo
Feb 2023: What Is Backpropagation? by Shpresim Sadiku
"What is ...?" seminar [PDF]

Poster presentations

Jul 2025: Approximating Latent Manifolds in Neural Networks Via Vanishing Ideals by Max Zimmer
42nd International Conference on Machine Learning (ICML), Vancouver
Jul 2025: DUNIA: Pixel-Sized Embeddings Via Cross-Modal Alignment for Earth Observation Applications by Max Zimmer
42nd International Conference on Machine Learning (ICML), Vancouver
Jul 2025: Neural Discovery in Mathematics: Do Machines Dream of Colored Planes? by Konrad Mundinger and Max Zimmer
42nd International Conference on Machine Learning (ICML), Vancouver
Jul 2025: Neural Concept Verifier: Scaling Prover-Verifier Games Via Concept Encodings by Berkant Turan
Workshop on Actionable Interpretability @ ICML 2025, Vancouver
Jul 2025: Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation by Berkant Turan and Max Zimmer
42nd International Conference on Machine Learning (ICML), Vancouver
May 2025: The Good, the Bad and the Ugly: Watermarks, Transferable Attacks and Adversarial Defenses by Berkant Turan
7th DOxML Conference, Kyoto
May 2025: S-CFE: Simple Counterfactual Explanations by Shpresim Sadiku
28th AISTATS Conference, Phuket
Apr 2025: On the Byzantine-resilience of Distillation-based Federated Learning by Christophe Roux and Max Zimmer
13th International Conference on Learning Representations (ICLR), Singapore
Apr 2025: Group-wise Sparse and Explainable Adversarial Attacks by Shpresim Sadiku
13th International Conference on Learning Representations (ICLR), Singapore
Apr 2025: The Good, the Bad and the Ugly: Watermarks, Transferable Attacks and Adversarial Defenses by Berkant Turan
Workshop on GenAI Watermarking @ ICLR 2025, Singapore
Apr 2025: Compressed Sparse Tiles for Memory-Efficient Unstructured and Semi-Structured Sparsity by Max Zimmer
Workshop on Sparse LLMs @ ICML 2025, Singapore
Jul 2024: Estimating Canopy Height at Scale by Max Zimmer
41st International Conference on Machine Learning (ICML), Vienna
Jul 2024: Unified Taxonomy in AI Safety: Watermarks, Adversarial Defenses, and Transferable Attacks by Berkant Turan
Workshop on Theoretical Foundations of Foundation Models (TF2M) @ ICML 2024, Vienna
Jul 2024: Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries by Alonso Urbano
Geometry-grounded Representation Learning and Generative Modeling Workshop (GRaM) @ ICML 2024, Vienna
May 2024: Sparse Model Soups a Recipe for Improved Pruning Via Model Averaging by Max Zimmer
12th International Conference on Learning Representations (ICLR), Vienna
May 2024: Interpretability Guarantees with Merlin-Arthur Classifiers by Berkant Turan
27th AISTATS Conference, València
May 2024: Neural Parameter Regression for Explicit Representations of PDE Solution Operators by Konrad Mundinger
Workshop on AI4DifferentialEquations in Science @ ICLR 2024, Vienna
Mar 2024: Group-wise Sparse and Explainable Adversarial Attacks by Shpresim Sadiku
Deep Learning: Theory, Applications, and Implications, RIKEN Center for Advanced Intelligence Project, Tokyo
Jul 2023: Extending Merlin-Arthur Classifiers for Improved Interpretability by Berkant Turan
The 1st World Conference on eXplainable Artificial Intelligence
May 2023: How I Learned to Stop Worrying and Love Retraining by Max Zimmer
11th International Conference on Learning Representations (ICLR), Kigali
Mar 2023: Wavelet-based Low Frequency Adversarial Attacks by Shpresim Sadiku
Workshop on Optimization and Machine Learning, Waischenfeld
Mar 2023: How I Learned to Stop Worrying and Love Retraining by Max Zimmer
Workshop on Optimization and Machine Learning, Waischenfeld
Dec 2021: Learning to Schedule Heuristics in Branch-and-Bound by Antonia Chmiela
35th conference on neural information processing systems (NeurIPS)

View More / Less

📝 Publications and preprints

Preprints

Haase, J., Klessascheck, F., Mendling, J., and Pokutta, S. (2025). Sustainability Via LLM Right-sizing. [arXiv]

[BibTeX]

@misc{2025_HaaseFinnJanPokutta_Sustainabilityllm,
  archiveprefix = {arXiv},
  eprint = {2504.13217},
  primaryclass = {cs.CL},
  year = {2025},
  author = {Haase, Jennifer and Klessascheck, Finn and Mendling, Jan and Pokutta, Sebastian},
  title = {Sustainability Via LLM Right-sizing},
  date = {2025-04-17}
}

Haase, J., Hanel, P. H. P., and Pokutta, S. (2025). Has the Creativity of Large-Language Models Peaked? An Analysis of Inter- and Intra-LLM Variability. [arXiv]

[BibTeX]

@misc{2025_HaasePPokutta_Llmvariability,
  archiveprefix = {arXiv},
  eprint = {2504.12320},
  primaryclass = {cs.CL},
  year = {2025},
  author = {Haase, Jennifer and Hanel, Paul H. P. and Pokutta, Sebastian},
  title = {Has the Creativity of Large-Language Models Peaked? An Analysis of Inter- and Intra-LLM Variability},
  date = {2025-04-10}
}

Haase, J., and Pokutta, S. (2025). Beyond Static Responses: Multi-Agent LLM Systems As a New Paradigm for Social Science Research. [arXiv]

[BibTeX]

@misc{2025_HaasePokutta_Multiagentllm_2506-01839,
  archiveprefix = {arXiv},
  eprint = {2506.01839},
  primaryclass = {cs.MA},
  year = {2025},
  author = {Haase, Jennifer and Pokutta, Sebastian},
  title = {Beyond Static Responses: Multi-Agent LLM Systems As a New Paradigm for Social Science Research},
  date = {2025-06-02}
}

Yoda, K., Kawamoto, K., and Kera, H. (2025). Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding. [arXiv]

[BibTeX]

@misc{2025_KazukiKazuhikoKera_Qrcodedecoding_2506-20305,
  archiveprefix = {arXiv},
  eprint = {2506.20305},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Yoda, Kazuki and Kawamoto, Kazuhiko and Kera, Hiroshi},
  title = {Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding},
  date = {2025-06-25}
}

Kera, H., and Kehrein, A. (2025). Gradient-Weighted, Data-Driven Normalization for Approximate Border Bases – Concept and Computation. [arXiv]

[BibTeX]

@misc{2025_KeraAchim_Gradientweightednormalization_2506-09529,
  archiveprefix = {arXiv},
  eprint = {2506.09529},
  primaryclass = {cs.SC},
  year = {2025},
  author = {Kera, Hiroshi and Kehrein, Achim},
  title = {Gradient-Weighted, Data-Driven Normalization for Approximate Border Bases -- Concept and Computation},
  date = {2025-06-11}
}

Kera, H., Pelleriti, N., Ishihara, Y., Zimmer, M., and Pokutta, S. (2025). Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms. [arXiv]

[BibTeX]

@misc{2025_KeraEtAl_Transformeroracles_2505-23696,
  archiveprefix = {arXiv},
  eprint = {2505.23696},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Kera, Hiroshi and Pelleriti, Nico and Ishihara, Yuki and Zimmer, Max and Pokutta, Sebastian},
  title = {Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms},
  date = {2025-05-29}
}

Kera, H., Arakawa, S., and Sato, Y. (2025). CALT: A Library for Computer Algebra with Transformer. [arXiv]

[BibTeX]

@misc{2025_KeraShunYuta_Calttransformer_2506-08600,
  archiveprefix = {arXiv},
  eprint = {2506.08600},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Kera, Hiroshi and Arakawa, Shun and Sato, Yuta},
  title = {CALT: A Library for Computer Algebra with Transformer},
  date = {2025-06-10}
}

Koshizuka, T., and Sato, I. (2025). Understanding Generalization in Physics Informed Models Through Affine Variety Dimensions. [arXiv]

[BibTeX]

@misc{2025_KoshizukaIssei_Generalizationaffinevariety_2501-18879,
  archiveprefix = {arXiv},
  eprint = {2501.18879},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Koshizuka, Takeshi and Sato, Issei},
  title = {Understanding Generalization in Physics Informed Models Through Affine Variety Dimensions},
  date = {2025-01-31}
}

Sadiku, S., Chitranshi, K., Kera, H., and Pokutta, S. (2025). Training on Plausible Counterfactuals Removes Spurious Correlations. [arXiv]

[BibTeX]

@misc{2025_SadikuChitranshiKeraPokutta_Plausiblecounterfactuals,
  archiveprefix = {arXiv},
  eprint = {2505.16583},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Sadiku, Shpresim and Chitranshi, Kartikeya and Kera, Hiroshi and Pokutta, Sebastian},
  title = {Training on Plausible Counterfactuals Removes Spurious Correlations},
  date = {2025-05-22}
}

Kumano, S., Kera, H., and Yamasaki, T. (2025). Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners. [arXiv]

[BibTeX]

@misc{2025_SoichiroKeraToshihiko_Adversarialtransformers,
  archiveprefix = {arXiv},
  eprint = {2505.14042},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Kumano, Soichiro and Kera, Hiroshi and Yamasaki, Toshihiko},
  title = {Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners},
  date = {2025-05-20}
}

Urbano, A., Romero, D. W., Zimmer, M., and Pokutta, S. (2025). RECON: Robust Symmetry Discovery Via Explicit Canonical Orientation Normalization. [arXiv]

[BibTeX]

@misc{2025_UrbanoWZimmerPokutta_Reconsymmetrydiscovery,
  archiveprefix = {arXiv},
  eprint = {2505.13289},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Urbano, Alonso and Romero, David W. and Zimmer, Max and Pokutta, Sebastian},
  title = {RECON: Robust Symmetry Discovery Via Explicit Canonical Orientation Normalization},
  date = {2025-05-19}
}

Sato, Y., Kawamoto, K., and Kera, H. (2025). Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic. [arXiv]

[BibTeX]

@misc{2025_YutaKazuhikoKera_Chainthoughtordering_2506-23875,
  archiveprefix = {arXiv},
  eprint = {2506.23875},
  primaryclass = {cs.LG},
  year = {2025},
  author = {Sato, Yuta and Kawamoto, Kazuhiko and Kera, Hiroshi},
  title = {Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic},
  date = {2025-06-30}
}

Göß, A., Martin, A., Pokutta, S., and Sharma, K. (2024). Norm-induced Cuts: Optimization with Lipschitzian Black-box Functions. [URL] [arXiv]

[BibTeX]

@misc{2024_AdrianMartinPokuttaSharma_Norminducedcuts,
  url = {https://opus4.kobv.de/opus4-trr154/files/518/nic_preprint.pdf},
  archiveprefix = {arXiv},
  eprint = {2403.11546},
  primaryclass = {math.OC},
  year = {2024},
  author = {Göß, Adrian and Martin, Alexander and Pokutta, Sebastian and Sharma, Kartikey},
  title = {Norm-induced Cuts: Optimization with Lipschitzian Black-box Functions}
}

Goerigk, M., Hartisch, M., Merten, S., and Sharma, K. (2024). Feature-Based Interpretable Optimization. [arXiv]

[BibTeX]

@misc{2024_GoerigkHartischMertenSharma_Featureinterpretableoptimization,
  archiveprefix = {arXiv},
  eprint = {2409.01869},
  primaryclass = {math.OC},
  year = {2024},
  author = {Goerigk, Marc and Hartisch, Michael and Merten, Sebastian and Sharma, Kartikey},
  title = {Feature-Based Interpretable Optimization},
  date = {2024-09-03}
}

Mundinger, K., Zimmer, M., and Pokutta, S. (2024). Neural Parameter Regression for Explicit Representations of PDE Solution Operators. [arXiv]

[BibTeX]

@misc{2024_MundingerZimmerPokutta_Neuralparameterregression,
  archiveprefix = {arXiv},
  eprint = {2403.12764},
  primaryclass = {cs.LG},
  year = {2024},
  author = {Mundinger, Konrad and Zimmer, Max and Pokutta, Sebastian},
  title = {Neural Parameter Regression for Explicit Representations of PDE Solution Operators},
  date = {2024-03-19}
}

Haase, J., and Pokutta, S. (2024). Human-AI Co-Creativity: Exploring Synergies Across Levels of Creative Collaboration. [arXiv]

[BibTeX]

@misc{2024_Pokutta_HumanAICOcreative,
  archiveprefix = {arXiv},
  eprint = {2411.12527},
  primaryclass = {cs.HC},
  year = {2024},
  author = {Haase, Jennifer and Pokutta, Sebastian},
  title = {Human-AI Co-Creativity: Exploring Synergies Across Levels of Creative Collaboration},
  date = {2024-11-19}
}

Zimmer, M., Andoni, M., Spiegel, C., and Pokutta, S. (2023). PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs. [arXiv] [code]

[BibTeX]

@misc{2023_ZimmerAndoniSpiegelPokutta_PerpPruneRetrain,
  archiveprefix = {arXiv},
  eprint = {2312.15230},
  primaryclass = {cs.LG},
  year = {2023},
  author = {Zimmer, Max and Andoni, Megi and Spiegel, Christoph and Pokutta, Sebastian},
  title = {PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs},
  code = {https://github.com/ZIB-IOL/PERP},
  date = {2023-12-23}
}

Combettes, C., Spiegel, C., and Pokutta, S. (2020). Projection-free Adaptive Gradients for Large-scale Optimization. [arXiv] [summary] [code]

[BibTeX]

@misc{2020_CombettesSpiegelPokutta_Projectionfreeadaptivegradients,
  archiveprefix = {arXiv},
  eprint = {2009.14114},
  primaryclass = {math.OC},
  year = {2020},
  author = {Combettes, Cyrille and Spiegel, Christoph and Pokutta, Sebastian},
  title = {Projection-free Adaptive Gradients for Large-scale Optimization},
  code = {https://github.com/ZIB-IOL/StochasticFrankWolfe},
  summary = {https://pokutta.com/blog/research/2020/10/21/adasfw.html},
  date = {2020-09-29}
}

Pokutta, S., Spiegel, C., and Zimmer, M. (2020). Deep Neural Network Training with Frank-Wolfe. [arXiv] [summary] [code]

[BibTeX]

@misc{2020_PokuttaSpiegelZimmer_Frankwolfeneuralnetworks,
  archiveprefix = {arXiv},
  eprint = {2010.07243},
  primaryclass = {cs.LG},
  year = {2020},
  author = {Pokutta, Sebastian and Spiegel, Christoph and Zimmer, Max},
  title = {Deep Neural Network Training with Frank-Wolfe},
  code = {https://github.com/ZIB-IOL/StochasticFrankWolfe},
  summary = {https://pokutta.com/blog/research/2020/11/11/NNFW.html},
  date = {2020-10-14}
}

Conference proceedings

Sadiku, S., Wagner, M., and Pokutta, S. (2025). Group-wise Sparse and Explainable Adversarial Attacks. Proceedings of the International Conference on Learning Representations. [arXiv]

[BibTeX]

@inproceedings{2023_SadikuWagnerPokutta_Groupwisesparseattacks,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Learning Representations},
  archiveprefix = {arXiv},
  eprint = {2311.17434},
  primaryclass = {cs.CV},
  author = {Sadiku, Shpresim and Wagner, Moritz and Pokutta, Sebastian},
  title = {Group-wise Sparse and Explainable Adversarial Attacks}
}

Fukuda, T., Kera, H., and Kawamoto, K. (2025). Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. [arXiv]

[BibTeX]

@inproceedings{2024_FukudaKeraKawamoto_AdapterMerging,
  year = {2025},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  archiveprefix = {arXiv},
  eprint = {2412.18219},
  primaryclass = {cs.CV},
  author = {Fukuda, Takuma and Kera, Hiroshi and Kawamoto, Kazuhiko},
  title = {Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning}
}

Głuch, G., Turan, B., Nagarajan, S. G., and Pokutta, S. (2025). The Good, the Bad and the Ugly: Watermarks, Transferable Attacks and Adversarial Defenses. Proceedings of the ICLR Workshop on GenAI Watermarking. [arXiv]

[BibTeX]

@inproceedings{2024_GrzegorzTuranNagarajanPokutta_Watermarksadversarialdefenses,
  year = {2025},
  booktitle = {Proceedings of the ICLR Workshop on GenAI Watermarking},
  archiveprefix = {arXiv},
  eprint = {2410.08864},
  primaryclass = {cs.LG},
  author = {Głuch, Grzegorz and Turan, Berkant and Nagarajan, Sai Ganesh and Pokutta, Sebastian},
  title = {The Good, the Bad and the Ugly: Watermarks, Transferable Attacks and Adversarial Defenses}
}

Hollender, A., Maystre, G., and Nagarajan, S. G. (2025). The Complexity of Two-Team Polymatrix Games with Independent Adversaries. Proceedings of the International Conference on Learning Representations. [arXiv]

[BibTeX]

@inproceedings{2024_HollenderMaystreNagarajan_Twoteampolymatrix,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Learning Representations},
  archiveprefix = {arXiv},
  eprint = {2409.07398},
  primaryclass = {cs.GT},
  author = {Hollender, Alexandros and Maystre, Gilbert and Nagarajan, Sai Ganesh},
  title = {The Complexity of Two-Team Polymatrix Games with Independent Adversaries}
}

Roux, C., Zimmer, M., and Pokutta, S. (2025). On the Byzantine-resilience of Distillation-based Federated Learning. Proceedings of the International Conference on Learning Representations. [arXiv]

[BibTeX]

@inproceedings{2024_RouxZimmerPokutta_Byzantineresilience,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Learning Representations},
  archiveprefix = {arXiv},
  eprint = {2402.12265},
  primaryclass = {cs.LG},
  author = {Roux, Christophe and Zimmer, Max and Pokutta, Sebastian},
  title = {On the Byzantine-resilience of Distillation-based Federated Learning}
}

Sadiku, S., Wagner, M., Nagarajan, S. G., and Pokutta, S. (2025). S-CFE: Simple Counterfactual Explanations. Proceedings of the International Conference on Artificial Intelligence and Statistics. [arXiv]

[BibTeX]

@inproceedings{2024_SadikuEtAl_Counterfactualexplanations,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Artificial Intelligence and Statistics},
  archiveprefix = {arXiv},
  eprint = {2410.15723},
  primaryclass = {cs.LG},
  author = {Sadiku, Shpresim and Wagner, Moritz and Nagarajan, Sai Ganesh and Pokutta, Sebastian},
  title = {S-CFE: Simple Counterfactual Explanations}
}

Haase, J., Hanel, P. H. P., and Pokutta, S. (2025, May 14). S-DAT: a Multilingual, GenAI-driven Framework for Automated Divergent Thinking Assessment. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES). [arXiv]

[BibTeX]

@inproceedings{2025_HaasePPokutta_SdatDivergentthinking,
  year = {2025},
  booktitle = {Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES)},
  archiveprefix = {arXiv},
  eprint = {2505.09068},
  primaryclass = {cs.CL},
  author = {Haase, Jennifer and Hanel, Paul H. P. and Pokutta, Sebastian},
  title = {S-DAT: a Multilingual, GenAI-driven Framework for Automated Divergent Thinking Assessment},
  date = {2025-05-14}
}

Fayad, I., Zimmer, M., Schwartz, M., Ciais, P., Gieseke, F., Belouze, G., Brood, S., De Truchis, A., and d’Aspremont, A. (2025, February 24). DUNIA: Pixel-Sized Embeddings Via Cross-Modal Alignment for Earth Observation Applications. Proceedings of the International Conference on Machine Learning. [arXiv]

[BibTeX]

@inproceedings{2025_IbrahimEtAl_DuniaCrossmodalalignment,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2502.17066},
  primaryclass = {cs.CV},
  author = {Fayad, Ibrahim and Zimmer, Max and Schwartz, Martin and Ciais, Philippe and Gieseke, Fabian and Belouze, Gabriel and Brood, Sarah and De Truchis, Aurelien and d'Aspremont, Alexandre},
  title = {DUNIA: Pixel-Sized Embeddings Via Cross-Modal Alignment for Earth Observation Applications},
  date = {2025-02-24}
}

Pauls, J., Zimmer, M., Turan, B., Saatchi, S., Ciais, P., Pokutta, S., and Gieseke, F. (2025, January 31). Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation. Proceedings of the International Conference on Machine Learning. [arXiv]

[BibTeX]

@inproceedings{2025_JanEtAl_Temporalcanopyheight,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2501.19328},
  primaryclass = {cs.LG},
  author = {Pauls, Jan and Zimmer, Max and Turan, Berkant and Saatchi, Sassan and Ciais, Philippe and Pokutta, Sebastian and Gieseke, Fabian},
  title = {Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation},
  date = {2025-01-31}
}

Lasby, M., Zimmer, M., Pokutta, S., and Schultheis, E. (2025). Compressed Sparse Tiles for Memory-efficient Unstructured and Semi-structured Sparsity. Proceedings of the ICLR Workshop on Sparsity in LLMs.

[BibTeX]

@inproceedings{2025_LasbyZimmerPokuttaSchultheis_SparseTiles,
  year = {2025},
  booktitle = {Proceedings of the ICLR Workshop on Sparsity in LLMs},
  author = {Lasby, Mike and Zimmer, Max and Pokutta, Sebastian and Schultheis, Erik},
  title = {Compressed Sparse Tiles for Memory-efficient Unstructured and Semi-structured Sparsity}
}

Mundinger, K., Zimmer, M., Kiem, A., Spiegel, C., and Pokutta, S. (2025, January 30). Neural Discovery in Mathematics: Do Machines Dream of Colored Planes? Proceedings of the International Conference on Machine Learning. [arXiv]

[BibTeX]

@inproceedings{2025_MundingerZimmerKiemSpiegelPokutta_NeuralDiscovery,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2501.18527},
  primaryclass = {cs.LG},
  author = {Mundinger, Konrad and Zimmer, Max and Kiem, Aldo and Spiegel, Christoph and Pokutta, Sebastian},
  title = {Neural Discovery in Mathematics: Do Machines Dream of Colored Planes?},
  date = {2025-01-30}
}

Pelleriti, N., Zimmer, M., Wirth, E., and Pokutta, S. (2025, February 20). Approximating Latent Manifolds in Neural Networks Via Vanishing Ideals. Proceedings of the International Conference on Machine Learning. [arXiv]

[BibTeX]

@inproceedings{2025_PelleritiZimmerWirthPokutta_Latentmanifolds,
  year = {2025},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2502.15051},
  primaryclass = {cs.LG},
  author = {Pelleriti, Nico and Zimmer, Max and Wirth, Elias and Pokutta, Sebastian},
  title = {Approximating Latent Manifolds in Neural Networks Via Vanishing Ideals},
  date = {2025-02-20}
}

Turan, B., Asadulla, S., Steinmann, D., Stammer, W., and Pokutta, S. (2025, July 10). Neural Concept Verifier: Scaling Prover-Verifier Games Via Concept Encodings. Proceedings of the ICML Workshop on Actionable Interpretability. [arXiv]

[BibTeX]

@inproceedings{2025_TuranEtAl_Neuralconceptverifier_2507-07532,
  year = {2025},
  booktitle = {Proceedings of the ICML Workshop on Actionable Interpretability},
  archiveprefix = {arXiv},
  eprint = {2507.07532},
  primaryclass = {cs.LG},
  author = {Turan, Berkant and Asadulla, Suhrab and Steinmann, David and Stammer, Wolfgang and Pokutta, Sebastian},
  title = {Neural Concept Verifier: Scaling Prover-Verifier Games Via Concept Encodings},
  date = {2025-07-10}
}

Wäldchen, S., Sharma, K., Turan, B., Zimmer, M., and Pokutta, S. (2024). Interpretability Guarantees with Merlin-Arthur Classifiers. Proceedings of the International Conference on Artificial Intelligence and Statistics. [arXiv]

[BibTeX]

@inproceedings{2022_WaeldchenEtAl_Interpretabilityguarantees,
  year = {2024},
  booktitle = {Proceedings of the International Conference on Artificial Intelligence and Statistics},
  archiveprefix = {arXiv},
  eprint = {2206.00759},
  primaryclass = {cs.LG},
  author = {Wäldchen, Stephan and Sharma, Kartikey and Turan, Berkant and Zimmer, Max and Pokutta, Sebastian},
  title = {Interpretability Guarantees with Merlin-Arthur Classifiers}
}

Kera, H., Ishihara, Y., Kambe, Y., Vaccon, T., and Yokoyama, K. (2024). Learning to Compute Gröbner Bases. Proceedings of the Conference on Neural Information Processing Systems. [arXiv]

[BibTeX]

@inproceedings{2023_KeraEtAl_Grbnerlearning,
  year = {2024},
  booktitle = {Proceedings of the Conference on Neural Information Processing Systems},
  archiveprefix = {arXiv},
  eprint = {2311.12904},
  primaryclass = {math.AC},
  author = {Kera, Hiroshi and Ishihara, Yuki and Kambe, Yuta and Vaccon, Tristan and Yokoyama, Kazuhiro},
  title = {Learning to Compute Gr\"obner Bases}
}

Zimmer, M., Spiegel, C., and Pokutta, S. (2024). Sparse Model Soups: A Recipe for Improved Pruning Via Model Averaging. Proceedings of the International Conference on Learning Representations. [URL] [arXiv]

[BibTeX]

@inproceedings{2023_ZimmerSpiegelPokutta_Sparsemodelsoups,
  year = {2024},
  booktitle = {Proceedings of the International Conference on Learning Representations},
  url = {https://iclr.cc/virtual/2024/poster/17433},
  archiveprefix = {arXiv},
  eprint = {2306.16788},
  primaryclass = {cs.LG},
  author = {Zimmer, Max and Spiegel, Christoph and Pokutta, Sebastian},
  title = {Sparse Model Soups: A Recipe for Improved Pruning Via Model Averaging}
}

Kumano, S., Kera, H., and Yamasaki, T. (2024, October 31). Wide Two-Layer Networks Can Learn From Adversarial Perturbations. Proceedings of the Conference on Neural Information Processing Systems. [URL] [arXiv]

[BibTeX]

@inproceedings{2024_KuramoKeraKamasaki_AdversarialPerturbation,
  year = {2024},
  booktitle = {Proceedings of the Conference on Neural Information Processing Systems},
  url = {https://proceedings.neurips.cc/paper_files/paper/2024/hash/6e0274fdb05f21654421db43cd8de042-Abstract-Conference.html},
  archiveprefix = {arXiv},
  eprint = {2410.23677},
  primaryclass = {cs.LG},
  author = {Kumano, Soichiro and Kera, Hiroshi and Yamasaki, Toshihiko},
  title = {Wide Two-Layer Networks Can Learn From Adversarial Perturbations},
  date = {2024-10-31}
}

Mundinger, K., Pokutta, S., Spiegel, C., and Zimmer, M. (2024, April 8). Extending the Continuum of Six-Colorings. Proceedings of the Discrete Mathematics Days. [URL] [arXiv]

[BibTeX]

@inproceedings{2024_MundingerPokuttaSpiegelZimmer_SixcoloringsExpansion:1,
  year = {2024},
  booktitle = {Proceedings of the Discrete Mathematics Days},
  url = {https://dmd2024.web.uah.es/files/abstracts/paper_27.pdf},
  archiveprefix = {arXiv},
  eprint = {2404.05509},
  primaryclass = {math.CO},
  author = {Mundinger, Konrad and Pokutta, Sebastian and Spiegel, Christoph and Zimmer, Max},
  title = {Extending the Continuum of Six-Colorings},
  date = {2024-04-08}
}

Pauls, J., Zimmer, M., Kelly, U. M., Schwartz, M., Saatchi, S., Ciais, P., Pokutta, S., Brandt, M., and Gieseke, F. (2024, June 3). Estimating Canopy Height at Scale. Proceedings of the International Conference on Machine Learning. [arXiv] [code]

[BibTeX]

@inproceedings{2024_PaulsEtAl_Canopyheightestimation,
  year = {2024},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2406.01076},
  primaryclass = {cs.CV},
  author = {Pauls, Jan and Zimmer, Max and Kelly, Una M and Schwartz, Martin and Saatchi, Sassan and Ciais, Philippe and Pokutta, Sebastian and Brandt, Martin and Gieseke, Fabian},
  title = {Estimating Canopy Height at Scale},
  code = {https://github.com/AI4Forest/Global-Canopy-Height-Map},
  date = {2024-06-03}
}

Zimmer, M., Spiegel, C., and Pokutta, S. (2023). How I Learned to Stop Worrying and Love Retraining. Proceedings of the International Conference on Learning Representations. [URL] [arXiv] [code]

[BibTeX]

@inproceedings{2021_ZimmerSpiegelPokutta_Retrainingpruning,
  year = {2023},
  booktitle = {Proceedings of the International Conference on Learning Representations},
  url = {https://iclr.cc/virtual/2023/poster/10914},
  archiveprefix = {arXiv},
  eprint = {2111.00843},
  primaryclass = {cs.LG},
  author = {Zimmer, Max and Spiegel, Christoph and Pokutta, Sebastian},
  title = {How I Learned to Stop Worrying and Love Retraining},
  code = {https://github.com/ZIB-IOL/BIMP}
}

Chmiela, A., Gleixner, A., Lichocki, P., and Pokutta, S. (2023). Online Learning for Scheduling MIP Heuristics. Proceedings of the International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, 114–123. DOI: 10.1007/978-3-031-33271-5_8 [arXiv]

[BibTeX]

@inproceedings{2023_ChmielaGleixnerLichockiPokutta_Onlinelearningscheduling,
  year = {2023},
  date = {2023-05-23},
  booktitle = {Proceedings of the International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research},
  pages = {114-123},
  doi = {10.1007/978-3-031-33271-5_8},
  archiveprefix = {arXiv},
  eprint = {2304.03755},
  primaryclass = {math.OC},
  author = {Chmiela, Antonia and Gleixner, Ambros and Lichocki, Pawel and Pokutta, Sebastian},
  title = {Online Learning for Scheduling MIP Heuristics}
}

Mexi, G., Besançon, M., Bolusani, S., Chmiela, A., Hoen, A., and Gleixner, A. (2023, July 7). Scylla: a Matrix-free Fix-propagate-and-project Heuristic for Mixed-integer Optimization. Proceedings of the Conference of the Society for Operations Research in Germany. [arXiv]

[BibTeX]

@inproceedings{2023_MexiEtAl_Scyllaheuristic,
  year = {2023},
  booktitle = {Proceedings of the Conference of the Society for Operations Research in Germany},
  archiveprefix = {arXiv},
  eprint = {2307.03466},
  primaryclass = {math.OC},
  author = {Mexi, Gioni and Besançon, Mathieu and Bolusani, Suresh and Chmiela, Antonia and Hoen, Alexander and Gleixner, Ambros},
  title = {Scylla: a Matrix-free Fix-propagate-and-project Heuristic for Mixed-integer Optimization},
  date = {2023-07-07}
}

Thuerck, D., Sofranac, B., Pfetsch, M., and Pokutta, S. (2023, May 20). Learning Cuts Via Enumeration Oracles. Proceedings of the Conference on Neural Information Processing Systems. [arXiv]

[BibTeX]

@inproceedings{2023_ThuerckSofranacPfetschPokutta_Learningcutsenumeration,
  year = {2023},
  booktitle = {Proceedings of the Conference on Neural Information Processing Systems},
  archiveprefix = {arXiv},
  eprint = {2305.12197},
  primaryclass = {math.OC},
  author = {Thuerck, Daniel and Sofranac, Boro and Pfetsch, Marc and Pokutta, Sebastian},
  title = {Learning Cuts Via Enumeration Oracles},
  date = {2023-05-20}
}

Macdonald, J., Besançon, M., and Pokutta, S. (2022). Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings. Proceedings of the International Conference on Machine Learning. [arXiv] [poster] [video]

[BibTeX]

@inproceedings{2021_MacdonaldBesanconPokutta_Interpretableneuralnetworks,
  year = {2022},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2110.08105},
  primaryclass = {cs.LG},
  author = {Macdonald, Jan and Besançon, Mathieu and Pokutta, Sebastian},
  title = {Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings},
  poster = {https://pokutta.com/slides/20220712_icml_poster_interpretable_rde.pdf},
  video = {https://slideslive.com/38983588}
}

Tsuji, K., Tanaka, K., and Pokutta, S. (2022). Pairwise Conditional Gradients without Swap Steps and Sparser Kernel Herding. Proceedings of the International Conference on Machine Learning. [arXiv] [summary] [slides] [code] [video]

[BibTeX]

@inproceedings{2021_TsujiTanakaPokutta_Pairwiseconditionalgradients,
  year = {2022},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2110.12650},
  primaryclass = {math.OC},
  author = {Tsuji, Kazuma and Tanaka, Ken'ichiro and Pokutta, Sebastian},
  title = {Pairwise Conditional Gradients without Swap Steps and Sparser Kernel Herding},
  code = {https://github.com/ZIB-IOL/FrankWolfe.jl},
  slides = {https://pokutta.com/slides/20220624_ICML2022_BPCG.pdf},
  summary = {https://pokutta.com/blog/research/2022/05/21/bpcg-abstract.html},
  video = {https://slideslive.com/38983561}
}

Wäldchen, S., Huber, F., and Pokutta, S. (2022, February 23). Training Characteristic Functions with Reinforcement Learning: XAI-methods Play Connect Four. Proceedings of the International Conference on Machine Learning. [arXiv] [poster] [video]

[BibTeX]

@inproceedings{2022_WaeldchenHuberPokutta_CharacteristicfunctionsReinforcementlearning,
  year = {2022},
  booktitle = {Proceedings of the International Conference on Machine Learning},
  archiveprefix = {arXiv},
  eprint = {2202.11797},
  primaryclass = {cs.LG},
  author = {Wäldchen, Stephan and Huber, Felix and Pokutta, Sebastian},
  title = {Training Characteristic Functions with Reinforcement Learning: XAI-methods Play Connect Four},
  poster = {https://pokutta.com/slides/20220712_icml_poster_conn4.pdf},
  video = {https://slideslive.com/38983111},
  date = {2022-02-23}
}

Chmiela, A., Khalil, E. B., Gleixner, A., Lodi, A., and Pokutta, S. (2021). Learning to Schedule Heuristics in Branch-and-bound. Proceedings of the Conference on Neural Information Processing Systems, 34, 24235–24246. [URL] [arXiv] [poster]

[BibTeX]

@inproceedings{2021_ChmielaEtAl_Heuristicscheduling,
  year = {2021},
  booktitle = {Proceedings of the Conference on Neural Information Processing Systems},
  month = mar,
  volume = {34},
  pages = {24235–24246},
  url = {https://proceedings.neurips.cc/paper_files/paper/2021/file/cb7c403aa312160380010ee3dd4bfc53-Paper.pdf},
  archiveprefix = {arXiv},
  eprint = {2103.10294},
  primaryclass = {cs.LG},
  author = {Chmiela, Antonia and Khalil, Elias B. and Gleixner, Ambros and Lodi, Andrea and Pokutta, Sebastian},
  title = {Learning to Schedule Heuristics in Branch-and-bound},
  poster = {https://pokutta.com/slides/20211120_poster_NeurIPS21_learningheuristics.pdf},
  date = {2021-03-18}
}

Ziemke, T., Sering, L., Vargas Koch, L., Zimmer, M., Nagel, K., and Skutella, M. (2020). Flows Over Time As Continuous Limits of Packet-based Network Simulations. Proceedings of the EURO Working Group on Transportation Meeting.

[BibTeX]

@inproceedings{2021_ZiemkeEtAl_Flowscontinuouslimits:1,
  year = {2020},
  booktitle = {Proceedings of the EURO Working Group on Transportation Meeting},
  author = {Ziemke, Theresa and Sering, Leon and Vargas Koch, Laura and Zimmer, Max and Nagel, Kai and Skutella, Martin},
  title = {Flows Over Time As Continuous Limits of Packet-based Network Simulations}
}

Arumugam, K., Kadampot, I., Tahmasbi, M., Shah, S., Bloch, M., and Pokutta, S. (2017). Modulation Recognition Using Side Information and Hybrid Learning. Proceedings of the IEEE DySPAN.

[BibTeX]

@inproceedings{2017_ArumugamKadampotTahmasbiShahBlochPokutta_ModulationRecognition,
  year = {2017},
  booktitle = {Proceedings of the IEEE DySPAN},
  author = {Arumugam, K. and Kadampot, I. and Tahmasbi, M. and Shah, S. and Bloch, M. and Pokutta, Sebastian},
  title = {Modulation Recognition Using Side Information and Hybrid Learning}
}

Roy, A., Xu, H., and Pokutta, S. (2017, June 15). Reinforcement Learning Under Model Mismatch. Proceedings of the Conference on Neural Information Processing Systems. [arXiv]

[BibTeX]

@inproceedings{2017_RoyXuPokutta_ReinforcementLearning,
  year = {2017},
  booktitle = {Proceedings of the Conference on Neural Information Processing Systems},
  archiveprefix = {arXiv},
  eprint = {1706.04711},
  primaryclass = {cs.LG},
  author = {Roy, Aurko and Xu, Huan and Pokutta, Sebastian},
  title = {Reinforcement Learning Under Model Mismatch},
  date = {2017-06-15}
}

Full articles

Carderera, A., Pokutta, S., Schütte, C., and Weiser, M. (2025). An Efficient First-order Conditional Gradient Algorithm in Data-driven Sparse Identification of Nonlinear Dynamics to Solve Sparse Recovery Problems Under Noise. Journal of Computational and Applied Mathematics. DOI: 10.1016/j.cam.2025.116675 [arXiv]

[BibTeX]

@article{2021_CardereraPokuttaSchutteWeiser_CINDy,
  year = {2025},
  journal = {Journal of Computational and Applied Mathematics},
  date = {2025-04-01},
  month = apr,
  doi = {10.1016/j.cam.2025.116675},
  archiveprefix = {arXiv},
  eprint = {2101.02630},
  primaryclass = {math.DS},
  author = {Carderera, Alejandro and Pokutta, Sebastian and Schütte, Christof and Weiser, Martin},
  title = {An Efficient First-order Conditional Gradient Algorithm in Data-driven Sparse Identification of Nonlinear Dynamics to Solve Sparse Recovery Problems Under Noise}
}

Zimmer, M., Spiegel, C., and Pokutta, S. (2025). Compression-aware Training of Neural Networks Using Frank-Wolfe. Mathematical Optimization for Machine Learning, 137–168. DOI: 10.1515/9783111376776-010 [arXiv]

[BibTeX]

@article{2022_ZimmerSpiegelPokutta_Compressionawaretraining,
  year = {2025},
  journal = {Mathematical Optimization for Machine Learning},
  date = {2025-04-22},
  month = apr,
  pages = {137-168},
  doi = {10.1515/9783111376776-010},
  archiveprefix = {arXiv},
  eprint = {2205.11921},
  primaryclass = {cs.LG},
  author = {Zimmer, Max and Spiegel, Christoph and Pokutta, Sebastian},
  title = {Compression-aware Training of Neural Networks Using Frank-Wolfe}
}

Mundinger, K., Pokutta, S., Spiegel, C., and Zimmer, M. (2024). Extending the Continuum of Six-Colorings. Geombinatorics Quarterly, XXXIV. [URL] [arXiv]

[BibTeX]

@article{2024_MundingerPokuttaSpiegelZimmer_SixcoloringsExpansion,
  year = {2024},
  journal = {Geombinatorics Quarterly},
  date = {2024-07-01},
  volume = {XXXIV},
  url = {https://geombina.uccs.edu/past-issues/volume-xxxiv},
  archiveprefix = {arXiv},
  eprint = {2404.05509},
  primaryclass = {math.CO},
  author = {Mundinger, Konrad and Pokutta, Sebastian and Spiegel, Christoph and Zimmer, Max},
  title = {Extending the Continuum of Six-Colorings}
}

Vu‐Han, T. L., Sunkara, V., Bermudez‐Schettino, R., Schwechten, J., Runge, R., Perka, C., Winkler, T., Pokutta, S., Weiß, C., and Pumberger, M. (2024). Feature Engineering for the Prediction of Scoliosis in 5q‐Spinal Muscular Atrophy. Journal of Cachexia, Sarcopenia and Muscle. DOI: 10.1002/jcsm.13599 [URL]

[BibTeX]

@article{2024_Pokutta_PredictionScoliosis,
  year = {2024},
  journal = {Journal of Cachexia, Sarcopenia and Muscle},
  date = {2024-12-05},
  doi = {10.1002/jcsm.13599},
  url = {https://onlinelibrary.wiley.com/doi/pdf/10.1002/jcsm.13599},
  author = {Vu‐Han, Tu‐Lan and Sunkara, Vikram and Bermudez‐Schettino, Rodrigo and Schwechten, Jakob and Runge, Robin and Perka, Carsten and Winkler, Tobias and Pokutta, Sebastian and Weiß, Claudia and Pumberger, Matthias},
  title = {Feature Engineering for the Prediction of Scoliosis in 5q‐Spinal Muscular Atrophy}
}

Kevin-Martin, A., Bärmann, A., Braun, K., Liers, F., Pokutta, S., Schneider, O., Sharma, K., and Tschuppik, S. (2023). Data-driven Distributionally Robust Optimization Over Time. INFORMS Journal on Optimization, 5(4), 376–394. DOI: 10.1287/ijoo.2023.0091 [URL] [arXiv]

[BibTeX]

@article{2023_KevinEtAl_Datadrivendistributionallyrobust,
  year = {2023},
  journal = {INFORMS Journal on Optimization},
  date = {2023-10-01},
  month = oct,
  volume = {5},
  number = {4},
  pages = {376-394},
  doi = {10.1287/ijoo.2023.0091},
  url = {https://pubsonline.informs.org/doi/pdf/10.1287/ijoo.2023.0091},
  archiveprefix = {arXiv},
  eprint = {2304.05377},
  primaryclass = {math.OC},
  author = {Kevin-Martin, Aigner and Bärmann, Andreas and Braun, Kristin and Liers, Frauke and Pokutta, Sebastian and Schneider, Oskar and Sharma, Kartikey and Tschuppik, Sebastian},
  title = {Data-driven Distributionally Robust Optimization Over Time}
}

Bienstock, D., Muñoz, G., and Pokutta, S. (2023). Principled Deep Neural Network Training Through Linear Programming. Discrete Optimization, 49. DOI: 10.1016/j.disopt.2023.100795 [arXiv] [summary]

[BibTeX]

@article{2018_DanielGonzaloPokutta_Polyhedraltraining,
  year = {2023},
  journal = {Discrete Optimization},
  date = {2023-08-01},
  month = aug,
  volume = {49},
  doi = {10.1016/j.disopt.2023.100795},
  archiveprefix = {arXiv},
  eprint = {1810.03218},
  primaryclass = {cs.LG},
  author = {Bienstock, Daniel and Muñoz, Gonzalo and Pokutta, Sebastian},
  title = {Principled Deep Neural Network Training Through Linear Programming},
  summary = {https://www.pokutta.com/blog/research/2018/10/12/DNN-learning-lp-abstract.html}
}

Kruser, J., Sharma, K., Holl, J., and Nohadani, O. (2023). Identifying Patterns of Medical Intervention in Acute Respiratory Failure: A Retrospective Observational Study. Critical Care Explorations.

[BibTeX]

@article{2023_KruserSharmaHollNohadani_Medicalinterventionpatterns,
  year = {2023},
  journal = {Critical Care Explorations},
  date = {2023-09-30},
  author = {Kruser, Jacqueline and Sharma, Kartikey and Holl, Jane and Nohadani, Omid},
  title = {Identifying Patterns of Medical Intervention in Acute Respiratory Failure: A Retrospective Observational Study}
}

Kossen, T., Hirzel, M., Madai, V. I., Boenisch, F., Hennemuth, A., Hildebrand, K., Pokutta, S., Sharma, K., Hilbert, A., Sobesky, J., Galinovic, I., Khalil, A. A., Fiebach, J. B., and Frey, D. (2022). Towards Sharing Brain Images: Differentially Private TOF-MRA Images with Segmentation Labels Using Generative Adversarial Networks. Frontiers in Artificial Intelligence. DOI: 10.3389/frai.2022.813842

[BibTeX]

@article{2022_KossenEtAl_Brainimagesharing,
  year = {2022},
  journal = {Frontiers in Artificial Intelligence},
  date = {2022-05-02},
  doi = {10.3389/frai.2022.813842},
  author = {Kossen, Tabea and Hirzel, Manuel and Madai, Vince I. and Boenisch, Franziska and Hennemuth, Anja and Hildebrand, Kristian and Pokutta, Sebastian and Sharma, Kartikey and Hilbert, Adam and Sobesky, Jan and Galinovic, Ivana and Khalil, Ahmed A. and Fiebach, Jochen B. and Frey, Dietmar},
  title = {Towards Sharing Brain Images: Differentially Private TOF-MRA Images with Segmentation Labels Using Generative Adversarial Networks}
}

Nohadani, O., and Sharma, K. (2022). Optimization Under Connected Uncertainty. INFORMS Journal on Optimization. DOI: 10.1287/ijoo.2021.0067 [arXiv]

[BibTeX]

@article{2022_NohadaniSharma_Connecteduncertainty,
  year = {2022},
  journal = {INFORMS Journal on Optimization},
  date = {2022-01-24},
  doi = {10.1287/ijoo.2021.0067},
  archiveprefix = {arXiv},
  eprint = {2202.10602},
  primaryclass = {math.OC},
  author = {Nohadani, Omid and Sharma, Kartikey},
  title = {Optimization Under Connected Uncertainty}
}

Pokutta, S. (2021). Mathematik, Machine Learning Und Artificial Intelligence. Mitteilungen Der DMV. [URL]

[BibTeX]

@article{2021_Pokutta_MathematikMachineLearning,
  year = {2021},
  journal = {Mitteilungen der DMV},
  month = jan,
  url = {https://app.box.com/s/8u6fsucm78e1shyyzoghxar40f7xk6ni},
  author = {Pokutta, Sebastian},
  title = {Mathematik, Machine Learning Und Artificial Intelligence}
}

Ziemke, T., Sering, L., Vargas Koch, L., Zimmer, M., Nagel, K., and Skutella, M. (2021). Flows Over Time As Continuous Limits of Packet-based Network Simulations. Transportation Research Procedia, 52, 123–130. DOI: 10.1016/j.trpro.2021.01.014 [URL]

[BibTeX]

@article{2021_ZiemkeEtAl_Flowscontinuouslimits,
  year = {2021},
  journal = {Transportation Research Procedia},
  volume = {52},
  pages = {123-130},
  doi = {10.1016/j.trpro.2021.01.014},
  url = {https://sciencedirect.com/science/article/pii/S2352146521000284},
  author = {Ziemke, Theresa and Sering, Leon and Vargas Koch, Laura and Zimmer, Max and Nagel, Kai and Skutella, Martin},
  title = {Flows Over Time As Continuous Limits of Packet-based Network Simulations}
}

Klus, S., and Gelß, P. (2019). Tensor-based Algorithms for Image Classification. Algorithms, 12(11), 240. DOI: 10.3390/a12110240 [URL] [arXiv]

[BibTeX]

@article{2019_KlusGelss_Tensoralgorithmsclassification,
  year = {2019},
  journal = {Algorithms},
  date = {2019-11-09},
  volume = {12},
  number = {11},
  pages = {240},
  doi = {10.3390/a12110240},
  url = {https://mdpi.com/1999-4893/12/11/240},
  archiveprefix = {arXiv},
  eprint = {1910.02150},
  primaryclass = {cs.LG},
  author = {Klus, Stefan and Gelß, Patrick},
  title = {Tensor-based Algorithms for Image Classification}
}

🔬 Projects

Ongoing Projects

AI-Based High-Resolution Forest Monitoring

Preserving global vegetation is crucial for addressing and mitigating climate change. Accurate, up-to-date forest health data is essential. AI4Forest aims to develop advanced AI methods to monitor forests using satellite imagery, including radar and optical data. The project will create scalable techniques for detailed, high-resolution maps of the globe, e.g., to monitor canopy height, biomass, and to track forest disturbances. Please also visit the project website at ai4forest.eu.

AI4Forest

Jun 2023 to May 2027

Decomposition Methods for Mixed-integer Optimal Control

In this project, we study domain decomposition approaches for optimal control in gas transport networks. Our goal is to couple space-time-domain decomposition with machine learning and mixed-integer programming. We will develop NeTI (Network Tearing and Interconnection), a data-driven and physics-informed algorithm combining mixed-integer nonlinear programming, surrogate model learning, and graph decomposition strategies.

TRR-154 A05

Jan 2022 to Jun 2026

Expanding Merlin-Arthur Classifiers: Interpretable Neural Networks Through Interactive Proof Systems

Existing approaches for interpreting Neural Network classifiers that highlight features relevant for a decision are based solely on heuristics. We introduce a theory that allows us to bound the quality of the features without assumptions on the classifier model by relating classification to Interactive Proof Systems.

MATH+ EF1-24

Apr 2023 to Mar 2026

Completed Projects

Learning to Schedule Heuristics in IP

Heuristics play a crucial role in exact solvers for Mixed Integer Programming (MIP). However, the question of how to manage multiple MIP heuristics in a solver has not received sufficient attention. This project addresses the strategic management of primal heuristics in MIP solvers, aiming to replace static, hard-coded rules with dynamic, self-improving procedures.

HLEARN

Nov 2021 to Oct 2023

Adaptive Algorithms Through Machine Learning: Exploiting Interactions in Integer Programming

The performance of modern mixed-integer program solvers is highly dependent on a number of interdependent individual components. Using tools from machine learning, we intend to develop an integrated framework that is able to capture interactions of individual decisions made in these components with the ultimate goal to improve performance.

MATH+ EF1-9

Jan 2021 to Dec 2022

Globally Optimal Neural Network Training

Training artificial neural networks is a key optimization task in deep learning. To improve generalization, robustness, and explainability, we aim to compute globally optimal solutions. We will use integer programming methods, exploiting mixed-integer nonlinear programming and enhancing solving techniques like spatial branch-and-cut. Additionally, we'll leverage symmetry to reduce computational burden and ensure symmetry in solutions, and incorporate true sparsity using a mixed-integer nonlinear programming framework.

GONNT

Mar 2021 to Feb 2022