sqIRL
2025
Parameterized synthetic text generation with simplestories

Parameterized synthetic text generation with simplestories

A dataset full of simple yet diverse stories; the MNIST for language

NeurIPS'25
A taxonomy of interpretation and explanation methods for capsule network architectures

A taxonomy of interpretation and explanation methods for capsule network architectures

This paper presents a comprehensive taxonomy of interpretation and explanation methods developed for capsule network (capsnet) architectures, analyzing their mechanisms, applicability, and performance across diverse problem domains.

Neurocomputing
Finding manifolds with bilinear autoencoders

Finding manifolds with bilinear autoencoders

Decomposing activations into sparse polynomials and using their geometry

NeurIPS'25
spotlightworkshop
Explaining and interpreting hyperdimensional computing classifiers on tabular data

Explaining and interpreting hyperdimensional computing classifiers on tabular data

We make hdc classifiers for tabular data more interpretable

Neurocomputing
Smooth infomax - towards easier post-hoc interpretability

Smooth infomax - towards easier post-hoc interpretability

Sim makes post-hoc interpretability tools more effective through latent space constraints

ECML-PKDD'25
Svebi: towards the interpretation and explanation of spiking neural networks

Svebi: towards the interpretation and explanation of spiking neural networks

A posthoc explanation method for spiking neural networks

ECML-PKDD'25
Improving neural network accuracy by concurrently training with a twin network

Improving neural network accuracy by concurrently training with a twin network

We show the applicability of twin network augmentation on convolutional neural networks

ICLR'25
Label-efficient learning for radio frequency fingerprint identification

Label-efficient learning for radio frequency fingerprint identification

We introduce a label-efficient approach for radio frequency fingerprint identification, achieving competitive accuracy with up to 10x fewer labels.

IEEE WCNC'25
Towards the characterization of representations learned via capsule-based network architectures

Towards the characterization of representations learned via capsule-based network architectures

This paper provides a systematic and principled study on the interpretability of capsule network (capsnet) representations, aiming to characterize the nature and structure of the learned features across diverse architectures and datasets

Neurocomputing
2024
Compositionality unlocks deep interpretable models

Compositionality unlocks deep interpretable models

Introducing a global svd-like algorithm for multi-linear models.

AAAI'25
workshop
Bilinear MLPs enable weight-based mechanistic interpretability

Bilinear MLPs enable weight-based mechanistic interpretability

Using bilinear MLPs to reverse-engineer shallow MNIST and tiny stories models from their weights.

ICLR'25
spotlight
Twin network augmentation

Twin network augmentation

A novel training strategy for improved spiking neural networks and efficient weight quantization.

arXiv
Tokenized SAEs: disentangling SAE reconstructions

Tokenized SAEs: disentangling SAE reconstructions

We use a per-token bias in SAEs to separate token reconstructions from interesting features.

ICML'24
workshop
Weight-based decomposition: a case for bilinear MLPs

Weight-based decomposition: a case for bilinear MLPs

Introducing bilinear MLPs as a new approach to weight-based interpretability.

ICML'24
workshop
On the coherency of quantitative evaluation of visual explanations

On the coherency of quantitative evaluation of visual explanations

Measuring the correlation between different quantitative evaluation metrics for visual explanations.

CVIU vol. 241
Recognizing actions in high-resolution low-framerate videos: a feasibility study in the construction sector

Recognizing actions in high-resolution low-framerate videos: a feasibility study in the construction sector

This study investigates the applicability of established action recognition methodologies in the dynamic setting of the construction sector.

VISAPP'24
Deep learning model compression for resource efficient activity recognition on edge devices: a case study

Deep learning model compression for resource efficient activity recognition on edge devices: a case study

This paper presents an approach to adapt an existing activity recognition model for efficient deployment on edge devices.

VISAPP'24
A contrastive learning method for multi-label predictors on hyperspectral images

A contrastive learning method for multi-label predictors on hyperspectral images

Self-supervised contrastive learning for multi-label hyperspectral image classification

WHISPERS
2023