
A dataset of simple yet diverse stories: an MNIST for language.

This paper presents a comprehensive taxonomy of interpretation and explanation methods developed for capsule network (CapsNet) architectures, analyzing their mechanisms, applicability, and performance across diverse problem domains.

Decomposing activations into sparse polynomials and using their geometry.

Sim makes post-hoc interpretability tools more effective through latent space constraints.

We show the applicability of twin network augmentation to convolutional neural networks.
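
As a reading aid, here is a minimal sketch of twin network augmentation as I understand it: two sibling networks are trained jointly, each with its own task loss, plus a term aligning their logits. The MSE alignment, the coupling weight, and the function names are assumptions, not the paper's exact recipe.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def tna_step(net_a: nn.Module, net_b: nn.Module, x, y,
             opt: torch.optim.Optimizer, alpha: float = 1.0):
    """One hypothetical twin-network-augmentation step: each twin
    minimizes its own cross-entropy while an MSE term pulls their
    logits together. `opt` is assumed to cover both twins' parameters;
    `alpha` is an illustrative coupling weight."""
    logits_a, logits_b = net_a(x), net_b(x)
    task = F.cross_entropy(logits_a, y) + F.cross_entropy(logits_b, y)
    align = F.mse_loss(logits_a, logits_b)  # twin-agreement term
    loss = task + alpha * align
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```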

We introduce a label-efficient approach for radio frequency fingerprint identification, achieving competitive accuracy with up to 10x fewer labels.

This paper provides a systematic and principled study of the interpretability of capsule network (CapsNet) representations, aiming to characterize the nature and structure of the learned features across diverse architectures and datasets.

This study investigates the internal mechanisms of matrix capsule networks with the EM routing algorithm.

Introducing a global SVD-like algorithm for multilinear models.
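
For context, the classical point of comparison for such methods is the higher-order SVD (HOSVD), which takes an ordinary SVD of each mode unfolding of a tensor. The NumPy sketch below shows HOSVD as an illustration of the problem setting only; it is not the paper's algorithm.

```python
import numpy as np

def hosvd(tensor: np.ndarray):
    """Higher-order SVD: an SVD-like decomposition for multilinear
    (tensor) models. Returns a core tensor plus one orthonormal factor
    matrix per mode. Illustrative baseline, not the paper's method."""
    factors = []
    for mode in range(tensor.ndim):
        # Unfold along `mode` and keep the left singular vectors.
        unfolded = np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)
        u, _, _ = np.linalg.svd(unfolded, full_matrices=False)
        factors.append(u)
    # Core tensor: project every mode onto its factor matrix.
    core = tensor
    for mode, u in enumerate(factors):
        core = np.moveaxis(
            np.tensordot(u.T, np.moveaxis(core, mode, 0), axes=1), 0, mode)
    return core, factors
```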

Using bilinear MLPs to reverse-engineer shallow MNIST and TinyStories models from their weights.

A novel training strategy for improved spiking neural networks and efficient weight quantization.

We use a per-token bias in sparse autoencoders (SAEs) to separate token reconstructions from interesting features.
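
A minimal PyTorch sketch of the idea as I read it: a standard sparse autoencoder whose decoder bias is a lookup table indexed by the current token id, so token-specific reconstruction is absorbed by the bias and the sparse features are left to capture the rest. The class name, bias placement, and sparsity coefficient are assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PerTokenBiasSAE(nn.Module):
    """Sparse autoencoder with a learned per-token bias (sketch).

    The per-token bias (one vector per vocabulary entry) soaks up
    token-specific reconstruction, freeing the sparse features for
    the more interesting, context-dependent signal."""

    def __init__(self, d_model: int, d_hidden: int, vocab_size: int):
        super().__init__()
        self.enc = nn.Linear(d_model, d_hidden)
        self.dec = nn.Linear(d_hidden, d_model, bias=False)
        # One bias vector per vocabulary token, not one shared bias.
        self.token_bias = nn.Embedding(vocab_size, d_model)

    def forward(self, acts: torch.Tensor, token_ids: torch.Tensor):
        bias = self.token_bias(token_ids)      # (batch, d_model)
        feats = F.relu(self.enc(acts - bias))  # sparse features
        recon = self.dec(feats) + bias         # re-add on decode
        loss = F.mse_loss(recon, acts) + 3e-4 * feats.abs().sum(-1).mean()
        return recon, loss
```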

Introducing bilinear MLPs as a new approach to weight-based interpretability.
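
For readers new to the layer: a bilinear MLP replaces the elementwise nonlinearity with an elementwise product of two linear maps, so the layer is exactly quadratic in its input and can be analyzed from the weight tensors alone. A minimal sketch with illustrative dimensions:

```python
import torch
import torch.nn as nn

class BilinearMLP(nn.Module):
    """Bilinear layer: out = W_out((W x) * (V x)).

    No elementwise nonlinearity, so the map is a polynomial in the
    weights and input -- the property that makes weight-based
    interpretability tractable. Sizes are illustrative."""

    def __init__(self, d_in: int, d_hidden: int, d_out: int):
        super().__init__()
        self.w = nn.Linear(d_in, d_hidden, bias=False)
        self.v = nn.Linear(d_in, d_hidden, bias=False)
        self.out = nn.Linear(d_hidden, d_out, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Elementwise product of two linear projections of x.
        return self.out(self.w(x) * self.v(x))
```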

Self-supervised contrastive learning for multi-label hyperspectral image classification.
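
As a generic illustration of the self-supervised contrastive objective (the paper's exact loss and hyperspectral-specific augmentations may differ), a standard InfoNCE/NT-Xent sketch over two augmented views of the same patches:

```python
import torch
import torch.nn.functional as F

def info_nce(z1: torch.Tensor, z2: torch.Tensor, tau: float = 0.1):
    """Generic InfoNCE loss between two views, (batch, dim) each.
    Matching rows are positives; all other rows act as negatives.
    Illustrative stand-in, not the paper's exact objective."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau                           # cosine sims
    targets = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, targets)              # diagonal = positives
```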

A deep learning model for hyperspectral remote sensing, shifting from traditional single-label, pixel-level classification to multi-label, patch-level analysis.

Three techniques to significantly improve the forward-forward algorithm. We achieve 84% accuracy on CIFAR-10.
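
For background on the base method (the paper's three improvements are not reproduced here): in Hinton's forward-forward algorithm each layer is trained locally, pushing a "goodness" score, commonly the sum of squared activations, above a threshold for positive data and below it for negative data, with no backward pass through the whole network. A minimal single-layer sketch with assumed hyperparameters:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FFLayer(nn.Module):
    """One locally trained forward-forward layer (after Hinton, 2022).
    Threshold, optimizer, and learning rate are illustrative."""

    def __init__(self, d_in: int, d_out: int, theta: float = 2.0):
        super().__init__()
        self.fc = nn.Linear(d_in, d_out)
        self.theta = theta
        self.opt = torch.optim.Adam(self.parameters(), lr=1e-3)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Normalize so goodness cannot leak in via input magnitude.
        x = x / (x.norm(dim=1, keepdim=True) + 1e-8)
        return F.relu(self.fc(x))

    def train_step(self, x_pos: torch.Tensor, x_neg: torch.Tensor):
        g_pos = self.forward(x_pos).pow(2).sum(dim=1)  # goodness
        g_neg = self.forward(x_neg).pow(2).sum(dim=1)
        # Push positive goodness above theta, negative goodness below.
        loss = F.softplus(torch.cat([self.theta - g_pos,
                                     g_neg - self.theta])).mean()
        self.opt.zero_grad()
        loss.backward()
        self.opt.step()
        # Detach so the next layer trains independently.
        return self.forward(x_pos).detach(), self.forward(x_neg).detach()
```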

This study presents a deep learning model for Arabic aspect-based sentiment analysis (ABSA) using gated recurrent units (GRU) combined with features from the multilingual universal sentence encoder (MUSE).
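
A minimal sketch of the described architecture shape, assuming precomputed MUSE vectors as a sequence of input features feeding a GRU with a polarity head; the 512-dim feature size, bidirectionality, and 3-way output are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class AspectSentimentGRU(nn.Module):
    """GRU classifier over precomputed MUSE features (sketch).
    Feature size and the 3-way polarity head are assumptions."""

    def __init__(self, feat_dim: int = 512, hidden: int = 128,
                 n_classes: int = 3):
        super().__init__()
        self.gru = nn.GRU(feat_dim, hidden, batch_first=True,
                          bidirectional=True)
        self.head = nn.Linear(2 * hidden, n_classes)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, seq_len, feat_dim) precomputed MUSE vectors
        _, h = self.gru(feats)               # h: (2, batch, hidden)
        h = torch.cat([h[0], h[1]], dim=-1)  # both directions
        return self.head(h)                  # polarity logits
```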