Description
As AI becomes an everyday partner and support tool in both professional and private domains, users and stakeholders are increasingly confronted with the question of interpretability. This work contributes to the search for answers by exploring the latent semantic structure of the internal layers of convolutional neural networks trained on image classification tasks. By combining supervised concept-based and unsupervised learning paradigms, we aim to discover semantically meaningful representations that can be contrasted with human-defined concepts (defined once per dataset). More specifically, we extracted layer-wise, network-inherent clusters using hierarchical agglomerative clustering. To evaluate their semantic fidelity, we trained auxiliary classifiers on both concept labels and cluster memberships and evaluated them across all layers.
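The following is a minimal sketch of the pipeline described above: per-layer hierarchical agglomerative clustering of activations, followed by auxiliary (probing) classifiers trained on cluster memberships and on concept labels. The layer names, cluster count, linkage choice, and the use of a logistic-regression probe are assumptions for illustration, not the authors' actual implementation; the activations and concept labels are random placeholders.

```python
# Hedged sketch (assumed setup, not the authors' code):
# cluster each layer's activations, then probe separability of clusters vs. concepts.
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
activations = {                                   # placeholder per-layer activations
    "conv3": rng.normal(size=(500, 64)),
    "conv5": rng.normal(size=(500, 128)),
    "fc":    rng.normal(size=(500, 256)),
}
concept_labels = rng.integers(0, 5, size=500)     # placeholder human-defined concepts


def probe_accuracy(features, targets):
    """Train an auxiliary linear classifier and report held-out accuracy."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        features, targets, test_size=0.2, random_state=0
    )
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    return accuracy_score(y_te, clf.predict(X_te))


for layer, acts in activations.items():
    # Network-inherent clusters (Ward linkage and 10 clusters are assumptions).
    clusters = AgglomerativeClustering(n_clusters=10, linkage="ward").fit_predict(acts)
    acc_clusters = probe_accuracy(acts, clusters)        # separability of clusters
    acc_concepts = probe_accuracy(acts, concept_labels)  # separability of concepts
    print(f"{layer}: clusters={acc_clusters:.3f}, concepts={acc_concepts:.3f}")
```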
Our experiments reveal higher classification accuracy for the clusters extracted from each layer than for the human-defined concepts, indicating better separability and suggesting that the clusters may capture patterns beyond human labels. Additionally, classification accuracy increases for both clusters and concepts toward the output layer. Beyond quantitative evaluation, we provide qualitative insights using visualization techniques such as UMAP projections and Concept Localization Maps. Our findings highlight the potential of hybrid approaches for post hoc explainability and point to promising directions in uncovering emergent structures within deep neural networks.
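As an illustration of the qualitative side, the sketch below projects one layer's activations to 2-D with UMAP and colors the points by cluster membership versus human-defined concept. All data, layer choices, and UMAP parameters here are placeholders and assumptions; Concept Localization Maps are not reproduced in this sketch.

```python
# Hedged sketch of a UMAP-based comparison (placeholder data, assumed parameters).
import numpy as np
import matplotlib.pyplot as plt
import umap  # pip install umap-learn
from sklearn.cluster import AgglomerativeClustering

rng = np.random.default_rng(0)
acts = rng.normal(size=(500, 128))                 # placeholder activations of one layer
concepts = rng.integers(0, 5, size=500)            # placeholder human-defined concepts
clusters = AgglomerativeClustering(n_clusters=10).fit_predict(acts)

# 2-D embedding of the high-dimensional activations.
emb = umap.UMAP(n_neighbors=15, min_dist=0.1, random_state=0).fit_transform(acts)

fig, axes = plt.subplots(1, 2, figsize=(10, 4))
axes[0].scatter(emb[:, 0], emb[:, 1], c=clusters, cmap="tab10", s=5)
axes[0].set_title("Network-inherent clusters")
axes[1].scatter(emb[:, 0], emb[:, 1], c=concepts, cmap="tab10", s=5)
axes[1].set_title("Human-defined concepts")
plt.tight_layout()
plt.show()
```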
| Department of the faculty | Katedra aplikovanej informatiky |
|---|---|
| Print poster | I hereby request the poster to be printed at the faculty |