Improve model card
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,13 +1,42 @@
|
|
| 1 |
---
|
| 2 |
-
language: en
|
| 3 |
-
tags:
|
| 4 |
-
- audio-classification
|
| 5 |
-
- medical
|
| 6 |
-
license: mit
|
| 7 |
base_model:
|
| 8 |
- MIT/ast-finetuned-audioset-10-10-0.4593
|
|
|
|
|
|
|
|
|
|
| 9 |
pipeline_tag: audio-classification
|
|
|
|
|
|
|
|
|
|
| 10 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
## Citation
|
| 12 |
-
If you use this model, please cite
|
| 13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
base_model:
|
| 3 |
- MIT/ast-finetuned-audioset-10-10-0.4593
|
| 4 |
+
language:
|
| 5 |
+
- en
|
| 6 |
+
license: mit
|
| 7 |
pipeline_tag: audio-classification
|
| 8 |
+
tags:
|
| 9 |
+
- audio-classification
|
| 10 |
+
- medical
|
| 11 |
---
|
| 12 |
+
|
| 13 |
+
# Geometry-Aware Optimization for Respiratory Sound Classification (AST + SAM)
|
| 14 |
+
|
| 15 |
+
This repository contains the model weights for the paper "[Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers](https://huggingface.co/papers/2512.22564)".
|
| 16 |
+
|
| 17 |
+
## Description
|
| 18 |
+
Respiratory sound classification is often hindered by limited datasets and class imbalance. This framework enhances the **Audio Spectrogram Transformer (AST)** by using **Sharpness-Aware Minimization (SAM)**. Instead of merely minimizing training loss, this approach optimizes the geometry of the loss surface, guiding the model toward flatter minima that generalize better to unseen patients. The method specifically aims to improve sensitivity, a crucial metric for reliable clinical screening.
|
| 19 |
+
|
| 20 |
+
## Key Results (ICBHI 2017 Official Split)
|
| 21 |
+
|
| 22 |
+
| Metric | Score |
|
| 23 |
+
| :--- | :--- |
|
| 24 |
+
| **Sensitivity (Se)** | **68.31%** |
|
| 25 |
+
| **Specificity (Sp)** | **67.89%** |
|
| 26 |
+
| **ICBHI Score** | **68.10%** |
|
| 27 |
+
|
| 28 |
+
## Links
|
| 29 |
+
- **GitHub Repository**: [Atakanisik/ICBHI-AST-SAM](https://github.com/Atakanisik/ICBHI-AST-SAM)
|
| 30 |
+
- **Paper**: [arXiv:2512.22564](https://arxiv.org/abs/2512.22564)
|
| 31 |
+
|
| 32 |
## Citation
|
| 33 |
+
If you use this model or code in your research, please cite:
|
| 34 |
+
|
| 35 |
+
```bibtex
|
| 36 |
+
@article{isik2025geometry,
|
| 37 |
+
title={Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers},
|
| 38 |
+
author={I\c{s}{\i}k, Atakan and Vulga I\c{s}{\i}k, Selin and I\c{s}{\i}k, Ahmet Feridun and Taylan, Mah\c{s}uk},
|
| 39 |
+
journal={arXiv preprint arXiv:2512.22564},
|
| 40 |
+
year={2025}
|
| 41 |
+
}
|
| 42 |
+
```
|