Downsampling #1888

Merged
TParcollet merged 31 commits into speechbrain:develop from salah-zaiem:downsampling
Mar 24, 2023
Conversation

@salah-zaiem

Collaborator

This PR adds code for the best-performing technique from the paper "Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study" (https://arxiv.org/abs/2303.06740). It enables sequence downsampling during fine-tuning of SSL models, which lowers inference time with only a small drop in performance.
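
To make the idea of sequence downsampling concrete, here is a toy sketch in plain Python. It assumes average pooling over non-overlapping windows, which is only one possible way to shorten a sequence; this thread does not state which method the PR implements, and the function name `average_pool_downsample` is hypothetical, not part of SpeechBrain.

```python
def average_pool_downsample(signal, factor):
    """Shorten a sequence by averaging non-overlapping windows.

    Toy illustration only: `factor` consecutive samples are replaced by
    their mean, so the output is roughly `factor` times shorter.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    # Drop the incomplete window at the end, if any.
    usable = len(signal) - len(signal) % factor
    return [
        sum(signal[i:i + factor]) / factor
        for i in range(0, usable, factor)
    ]


signal = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0, 12.0]
shorter = average_pool_downsample(signal, factor=2)
print(len(signal), "->", len(shorter))  # prints "7 -> 3"
```

A shorter input sequence means fewer frames for the SSL model to process, which is where the inference-time saving would come from.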

@salah-zaiem salah-zaiem reopened this Mar 16, 2023

@TParcollet TParcollet left a comment

Collaborator

Thanks! See my comments.

Comment thread recipes/LibriSpeech/ASR/CTC/README.md Outdated
Comment thread recipes/LibriSpeech/ASR/CTC/README.md Outdated
Comment thread recipes/LibriSpeech/ASR/CTC/README.md Outdated
Comment thread recipes/LibriSpeech/ASR/CTC/extra_requirements.txt Outdated
Comment thread speechbrain/lobes/downsampling.py
Comment thread speechbrain/lobes/downsampling.py
Comment thread speechbrain/lobes/downsampling.py
Comment thread speechbrain/lobes/downsampling.py
Comment thread speechbrain/lobes/downsampling.py
Comment thread tests/recipes/LibriSpeech.csv Outdated
from speechbrain.utils.distributed import run_on_main
from hyperpyyaml import load_hyperpyyaml
from pathlib import Path
from pyctcdecode import build_ctcdecoder

Collaborator

It should be optional, no? As it stands, it is mandatory to pip install pyctcdecode in order to use the CTC wav2vec...

Collaborator Author

Yes, it should be optional. I will move the import later in the code.
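
For reference, one common way to make a dependency like this optional is to defer the import until the feature is actually requested. The sketch below is an assumption about how that could look, not the change that was merged; `optional_import` and `get_ctc_decoder_builder` are hypothetical helper names.

```python
import importlib


def optional_import(module_name, attr_name, purpose):
    """Import `module_name` lazily and return its attribute `attr_name`.

    Raises an ImportError with an install hint if the module is missing,
    so users who never need the feature never need the package.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError as err:
        raise ImportError(
            f"{module_name} is only needed for {purpose}. "
            f"Install it with `pip install {module_name}`."
        ) from err
    return getattr(module, attr_name)


def get_ctc_decoder_builder():
    # Hypothetical helper: call this only when LM-based decoding is enabled,
    # instead of importing pyctcdecode at the top of the training script.
    return optional_import(
        "pyctcdecode", "build_ctcdecoder", "language-model CTC decoding"
    )
```

With this pattern the recipe can be imported and run without pyctcdecode installed, and the error only appears if the decoder is actually requested.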

@TParcollet TParcollet left a comment

Collaborator

LGTM

@TParcollet TParcollet merged commit 2f86201 into speechbrain:develop Mar 24, 2023
4 participants