Meysam Shamsi / s4d
Commit 1fa2d4a1, authored Nov 27, 2020 by Meysam Shamsi
add config files for VAD
Parent: 7c40ba23
Changes: 3
dihard_diaboliic_dev.yaml
0 → 100644
data_file_extension: .wav
# wav_dir: /lium/raid01_c/dihard3/wav/
wav_dir: /lium/raid01_b/mshamsi/dihard3/data/dev/wav_diaboliic_dev/
#mdtm_dir: /lium/raid01_b/mlebour/GEM/expes/09-20/dihard_lstm/results/mdtm/
mdtm_dir: /lium/raid01_b/mshamsi/dihard3/data/dev/mdtm_diaboliic_dev/
sample_rate: 16000
output_rate: 100
validation_ratio: 0.1
batch_size: 64
seed: 1234
mode: vad
filter_type: gate
collar_duration: 0.025
train:
    duration: 2.
    chunk_per_segment: 5
    overlap: 0.5
    transformation:
        pipeline: MFCC
        spec_aug: 0.5
        temp_aug: 0.5
        noise_file_ratio: 0.8
        noise_snr: [5.0, 15.0]
        noise_db_csv: list/musan.csv
        noise_root_db: ./data/musan/
        reverb_file_ratio: 0.0,
        reverb_depth: [2.0, 10.0]
        reverb_width: [1.0, 10.0]
        reverb_height: [2.0, 5.0]
        reverb_absorption: [0.2, 0.9]
        reverb_noise: None,
        reverb_snr: [5.0, 15.0]
eval:
    duration: 2.
    transformation:
        pipeline: MFCC
        spec_aug: 0.5
        temp_aug: 0.5
    augmentation:
        spec_aug: 0.0
        temp_aug: 0.0
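To make the timing parameters in this file concrete, the sketch below works out the sizes they imply. It assumes, without confirmation from the commit, that `sample_rate` is in Hz, that `output_rate` is the number of label frames per second, and that `duration`, `overlap` and `collar_duration` are in seconds. The function name `segment_geometry` is hypothetical and not part of s4d.

```python
def segment_geometry(sample_rate, output_rate, duration, overlap, collar_duration):
    """Derive sample and frame counts from the config values.

    Assumed units (not stated in the commit): Hz for the rates, seconds otherwise.
    """
    return {
        "samples_per_segment": round(sample_rate * duration),
        "frames_per_segment": round(output_rate * duration),
        "samples_per_frame": sample_rate // output_rate,
        "overlap_frames": round(output_rate * overlap),
        "collar_samples": round(sample_rate * collar_duration),
    }


# Values copied from the top level and the train section of the config.
geometry = segment_geometry(
    sample_rate=16000,
    output_rate=100,
    duration=2.0,
    overlap=0.5,
    collar_duration=0.025,
)
for name, value in geometry.items():
    print(f"{name}: {value}")
```

Under these assumptions a 2-second training segment covers 32000 audio samples and 200 label frames, so each label frame spans 160 samples.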
dihard_diaboliic_train.yaml
0 → 100644
data_file_extension: .wav
# wav_dir: /lium/raid01_c/dihard3/wav/
wav_dir: /lium/raid01_b/mshamsi/dihard3/data/dev/wav_diaboliic_train/
#mdtm_dir: /lium/raid01_b/mlebour/GEM/expes/09-20/dihard_lstm/results/mdtm/
mdtm_dir: /lium/raid01_b/mshamsi/dihard3/data/dev/mdtm_diaboliic_train/
sample_rate: 16000
output_rate: 100
validation_ratio: 0.1
batch_size: 64
seed: 1234
mode: vad
filter_type: gate
collar_duration: 0.025
train:
    duration: 2.
    chunk_per_segment: 5
    overlap: 0.5
    transformation:
        pipeline: MFCC
        spec_aug: 0.5
        temp_aug: 0.5
        noise_file_ratio: 0.8
        noise_snr: [5.0, 15.0]
        noise_db_csv: list/musan.csv
        noise_root_db: ./data/musan/
        reverb_file_ratio: 0.0,
        reverb_depth: [2.0, 10.0]
        reverb_width: [1.0, 10.0]
        reverb_height: [2.0, 5.0]
        reverb_absorption: [0.2, 0.9]
        reverb_noise: None,
        reverb_snr: [5.0, 15.0]
eval:
    duration: 2.
    transformation:
        pipeline: MFCC
        spec_aug: 0.5
        temp_aug: 0.5
    augmentation:
        spec_aug: 0.0
        temp_aug: 0.0
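Both dataset configs write `reverb_file_ratio` as `0.0,` and `reverb_noise` as `None,`, with a trailing comma. In YAML a plain scalar may contain a comma, so a standard loader would read these as the strings `0.0,` and `None,` rather than a float and a null (YAML spells null as `null` or `~`). Whether s4d reads these keys when the pipeline is only `MFCC` is not shown in the commit. The following sketch, using only the standard library, flags such values; the helper name `find_trailing_commas` is hypothetical, and the regular expression is a heuristic rather than a YAML parser.

```python
import re

# Heuristic: "key: value," where the value is a plain scalar (no list,
# mapping, or comment characters) that ends with a comma.
TRAILING_COMMA = re.compile(r"^\s*([\w.-]+)\s*:\s*([^\[\]{}#]*?),\s*$")


def find_trailing_commas(yaml_text):
    """Return (line_number, key, value) for scalars followed by a stray comma."""
    hits = []
    for number, line in enumerate(yaml_text.splitlines(), start=1):
        match = TRAILING_COMMA.match(line)
        if match:
            hits.append((number, match.group(1), match.group(2).strip()))
    return hits


# Excerpt of the keys as they appear in the committed configs.
sample = """\
noise_snr: [5.0, 15.0]
reverb_file_ratio: 0.0,
reverb_noise: None,
reverb_snr: [5.0, 15.0]
"""

stray = find_trailing_commas(sample)
for number, key, value in stray:
    print(f"line {number}: {key} has value {value!r} followed by a comma")
```

Flow sequences such as `[5.0, 15.0]` are deliberately skipped, since commas are legitimate separators there.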
sts.yaml
0 → 100644
feature_size: 30
loss: cce
sequence_to_sequence:
    blstm_sizes: 128
post_processing:
    lin1:
        output: 50
    activation1: True
    lin2:
        output: 50
    activation2: True
    lin3:
        output: 2
#    softmax: True
\ No newline at end of file
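The layer sizes in sts.yaml can be followed through the network with a small dimension trace. This is a sketch under stated assumptions, not s4d's model-building code: it assumes that `lin1`, `activation1`, `lin2`, `activation2` and `lin3` are children of `post_processing`, that the recurrent layer is bidirectional and concatenates both directions (so its output width is twice `blstm_sizes`), and that activation entries leave the feature dimension unchanged. The function name `trace_dimensions` is hypothetical.

```python
def trace_dimensions(cfg, bidirectional=True):
    """Return (layer_name, input_size, output_size) for each sized layer."""
    layers = []
    size = cfg["feature_size"]
    hidden = cfg["sequence_to_sequence"]["blstm_sizes"]
    # Assumption: a bidirectional LSTM concatenates forward and backward states.
    out = 2 * hidden if bidirectional else hidden
    layers.append(("blstm", size, out))
    size = out
    for name, spec in cfg["post_processing"].items():
        if name.startswith("lin"):
            layers.append((name, size, spec["output"]))
            size = spec["output"]
        # Entries named activationN are assumed not to change the dimension.
    return layers


# Values copied from sts.yaml; the nesting is an assumption.
cfg = {
    "feature_size": 30,
    "loss": "cce",
    "sequence_to_sequence": {"blstm_sizes": 128},
    "post_processing": {
        "lin1": {"output": 50},
        "activation1": True,
        "lin2": {"output": 50},
        "activation2": True,
        "lin3": {"output": 2},
    },
}

shapes = trace_dimensions(cfg)
for name, n_in, n_out in shapes:
    print(f"{name}: {n_in} -> {n_out}")
```

The final width of 2 would fit a two-class speech/non-speech output trained with the `cce` loss, which is presumably categorical cross-entropy, though the commit does not say so.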