-
b9ca701d69
.wav conversion, handle audio with no detected speech
Max Bain
2023-03-31 23:02:38 +01:00
-
d0fa028045
fix tfile naming
Max Bain
2023-03-30 19:24:42 +01:00
-
ae4a9de307
add vad model external dl
Max Bain
2023-03-30 18:57:55 +01:00
-
18b63d46e2
skeleton v2
Max Bain
2023-03-30 05:31:57 +01:00
-
1e7c2c337b
Merge pull request #148 from FernanOrtega/main
m-bain
2023-03-24 07:57:43 -07:00
-
33dd3b9bcd
Update decoding.py
Fernando O. Gallego
2023-03-24 11:56:41 +01:00
-
d1b4ff8228
Merge pull request #114 from mshakirDr/patch-1
m-bain
2023-03-23 15:12:09 -07:00
-
d31f6e0b8a
Merge branch 'm-bain:main' into danish_alignment
Marcus Brandt
2023-03-06 10:52:47 +01:00
-
809700e286
remove soundfile version constraint
m-bain
2023-03-06 00:20:31 +00:00
-
cea42ca470
Fix hugging face error
Muhammad Shakir
2023-03-04 19:12:13 +01:00
-
c8404d9805
added a danish alignment model
Marcus Brandt
2023-03-04 13:20:40 +01:00
-
d1d420e70c
Merge pull request #111 from Barabazs/patch-1
m-bain
2023-03-04 11:46:57 +00:00
-
844eb30710
fix: force soundfile version update for mp3 support
Barabazs
2023-03-04 11:01:26 +01:00
-
31e6fe7e36
Merge pull request #107 from JCGoran/fix/python3.7_compatibility
m-bain
2023-03-02 15:31:36 +00:00
-
cfcede41f6
Added Python 3.7 compatibility
JCGoran
2023-03-02 15:09:02 +01:00
-
186b06e032
paper drop
m-bain
2023-03-02 12:04:16 +00:00
-
847a3cd85b
Merge pull request #96 from smly/fix-batch-processing
m-bain
2023-02-22 12:11:01 +00:00
-
2b1ffa12b8
Merge pull request #97 from smly/gpu-vad-filter
m-bain
2023-02-21 18:57:14 +00:00
-
57f5957e0e
Pass device to pyannote.audio.Inference
smly
2023-02-22 03:48:20 +09:00
-
27fe502344
Fix assertion error in batch processing
smly
2023-02-22 02:45:13 +09:00
-
f7093e60d3
Merge pull request #90 from Pikauba/translation_starting_point_improvement
m-bain
2023-02-18 21:59:57 +00:00
-
a1d2229416
Improvement to transcription starting point with VAD
Antoine Dufour
2023-02-18 11:12:23 -05:00
-
4cb167a225
Merge pull request #74 from Camb-ai/level-bug-fix
m-bain
2023-02-14 19:22:22 +00:00
-
2e307814dd
added if clause for checking
arnavmehta7
2023-02-10 14:48:51 +05:30
-
d687cf3358
Merge pull request #58 from MahmoudAshraf97/main
m-bain
2023-02-01 22:11:51 +00:00
-
0a3fd11562
update readme
Max Bain
2023-02-01 22:09:11 +00:00
-
29e95b746b
Merge pull request #57 from TengdaHan/main
m-bain
2023-02-01 20:37:54 +00:00
-
039af89a86
support batch processing
Tengda Han
2023-02-01 19:41:20 +00:00
-
9f26112d5c
added turkish wav2vec2 model
Mahmoud Ashraf
2023-02-01 21:38:50 +02:00
-
fd2a093754
Merge pull request #55 from jonatasgrosman/main
m-bain
2023-02-01 10:27:45 +00:00
-
31f069752f
Merge pull request #53 from MahmoudAshraf97/main
m-bain
2023-02-01 10:27:25 +00:00
-
4cdf7ef856
Merge pull request #48 from Barabazs/main
m-bain
2023-02-01 10:26:58 +00:00
-
d294e29ad9
fix: error when loading huggingface model with embedded language model
Jonatas Grosman
2023-01-31 23:24:26 -03:00
-
0eae9e1f50
added several wav2vec2 models by jonatasgrosman
Mahmoud Ashraf
2023-02-01 03:02:10 +02:00
-
1b08661e42
change arabic model to jonatasgrosman
Mahmoud Ashraf
2023-01-31 19:32:31 +02:00
-
a49799294b
add arabic wav2vec2 model from elgeish
Mahmoud Ashraf
2023-01-31 19:07:48 +02:00
-
d83c74a79f
doc: format checklist
Barabazs
2023-01-29 16:07:58 +01:00
-
acaefa09a1
Merge pull request #46 from Barabazs/main
m-bain
2023-01-28 19:05:36 +00:00
-
76f79f600a
fix short seg timestamps bug
Max Bain
2023-01-28 19:04:19 +00:00
-
33073f9bba
Create FUNDING.yml
Barabazs
2023-01-28 19:43:27 +01:00
-
50f3965fdb
fix tsv file ext
Max Bain
2023-01-28 17:39:07 +00:00
-
df2b1b70cb
increase vad cut default
Max Bain
2023-01-28 14:49:53 +00:00
-
c19cf407d8
handle non-alignable whole segments
Max Bain
2023-01-28 13:53:03 +00:00
-
8081ef2dcd
add custom vad binarization for vad cut
Max Bain
2023-01-28 00:22:33 +00:00
-
c6dbac76c8
cut up vad segments when too long to prevent OOM
Max Bain
2023-01-28 00:01:39 +00:00
-
69673eb39b
buy-me-a-coffee
Max Bain
2023-01-27 15:12:49 +00:00
-
5b8c8a7bd3
pandas fix
Max Bain
2023-01-27 15:05:08 +00:00
-
7f2159a953
Merge branch 'main' of https://github.com/m-bain/whisperX into main
Max Bain
2023-01-26 10:46:36 +00:00
-
16d24b1c96
only pad timestamps if not using VAD
Max Bain
2023-01-26 10:46:13 +00:00
-
d20a2a4ea2
typo in --diarize flag
m-bain
2023-01-26 10:28:54 +00:00
-
312f1cc50c
Merge pull request #40 from MahmoudAshraf97/main
m-bain
2023-01-26 00:34:03 +00:00
-
99b6e79fbf
Update README.md
Mahmoud Ashraf
2023-01-26 00:56:10 +02:00
-
e7773358a3
Update transcribe.py
Mahmoud Ashraf
2023-01-26 00:42:35 +02:00
-
6b2aa4ff3e
Merge pull request #1 from MahmoudAshraf97/patch-1
Mahmoud Ashraf
2023-01-26 00:37:38 +02:00
-
c3de5e9580
Update README.md
Mahmoud Ashraf
2023-01-26 00:36:29 +02:00
-
58d7191949
add diarize
Max Bain
2023-01-25 19:40:41 +00:00
-
286a2f2c14
clean up logic, use pandas where possible
Max Bain
2023-01-25 18:42:52 +00:00
-
eec6d1f8d8
missing word timestamps
Max Bain
2023-01-24 16:37:19 +00:00
-
d1600e5b0f
Merge branch 'main' of https://github.com/m-bain/whisperX into main
Max Bain
2023-01-24 15:38:05 +00:00
-
d395c21b83
new logic, diarization, vad filtering
Max Bain
2023-01-24 15:02:08 +00:00
-
ba102feb7f
vad filter
Max Bain
2023-01-20 12:54:20 +00:00
-
4569cb982a
fix file_ass display bug
m-bain
2023-01-12 12:57:12 +00:00
-
ce281eb6f6
Merge pull request #28 from aosfatos/update/wav2vec2-large-xlsr-53-portuguese
m-bain
2023-01-12 09:10:02 +00:00
-
7adead16e0
Update pt model to wav2vec2-large-xlsr-53-portuguese
Rhenan Bartels
2023-01-11 19:50:34 -03:00
-
a4edb130ef
Merge pull request #27 from FelippeChemello/main
m-bain
2023-01-11 15:35:15 +00:00
-
7459bf8ad0
Add PT (pt-br) align support
Felippe Chemello
2023-01-11 12:11:41 -03:00
-
d51353a4b6
uncomment .ass
m-bain
2023-01-08 18:02:36 +00:00
-
78c87d3bfd
handle negative / tiny duration segments, final
Max Bain
2023-01-08 14:01:10 +00:00
-
a6eb33778b
additional waveform segment check
Max Bain
2023-01-08 12:24:35 +00:00
-
857bcca238
Merge branch 'main' of https://github.com/m-bain/whisperX into main
Max Bain
2023-01-07 15:00:22 +00:00
-
44b62064f6
fix starting timestamp for multiple fail-to-aligned words
Max Bain
2023-01-07 14:59:11 +00:00
-
2aa074e0e6
remove duplicate line
m-bain
2023-01-05 13:12:11 +00:00
-
5a668a7d80
fallback on whisper alignment failures, update readme
Max Bain
2023-01-05 11:15:19 +00:00
-
93d661f2e4
fix whisper hallucination outside of audio length
m-bain
2022-12-29 10:54:23 +00:00
-
644b04e8d1
Merge pull request #13 from egorsmkv/patch-1
m-bain
2022-12-25 20:32:56 +00:00
-
97526f1111
Add Ukrainian wav2vec2 model
Yehor Smoliakov
2022-12-24 15:05:13 +02:00
-
c912f96ed3
add examples.md
Max Bain
2022-12-23 00:50:32 +00:00
-
c6fa7df3cc
add chinese, dutch. python usage. readme update
Max Bain
2022-12-23 00:41:12 +00:00
-
e909f2f766
support huggingface + model select based on lang.
Max Bain
2022-12-20 19:54:55 +00:00
-
8b2f40d02a
Change resolution for Japanese example video
Yasutaka Odo
2022-12-21 02:44:11 +09:00
-
93e568b3bf
replace magic strings
Yasutaka Odo
2022-12-21 02:11:08 +09:00
-
d7546def91
fix errors when recognizing non-japanese audios
Yasutaka Odo
2022-12-21 01:42:45 +09:00
-
7a3575dfdf
fix formatting
Yasutaka Odo
2022-12-21 01:23:02 +09:00
-
c8000f6e3f
resolve conflicts
Yasutaka Odo
2022-12-21 01:20:35 +09:00
-
2ad813c57d
undo unnecessary formatting
Yasutaka Odo
2022-12-21 01:03:52 +09:00
-
080dc1b337
fix path for sample japanese audio
Yasutaka Odo
2022-12-21 00:42:39 +09:00
-
f6cd4b9bed
correct example command for japanese
Yasutaka Odo
2022-12-21 00:39:48 +09:00
-
0e61f10443
Merge branch 'main' of https://github.com/yasutak/whisperX into main
Yasutaka Odo
2022-12-21 00:31:38 +09:00
-
5d7c3b521c
fix error message
Yasutaka Odo
2022-12-20 22:29:18 +09:00
-
cbaeb85034
restructure readme,
Max Bain
2022-12-19 19:41:39 +00:00
-
228b857597
add back word .srt, update readme
Max Bain
2022-12-19 19:12:50 +00:00
-
398ede98d5
Add missing space
Yasutaka Odo
2022-12-20 04:02:25 +09:00
-
d1288204c8
add sample Japanese audio
Yasutaka Odo
2022-12-19 23:41:23 +09:00
-
f00e9cb149
add sample audio for japanese example
Yasutaka Odo
2022-12-19 23:40:05 +09:00
-
5437be01db
Update README.md
Yasutaka Odo
2022-12-19 23:30:01 +09:00
-
1c8ea78975
Update README.md
Yasutaka Odo
2022-12-19 23:12:04 +09:00
-
479088ac31
Update README.md
Yasutaka Odo
2022-12-19 23:09:05 +09:00
-
62adf12d4e
add preliminary japanese support
Yasutaka Odo
2022-12-19 22:34:59 +09:00
-
50cda426ba
add preliminary japanese support
Yasutaka Odo
2022-12-19 22:28:28 +09:00
-
6b64cb079a
add arch figure, citation
m-bain
2022-12-18 18:43:33 +00:00