Document memory access fault on long audio

This commit is contained in:
cjs
2025-12-27 04:10:59 +00:00
parent 7d70dcafb6
commit 0c6bdd9fbe

31
ROCM.md
View File

@@ -124,3 +124,34 @@ Set `HSA_OVERRIDE_GFX_VERSION` based on your GPU:
## Upstream ## Upstream
- Original: [m-bain/whisperX](https://github.com/m-bain/whisperX) - Original: [m-bain/whisperX](https://github.com/m-bain/whisperX)
## Known Issues
### Memory Access Fault on Long Audio
When transcribing longer audio files (>60s), you may encounter:
```
Memory access fault by GPU node-1 (Agent handle: 0x...) on address 0x...
Reason: Page not present or supervisor privilege.
```
**Status**: Under investigation. Short clips (~60s) work fine at ~28x realtime with small model.
**Workaround**: Process audio in chunks, or use CPU mode for long files.
**Working example** (first 60s):
```python
from faster_whisper import WhisperModel
model = WhisperModel("small", device="cuda", compute_type="float16")
segments, info = model.transcribe("audio.wav", language="en", clip_timestamps=[0, 60])
```
**Search terms for updates**:
- `"Memory access fault by GPU node" "Page not present or supervisor privilege" ROCm 7.1 PyTorch site:github.com`
- `"Memory access fault" ROCm CTranslate2 faster-whisper gfx1101`
This may be related to:
- ROCm 7.1.1 + PyTorch nightly (2.11.0+rocm7.0) incompatibility
- GPU memory fragmentation with longer sequences
- HIP/ROCm memory management issues with certain operations