Hsoda030engsub Convert021021 Min Upd
Both outputs retain the original line order and timestamps.
import pysubs2
subs = pysubs2.load("hsoda030engsub.ass", encoding="utf-8")
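
To illustrate what "retaining line order and timestamps" means in practice, here is a minimal standard-library sketch of how the same millisecond timings could be written out as SRT and WebVTT. The helper names `format_timestamp` and `render_cues` are hypothetical and are not part of pysubs2; the only format difference shown is the one the two specifications are known for: SRT separates milliseconds with a comma and numbers its cues, while WebVTT uses a period and starts with a `WEBVTT` header.

```python
def format_timestamp(ms: int, fmt: str = "srt") -> str:
    """Render a millisecond offset as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (VTT)."""
    if fmt not in ("srt", "vtt"):
        raise ValueError(f"unsupported format: {fmt}")
    if ms < 0:
        raise ValueError("timestamp must be non-negative")
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1_000)
    sep = "," if fmt == "srt" else "."
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}{sep}{millis:03d}"


def render_cues(cues, fmt: str = "srt") -> str:
    """Serialize (start_ms, end_ms, text) tuples in their original order."""
    lines = ["WEBVTT", ""] if fmt == "vtt" else []
    for index, (start, end, text) in enumerate(cues, start=1):
        if fmt == "srt":
            lines.append(str(index))  # SRT cues carry a running counter
        lines.append(f"{format_timestamp(start, fmt)} --> {format_timestamp(end, fmt)}")
        lines.append(text)
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)
```

Because both serializers walk the same list without sorting or rounding, the order and timing of the source events carry over unchanged to either output.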
| Subset | Description | Files |
|--------|-------------|-------|
| Core | hsoda030engsub and 99 similar SSA files (educational series) | 100 |
| Extended | Random selection of 1,100 public-domain SSA subtitles (various languages) | 1,100 |
| Total | - | 1,200 |
| Limitation | Impact | Mitigation |
|------------|--------|------------|
| Karaoke tags (`\k`) not representable in VTT/SRT | Minor visual loss for music videos | Store karaoke timing as separate VTT cues with `class="karaoke"` and provide custom CSS. |
| Complex ASS vector drawing (`\p`) | Not supported in plain-text formats | Preserve as a comment for downstream processing (e.g., burn-in). |
| Non-UTF-8 source files | Decoding errors | Detect encoding automatically via `chardet` and re-encode. |
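
The karaoke mitigation in the table can be sketched as follows. This is an illustrative, standard-library-only example and not the pipeline's actual implementation: the function names `split_karaoke` and `to_vtt_cues` are hypothetical, and the sketch assumes each override block contains only a single karaoke tag (`\k`, `\K`, `\kf`, or `\ko`) whose duration is given in centiseconds, as in standard ASS. Each syllable becomes its own WebVTT cue wrapped in a `<c.karaoke>` class span, which custom CSS can then target.

```python
import re

# Matches an override block holding one karaoke tag, e.g. {\k50} or {\kf30}.
K_TAG = re.compile(r"\{\\(?:k[fo]?|K)(\d+)\}")


def vtt_time(ms: int) -> str:
    """Format milliseconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}.{millis:03d}"


def split_karaoke(text: str, start_ms: int):
    """Return (start_ms, end_ms, syllable) tuples for one ASS dialogue line."""
    # re.split with one capture group yields [prefix, dur1, syl1, dur2, syl2, ...]
    parts = K_TAG.split(text)
    cues = []
    cursor = start_ms
    for duration_cs, syllable in zip(parts[1::2], parts[2::2]):
        end = cursor + int(duration_cs) * 10  # centiseconds to milliseconds
        cues.append((cursor, end, syllable))
        cursor = end
    return cues


def to_vtt_cues(text: str, start_ms: int):
    """Render each syllable as a separate WebVTT cue with class "karaoke"."""
    return [
        f"{vtt_time(start)} --> {vtt_time(end)}\n<c.karaoke>{syllable}</c>"
        for start, end, syllable in split_karaoke(text, start_ms)
    ]
```

For example, `to_vtt_cues(r"{\k50}Hel{\k30}lo", 1000)` yields two cues covering 1.0 to 1.5 s and 1.5 to 1.8 s. Text before the first karaoke tag is ignored in this sketch, and lines mixing karaoke with other override tags would need a fuller parser.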