Hsoda030engsub Convert021021 Min Upd -

Both outputs retain the original line order and timestamps.

import pysubs2
subs = pysubs2.load("hsoda030engsub.ass", encoding="utf‑8")

| Subset | Description | Files | |--------|-------------|-------| | Core | hsoda030engsub and 99 similar SSA files (educational series) | 100 | | Extended | Random selection of 1 100 public domain SSA subtitles (various languages) | 1 100 | | Total | — | 1 200 | hsoda030engsub convert021021 min upd

| Limitation | Impact | Mitigation | |------------|--------|------------| | Karaoke tags (k) not representable in VTT/SRT | Minor visual loss for music videos | Store karaoke timing as separate VTT cues with class="karaoke" and provide custom CSS. | | Complex ASS vector drawing (p) | Not supported in plain‑text formats | Preserve as a comment for downstream processing (e.g., burn‑in). | | Non‑UTF‑8 source files | Decoding errors | Detect encoding automatically via chardet and re‑encode. | Both outputs retain the original line order and timestamps