It appears that there are two issues that affect small parts of the captured datasets:
1) Colored character names are not handled properly. I looked for <th> tags, not <th bgcolor="beige"> tags.
2) Character names that start with a lowercase letter are not handled. This may be because other episodes use table headers beginning with a lowercase letter for stage directions; I have to double-check.
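
One way the first issue could be avoided is to match header cells by tag name only, ignoring attributes. The following is a minimal sketch using Python's standard html.parser, not the code that produced the datasets; the names HeaderCellCollector and extract_headers and the sample markup are made up for illustration. It keeps lowercase headers as they are, since whether those are names or stage directions still has to be checked.

```python
from html.parser import HTMLParser


class HeaderCellCollector(HTMLParser):
    """Hypothetical helper: collect the text of every <th> cell."""

    def __init__(self):
        super().__init__()
        self.headers = []      # text of each completed <th> cell
        self._buffer = None    # None while outside a <th> cell

    def handle_starttag(self, tag, attrs):
        # Match on the tag name only, so both <th> and
        # <th bgcolor="beige"> are picked up.
        if tag == "th":
            self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "th" and self._buffer is not None:
            self.headers.append("".join(self._buffer).strip())
            self._buffer = None


def extract_headers(html):
    """Return the text of all <th> cells in document order."""
    parser = HeaderCellCollector()
    parser.feed(html)
    parser.close()
    return parser.headers


# Made-up sample markup, not taken from the real transcripts.
sample = (
    "<table>"
    "<tr><th>ALICE</th><td>Hello.</td></tr>"
    '<tr><th bgcolor="beige">BOB</th><td>Hi.</td></tr>'
    "<tr><th>carol</th><td>Hey.</td></tr>"
    "</table>"
)
print(extract_headers(sample))  # ['ALICE', 'BOB', 'carol']
```

Filtering out stage directions would be a separate step applied to the returned list, once it is clear how the other episodes mark them.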