Impact of automatic segmentation on the quality, productivity and self-reported post-editing effort of intralingual subtitles
Title of edited book
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Year of publication
This paper describes the evaluation methodology used to measure the impact of using a machine learning algorithm to automatically segment intralingual subtitles. The segmentation quality, productivity and self-reported post-editing effort achieved with this approach are shown to improve on those obtained with the character-counting technique, which is currently the main method employed for automatic subtitle segmentation. The corpus used to train and test the proposed automated segmentation method is also described and shared with the community in order to foster further research in this area.