It took me several hours of searching. I recently discovered sites where one can download the text of subtitles for thousands of movies and TV shows. At last, I’m using the text for NARCOS (netflix) to learn spanish, and importing the text into LingQ as private lessons. I had this idea back in March. I recently finished season 1, and now I’m rewatching it.
Anyway, the files are *.SRT
After hitting many dead-ends searching for “transcripts” and “captions,” the magic keywords to find NARCOS subtitle files by searching “NARCOS SRT”. So you could find your favorite series by searching " [series name] SRT "
You can open .SRT files as simple text (.txt) files. To make them more user friendly in LingQ for a private lesson, I use “find and replace” to delete the timestamp info. Takes about 10 minutes to format. A one hour episode of NARCOS seems to have about 4000 words, and at this point, about 400 new words for me.
can you explain a little more how you import the text, delete the time stamps and also how you study this material? i may try to do this if i find suitable content in german
Here is also an additional tip: Quite a few youtube videos now come with full subtitles. I use a site called keepvid.com where I can copy the youtube video link and then download the text and audio. Keep in mind that some clean up is needed with the SRT file as already mentioned above and in some cases you have to convert the audio file to MP3 (I use http://online-audio-converter.com/) before uploading it to Lingq.
If you study Spanish and like to use transcripts you might want to look up RTVE. RTVE (Spanish state TV) has transcripts for their programs. For example here is a link to a chapter of Cuentame como paso. Cuéntame cómo pasó - Temporada 8 - Capítulo 13 | Ver serie gratis
On the right side there is the full transcript. I move the curser so that it points at the transcript and press ctrl + a so that it copies the whole transcript. Then I paste it in a word document. On both ends of the pasted transcript there are images and some other text. I then get rid of the useless text and images so that I am left with the complete transcript and import it to lingq. On episode between 1.10 to 1.35 tends to mean about 7000 to 8000 words.