Word frequencies are used in many text mining and information retrieval applications. Nowadays, and thanks to the Internet, it is easy to find lists of word frequencies for various languages.
The Gutenberg list was built from the Gutenberg project book collection. This is a collection of classic books that fell into the public domain. The list has been last updated in 2005. All books in the considered collection were published before 1923. The TV movies list was built from TV shows and movies scripts and transcripts.