amathguywhocodes/day-45-100-movies-to-watch

Analysis updated 2026-06-24

★ 0PythonAudience · developerComplexity · 1/5Setup · easy

Mindmap

mindmap
  root((Day-45-100-movies-to-watch))
    Inputs
      Archived Empire URL
      HTML page
    Outputs
      movies.txt
      Ordered titles list
    Use Cases
      Practice web scraping
      Build a movie watchlist
      Day-by-day learning
    Tech Stack
      Python
      BeautifulSoup
      Requests

mindmap root((Day-45-100-movies-to-watch)) Inputs Archived Empire URL HTML page Outputs movies.txt Ordered titles list Use Cases Practice web scraping Build a movie watchlist Day-by-day learning Tech Stack Python BeautifulSoup Requests

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Practice BeautifulSoup by extracting an ordered list of movie titles from an HTML page.

USE CASE 2

Generate a personal movies.txt watchlist of the top 100 movies of all time.

USE CASE 3

Use an Internet Archive snapshot as a stable scraping target for a reproducible exercise.

USE CASE 4

Drop in as a day-45 milestone in a 100 day Python coding course.

What is it built with?

PythonBeautifulSoupRequests

How does it compare?

	amathguywhocodes/day-45-100-movies-to-watch	0xhassaan/nn-from-scratch	a-little-hoof/dsr
Stars	0	0	0
Language	Python	Python	Python
Setup difficulty	easy	moderate	hard
Complexity	1/5	4/5	5/5
Audience	developer	developer	researcher

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 30min

No code is provided, the reader installs BeautifulSoup and requests and writes the scraper from the README brief.

License is not stated in the available content.

In plain English

This repository is a small Python exercise that asks the reader to scrape the top 100 movies of all time from a webpage and save the result to a plain text file. The output file is called movies.txt and lists the titles in ascending order, starting from one. The README gives a short example of what the first few lines should look like, with titles such as The Godfather, The Empire Strikes Back, The Dark Knight, and The Shawshank Redemption. The stated purpose of the project is to practice using BeautifulSoup, a Python library that reads the HTML of a webpage and lets you pull pieces of data out of it. The README points to Empire's best movies list as the source, but also mentions that similar curated lists from Timeout or Stacker would work for the same exercise. There is no further code in the README, only the brief on what the script should do. The README includes one important note about the source link. Because live websites change layout often, the project recommends pointing the scraper at a snapshot stored on the Internet Archive. A specific archived URL from May 2020 is provided so that the page structure stays the same every time the script runs. This keeps the exercise reproducible long after the original page may have been updated or moved. The project looks like a single day of a longer learning series, judging by the repository name that includes Day 45. There is no list of dependencies, no setup script, and no test suite described in the README. A reader is expected to install BeautifulSoup and a request library on their own, fetch the archived page, find the right HTML elements that hold the movie titles, and write the ordered list to disk. The README is sparse, and that matters for anyone arriving at this repo. There is no license file mentioned, no contribution guide, and no description of the final solution. The repository works best as a starting prompt for someone practicing web scraping in Python, rather than as a finished tool to install and run.

Copy-paste prompts

Prompt 1

Write a Python script that uses requests and BeautifulSoup to scrape the archived May 2020 Empire top 100 movies page and save the titles in ascending order to movies.txt.

Prompt 2

Adapt the scraper to also work on a Timeout or Stacker best-movies list with minimal selector changes.

Prompt 3

Add a small test that asserts movies.txt has 100 lines and the first line is The Godfather.

Prompt 4

Refactor the scraper into a function that takes an archived URL and an output filename.

Prompt 5

Walk me through which BeautifulSoup selectors to use to pull the titles from the archived Empire page.

Frequently asked questions

What is day-45-100-movies-to-watch?

Small Day 45 Python exercise that scrapes an archived Empire top-100 movies page with BeautifulSoup and writes the titles to movies.txt in ascending order.

What language is day-45-100-movies-to-watch written in?

Mainly Python. The stack also includes Python, BeautifulSoup, Requests.

What license does day-45-100-movies-to-watch use?

License is not stated in the available content.

How hard is day-45-100-movies-to-watch to set up?

Setup difficulty is rated easy, with roughly 30min to a first successful run.

Who is day-45-100-movies-to-watch for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.