Lotto update #1
12/27/22
I’m currently working on a project that logs data from the Michigan Lottery. I have an AWS ECS task that scrapes the MI Lottery’s website and saves the number of prizes remaining for each scratch-off game.
This started off as a fun weekend web scraping project, but now that the scraper is finished, I guess it’s time to analyze the pile of data I have…
The data looks like this:
Infrastructure-wise, here’s what I have so far:
- A Docker image that scrapes the data and saves it to both S3 (as parquet) and a Postgres database in RDS.
- The Docker image lives in ECR, and I used it to create a recurring ECS job that performs the scraping. It currently runs once a day.
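To make the save step a bit more concrete, here’s a rough sketch of what writing one day’s scrape to S3 as parquet could look like. This is illustrative only, not the actual scraper code: the `PrizeSnapshot` fields, the `snapshots/scraped_on=...` key layout, and the function names are all made up for the example, and the upload assumes `pandas`, `pyarrow`, and `boto3` are installed in the image.

```python
from dataclasses import asdict, dataclass
from datetime import date
from io import BytesIO


@dataclass(frozen=True)
class PrizeSnapshot:
    """One row per game and prize tier per scrape (hypothetical field names)."""
    scraped_on: date
    game_id: str
    prize_amount: float
    prizes_remaining: int


def snapshot_key(scraped_on: date, prefix: str = "snapshots") -> str:
    """Build a date-partitioned S3 object key (the layout is an assumption)."""
    return f"{prefix}/scraped_on={scraped_on.isoformat()}/prizes.parquet"


def to_rows(snapshots):
    """Convert snapshots to plain dicts, ready for a DataFrame or a bulk INSERT."""
    return [asdict(s) for s in snapshots]


def upload_parquet(snapshots, bucket: str) -> str:
    """Serialize one day's snapshots to parquet in memory and put them in S3."""
    # Third-party imports are kept local so the helpers above stay stdlib-only.
    import boto3
    import pandas as pd

    if not snapshots:
        raise ValueError("nothing to upload")

    key = snapshot_key(snapshots[0].scraped_on)
    buffer = BytesIO()
    pd.DataFrame(to_rows(snapshots)).to_parquet(buffer, index=False)
    boto3.client("s3").put_object(Bucket=bucket, Key=key, Body=buffer.getvalue())
    return key
```

Partitioning the key by scrape date means each daily run writes its own object, so a rerun for the same day overwrites that day’s file instead of piling up duplicates. The same `to_rows` output could feed the Postgres insert.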
Now I’m working on a dashboard in Dash that I’m planning to deploy on AWS, maybe on EC2 or maybe with a service that can run containers, like Elastic Beanstalk. I’m not really sure yet.