Web Automation Data Extraction (Betting Platform)
Overview
Python scripts that automate data extraction from a client betting site using its underlying APIs. The scripts accept betting codes as input and return structured data, three previously-undocumented APIs were extracted from the site, and several existing code bugs were fixed. Outcome: Gave the client a fully working extraction toolchain with reliable structured output and minimal supervision required.
Architecture & Pipeline
flowchart LR
n0["Betting Code InputUser-supplied codes"]
n1["Custom Python ClientRequests · Selenium"]
n2["Site APIs3 endpoints integrated"]
n3["Parse + ValidateBug fixes layered in"]
n4["Structured OutputReliable data"]
n0 --> n1
n1 --> n2
n2 --> n3
n3 --> n4
classDef step0 fill:#f1f5f9,stroke:#64748b,color:#1e293b,stroke-width:2px,rx:10,ry:10;
classDef step1 fill:#ecfeff,stroke:#06b6d4,color:#1e293b,stroke-width:2px,rx:10,ry:10;
classDef step2 fill:#f0fdfa,stroke:#0d9488,color:#1e293b,stroke-width:2px,rx:10,ry:10;
classDef step3 fill:#ecfdf5,stroke:#10b981,color:#1e293b,stroke-width:2px,rx:10,ry:10;
classDef step4 fill:#fffbeb,stroke:#f59e0b,color:#1e293b,stroke-width:2px,rx:10,ry:10;
class n0 step0;
class n1 step1;
class n2 step2;
class n3 step3;
class n4 step4;
End-to-end flow derived from this project's scope and tech stack. Tap View Fullscreen for a larger view, or scroll horizontally on small screens.
Key Features
- Automated retrieval via client-provided APIs
- Discovery and integration of three internal APIs
- Bug fixes and reliability improvements to existing code
- Local and server-deployable execution
- Tech Stack:** Python, Requests, BeautifulSoup, Selenium, Pandas