Smarter document extraction starts here.
The PdfData wrapper over PdfDataExtractor is inspired by https://www.npmjs.com/package/pdf-parse, which is currently unmaintained. PdfDataExtractor itself is a simple ...
A Twitter/X scraper built with Playwright for browser automation and OpenAI GPT-4 for AI-powered tweet analysis. Features timeline scraping, historical search, keyword search, checkpoint/resume, proxy ...