  • TLV s.r.o.
  • Prague

koprjaa/README.md

Hi, I'm Jan Alexandr Kopřiva

Business Intelligence student at VŠE Prague, no-code / low-code developer, and data-extraction hobbyist. Most of what I build falls into three buckets: web scrapers, applied-AI services, and small desktop tooling.

Python TypeScript FastAPI Next.js pandas n8n ChromaDB

Projects worth clicking

  • czech-tabloid-scraper — 2,400 headlines from 18 Czech news feeds in 14 seconds, stdlib + requests + rich
  • chromadb-embedding-visualizer — 3D fly-through of ChromaDB vector stores. Next.js + Three.js frontend, FastAPI + UMAP + HDBSCAN backend
  • email-verifier — async SMTP/DNS verification with catch-all detection and Czech-provider reputation logic
  • vocative-generator — scripts sklonuj.cz to batch-decline Czech first names, with adaptive concurrency and resumable checkpoints
  • czech-surname-restorator — Ray-distributed diacritic restoration for legacy Czech datasets (Dvorak → Dvořák)
  • rm1000i-tray — tiny Windows system-tray widget that reads live PSU power draw over USB HID
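
To give a feel for what diacritic restoration (Dvorak → Dvořák) involves, here is a minimal single-process sketch. It assumes a reference list of correctly spelled surnames is available and restores by dictionary lookup. The function names and the `REFERENCE` list are made up for illustration, and this is not necessarily how czech-surname-restorator works internally (it leaves out the Ray distribution entirely).

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove combining marks, e.g. 'Dvořák' -> 'Dvorak'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def build_lookup(reference_names: list[str]) -> dict[str, str]:
    """Map each stripped, lowercased form to its accented spelling.

    Assumption: one accented spelling per stripped form. Real data can be
    ambiguous, and later entries would silently overwrite earlier ones here.
    """
    return {strip_diacritics(name).lower(): name for name in reference_names}


def restore(name: str, lookup: dict[str, str]) -> str:
    """Return the accented spelling if known, otherwise the input unchanged."""
    return lookup.get(name.lower(), name)


# Hypothetical reference list; a real one would come from a name registry.
REFERENCE = ["Dvořák", "Kopřiva", "Novák"]
lookup = build_lookup(REFERENCE)

print(restore("Dvorak", lookup))
print(restore("Smith", lookup))
```

Names missing from the reference list pass through untouched, which keeps the step safe to run on mixed datasets.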

Want to collaborate?

I'm open to projects in data extraction, workflow automation, and applied AI integrations (RAG, structured-output LLMs, agent-style pipelines).

Reach me at jan.alexandr.kopriva {@} gmail dot com.

LinkedIn

Pinned

  1. email-verifier Public

    Async SMTP email verification with catch-all detection, DNS caching, and a Flask bulk-upload UI.

    Python

  2. 4IZ210 Public

    This project uses machine learning classifiers to predict heart disease presence in patients based on health metrics, developed for the 4IZ210 course.

    Jupyter Notebook

  3. protext-scraper Public

    Concurrent web scraper for Protext.cz press releases. Tor-powered with automatic circuit rotation and rate-limit bypass.

    Python

  4. shopify-scraper Public

    Utility to extract the product feed from any Shopify store.

    Python
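
The catch-all detection mentioned for email-verifier can be sketched roughly as follows. The usual idea is to probe a random mailbox that almost certainly does not exist: if the server accepts it, the domain accepts everything, so a positive result for the real address says nothing. The `probe` callable is a hypothetical stand-in for an SMTP `RCPT TO` check, and `classify` and `fake_probe` are illustrative names, not the repository's API.

```python
import asyncio
import secrets
from typing import Awaitable, Callable

# Hypothetical probe: resolves to True if the mail server accepts the address.
Probe = Callable[[str], Awaitable[bool]]


async def classify(address: str, probe: Probe) -> str:
    """Classify an address as 'deliverable', 'undeliverable', or 'catch-all'."""
    domain = address.rsplit("@", 1)[1]
    # Random local part that should not exist on any normal mail server.
    decoy = f"probe-{secrets.token_hex(8)}@{domain}"
    # Check the real address and the decoy concurrently.
    target_ok, decoy_ok = await asyncio.gather(probe(address), probe(decoy))
    if decoy_ok:
        # The server accepted a nonsense mailbox, so it accepts anything.
        return "catch-all"
    return "deliverable" if target_ok else "undeliverable"


async def fake_probe(address: str) -> bool:
    """Offline stand-in for a real SMTP check, for demonstration only."""
    known = {"jan@example.cz"}
    return address in known or address.endswith("@catchall.example")


print(asyncio.run(classify("jan@example.cz", fake_probe)))
print(asyncio.run(classify("nobody@example.cz", fake_probe)))
print(asyncio.run(classify("anyone@catchall.example", fake_probe)))
```

Injecting the probe keeps the classification logic testable without network access. A real probe would also need timeouts and handling for greylisting and temporary SMTP errors.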