i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic.

before this i spent a few years at Afiniti and Cloud Kinetics working on fraud detection, voice analytics, and enterprise search. the kind of stuff that pages you at 3am when something breaks.

honestly, what keeps me going is when an agent you built solves a problem you never explicitly told it to handle. that feeling never gets old.

what i'm working on right now:
| Agentic AI Workflows | RAG Enterprise Search |
| Voice AI Platform | LLM Fine-Tuning LoRA |
| RLHF LLM Optimization | Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture:
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
Efficient Fine Tuning Of Foundation Models In Prod
Generative Agents For Real Time Decision Making In
Reinforcement Learning From Human Feedback
💬 Commented on [Bug]: Vertex Gemini web search streaming crashes on 3/3.1 F in BerriAI/litellm (2026-05-15)
💬 Commented on HTMLSemanticPreservingSplitter processes malformed and unsaf in langchain-ai/langchain (2026-05-15)
💬 Commented on Knowledge metadata supported in storage but not configurable in crewAIInc/crewAI (2026-05-15)
💬 Commented on Difficulty of Running mmdetection2.x (for bevformer) on Blac in open-mmlab/mmdetection (2026-05-15)
💬 Commented on PythonInterpreter: paths containing commas are silently misp in stanfordnlp/dspy (2026-05-15)
💬 Commented on Add design docs directory to Milvus in milvus-io/milvus (2026-05-15)
💬 Commented on Properly return usage information for BedrockClientV2 for th in cohere-ai/cohere-python (2026-05-15)
⭐ Starred YuyangSunshine/Awesome-Continual-learning-of-Vision-Language-Models (2026-05-15)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 Efficient Fine-Tuning of Foundation Models in Production Settings
🔬 Production-Grade Retrieval-Augmented Generation (RAG) at Scale
🔬 Multi-Modality in Production: Combining Vision, Text, and Speech Models
🔬 Fine-Tuning LLMs with LoRA (Low-Rank Adaptation) for Domain-Specific Applications
🔬 Generative Agents for Real-Time Decision-Making in Production Systems
🔬 Reinforcement Learning from Human Feedback
📌 Data Drift Detector using KS-Test — Production Pattern (Python) (2026-05-15)
📌 RAG Relevance Scorer using Cross-Encoder — Production Pattern (Python) (2026-05-13)
📌 Batch Inference Pipeline with Progress Tracking — Production Pattern (Python) (2026-05-13)
🤖 Profile auto-updated on 2026-05-15 19:47 UTC


