Skip to content

dlyog/gemmacure

Repository files navigation

🧬 GemmaCure — Kaggle Gemma 4 Good Hackathon Submission

An AI-powered drug discovery educational game using fine-tuned Gemma 4


📦 Submission Structure

gemmacure/
├── 1_dataset/           ← Training data sourcing documentation
├── 2_model_finetune/    ← Fine-tuning methodology & published model
├── 3_gemmacure_api/     ← Molecule generation & scoring API
├── 4_gemmacure_3d_game/ ← Fully playable 3D game
└── README.md            ← This file

🚀 Quick Start (5 minutes)

cd gemmacure/4_gemmacure_3d_game
pip install -r backend/requirements.txt
./gemma_cure_game.sh start
open http://localhost:5018

🎮 What is GemmaCure?

An educational drug discovery game where:

  • You battle disease pathogens as bosses (SARS-CoV-2, HIV, Alzheimer's, Cancer, etc.)
  • Gemma 4 AI generates drug candidates in real-time
  • IBM MAMMAL scores binding affinity (Hit = game win, Miss = game loss)
  • You earn XP and badges for discovering effective drugs

It's real drug discovery turned into an engaging game for high school students.


🤖 The AI Model

  • Base: Google Gemma 4-E2B-it (5.2B parameters)
  • Training: 225,060 drug-target pairs (BindingDB, ChEMBL, MOSES)
  • Method: Unsloth RS-LoRA (r=64, α=128) — 2x faster, 70% memory savings
  • Performance: 94.2% SMILES validity, R²=0.72 affinity prediction
  • Published: https://huggingface.co/dlyog/gemma-cure (CC-BY 4.0)

👨‍👧 The Story

Father (AI Engineer) + Daughter (High School Student) built this together. She suggested: "Make it a game — the disease is a boss, the drug is ammunition." This became PathoHunt 3D.


📚 Documentation Structure

  • 1_dataset/README.md — How we sourced 225K training pairs
  • 2_model_finetune/README.md — Complete training guide with hyperparameters
  • 3_gemmacure_api/README.md — API endpoints & deployment
  • 4_gemmacure_3d_game/SETUP.md — Game setup & troubleshooting
  • kaggle_submission.html (parent dir) — Kaggle writeup + image prompts

🏆 Prize Strategy

  1. Unsloth Prize ($10K) — Best fine-tuned Gemma 4
  2. Health & Sciences Impact ($10K) — Educational platform, real disease targets
  3. Main Track ($50K+) — Production-quality software, not just a prototype

✅ What's Included

  • ✅ Fine-tuned model (published on Hugging Face)
  • ✅ Full API with molecule generation + scoring
  • ✅ Playable 3D game with 8 disease targets
  • ✅ One-click launcher script for students
  • ✅ Complete documentation for judges
  • ✅ Open source (CC-BY 4.0)

🔗 Quick Links


Made by Tarun & Tanisha Chawdhury | May 2026

About

AI Drug Discovery for all disease

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors