experiment2026

Monkey Gesture Detector

A real-time webcam experiment where I map face + hand gestures to live meme states

Monkey Gesture Detector

Overview

I built this in-browser gesture detector using MediaPipe Face Landmarker + Hand Landmarker to read facial expressions and hand poses frame by frame, then trigger meme states instantly. The biggest focus was making it feel smooth in real life, so I added confidence thresholds, gesture history smoothing, no-hand timing logic, and priority-based state transitions to cut false triggers and flicker.

What I Learned

  • How to run a multi-model CV pipeline in the browser (face + hand) without killing responsiveness.
  • How to turn raw landmarks into practical gesture rules for tongue-out, fist, middle finger, mouth-open, and two-hand detection.
  • How much temporal smoothing matters: gesture history + no-hand frame thresholds made the UX way more stable.
  • How to tune real-world detection with confidence gating, fallback logic, and live debug overlays.
  • How to structure experimental code so it is still clean, modular, and easy to iterate on quickly.

Stack & Architecture

Frontend

HTML5 CanvasVanilla JavaScriptMediaPipe Tasks Vision

Backend

WebRTC getUserMediaFace LandmarkerHand Landmarker

Infra

GitHub PagesGPU Delegate (Web)Client-Side CV Pipeline

Visual Overview

Interested in this project?