Abserny
Voice-activated vision assistant for the visually impaired. No screen required, ever.
Discover how it works
Understanding Through AI
A graduation project that gives visually impaired individuals a fully spoken, gesture-driven window into the world around them, powered by Gemini AI when online and by on-device ML Kit when offline.
The Problem
Over 2.2 billion people worldwide live with visual impairment. Most AI assistive tools still assume a sighted user, requiring menus, icons, and screens to set up. Even the first launch demands someone who can see.
Our Answer
Abserny is built entirely around a voice-first philosophy. Every interaction, from first launch and language selection to daily use, is driven by simple gestures and spoken feedback. No screen. No visual menus. No sighted assistance needed.
How Abserny Works
A deterministic finite state machine governs every interaction: no race conditions, no ambiguity.
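To make the idea concrete, here is a minimal sketch of a table-driven deterministic state machine. The state and event names are hypothetical, chosen for illustration; Abserny's actual states are not listed on this page.

```python
from enum import Enum, auto


class State(Enum):
    # Hypothetical states, not taken from the Abserny source.
    IDLE = auto()
    CAPTURING = auto()
    DESCRIBING = auto()
    SPEAKING = auto()


class Event(Enum):
    # Hypothetical events, not taken from the Abserny source.
    TAP = auto()
    FRAME_READY = auto()
    DESCRIPTION_READY = auto()
    SPEECH_DONE = auto()


# Each (state, event) pair maps to exactly one next state,
# which is what makes the machine deterministic.
TRANSITIONS = {
    (State.IDLE, Event.TAP): State.CAPTURING,
    (State.CAPTURING, Event.FRAME_READY): State.DESCRIBING,
    (State.DESCRIBING, Event.DESCRIPTION_READY): State.SPEAKING,
    (State.SPEAKING, Event.SPEECH_DONE): State.IDLE,
}


def step(state, event):
    """Return the next state; events with no table entry are ignored."""
    return TRANSITIONS.get((state, event), state)


def run(events, start=State.IDLE):
    """Feed a sequence of events through the machine and return the final state."""
    state = start
    for event in events:
        state = step(state, event)
    return state
```

In this sketch a second tap that arrives while a description is still being generated has no table entry, so it is ignored rather than starting an overlapping request. That is one common way a state machine rules out races between gestures.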
Core Features
Detection Modes
Four modes for four daily needs. Swipe left or right in the app to cycle between them at any time.
Broad environmental awareness. Hazards and obstacles are mentioned first, followed by spatial context, up to four items per description, always with directional words like ahead, to your left, nearby.
Close-range object identification. Returns the precise name plus one functional detail, ideal for identifying items on a table, in a bag, or on a shelf.
Full text recognition via on-device OCR. Reads all visible text verbatim, top to bottom, for signs, labels, documents, packaging, and displays.
Social and navigation awareness. Reports the count of people in frame, their spatial location, and observable activity, for navigating crowds, entering rooms, or approaching conversations.
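The swipe cycling and the scene-description rules above could be sketched roughly as follows. The mode names, the mapping of swipe direction to +1 or -1, and the Item fields are assumptions made for illustration; only the four-mode cycle, the hazards-first ordering, and the four-item cap come from the descriptions above.

```python
from dataclasses import dataclass

# Hypothetical mode names; the page describes four modes without naming them here.
MODES = ["scene", "object", "text", "people"]

# The scene mode reports at most four items per description.
MAX_ITEMS = 4


def cycle_mode(current, direction):
    """Move one mode forward (+1) or backward (-1), wrapping at both ends."""
    index = MODES.index(current)
    return MODES[(index + direction) % len(MODES)]


@dataclass
class Item:
    label: str            # what was detected, e.g. "stairs"
    direction: str        # directional phrase, e.g. "ahead"
    is_hazard: bool = False


def describe_scene(items):
    """Return spoken phrases: hazards first, then the rest, capped at MAX_ITEMS."""
    # sorted() is stable, so items keep their original order within each group.
    ordered = sorted(items, key=lambda item: not item.is_hazard)
    return [f"{item.label} {item.direction}" for item in ordered[:MAX_ITEMS]]
```

For example, if a staircase is detected third among five items, this sketch moves it to the front and drops the fifth item, so the hazard is always spoken before ordinary context.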
Technology Stack
Performance Results
Measured across 50 trials on a mid-range Android device (Snapdragon 665, Wi-Fi connection).
1,430 ms
Median end-to-end latency from gesture to first spoken word when online via Gemini. Offline, via ML Kit, the median is 380 ms.
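A median like this is typically computed by timing each trial and taking the middle value of the sorted samples. The sketch below shows one way to do that. The helper names are hypothetical, and this is not the project's actual measurement harness.

```python
import time
from statistics import median


def timed_ms(action):
    """Run action() once and return (result, elapsed time in milliseconds)."""
    start = time.perf_counter()
    result = action()
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return result, elapsed_ms


def median_latency_ms(samples):
    """Return the median of a non-empty list of per-trial latencies in milliseconds."""
    if not samples:
        raise ValueError("at least one latency sample is required")
    return median(samples)
```

The median is a reasonable summary here because a few slow network round trips would pull a mean upward without reflecting the typical wait.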
87% Spatial Rate
87% of AI-generated scene descriptions include at least one spatial orientation term. 100% of descriptions with hazards mention them first.
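One plausible way to score these two metrics is sketched below. The term list and function names are assumptions; the page does not say which spatial terms were counted or how hazard ordering was checked.

```python
# Hypothetical list of spatial orientation terms; the actual list is not given.
SPATIAL_TERMS = ("ahead", "behind", "to your left", "to your right", "nearby")


def has_spatial_term(description):
    """True if the description contains at least one spatial term (substring match)."""
    text = description.lower()
    return any(term in text for term in SPATIAL_TERMS)


def spatial_rate(descriptions):
    """Fraction of descriptions that contain at least one spatial term."""
    if not descriptions:
        return 0.0
    hits = sum(has_spatial_term(d) for d in descriptions)
    return hits / len(descriptions)


def hazards_first(hazard_flags):
    """hazard_flags lists items in spoken order, True for a hazard.

    Returns True when no hazard is mentioned after a non-hazard item.
    """
    seen_non_hazard = False
    for is_hazard in hazard_flags:
        if is_hazard and seen_non_hazard:
            return False
        if not is_hazard:
            seen_non_hazard = True
    return True
```

Plain substring matching is the simplest choice, but it can miscount when a term appears inside a longer word, so a real evaluation would likely match on word boundaries.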
Get Abserny on Android
Real-time, AI-powered scene description with gesture-driven control. Available now for Android, with iOS in progress.
Download Now