LogoVisionLog

Sign to Text

Real-time ASL hand gesture recognition via webcam, converts signs to text output

Sign to Text translates American Sign Language (ASL) gestures into text in real time using the device webcam. The system uses MediaPipe hand landmark detection on the frontend and a gesture classification model on the backend, capturing frames every 300 ms and committing a sign only after it has been held consistently for a configurable number of readings.

Supported Signs

Letters

The following 16 static ASL letter signs are supported:

A B C D F I K L O R S U V W X Y

Note: J and Z require motion strokes and are not supported in static-frame mode.

Word Gestures

GestureSign Description
HELLOAll fingers spread open
GOODThumbs up
BADThumbs down
I LOVE YOUThumb + index finger + pinky
STOPOpen flat palm facing forward
ROCK ONIndex finger + pinky extended
OKThumb and index finger form a circle

How It Works

  1. Camera initialises on page load — allow camera access when prompted.
  2. Click Start Recognition — the system begins capturing frames every 300 ms.
  3. Each frame is sent to the backend /sign-language/detect endpoint.
  4. A stability filter requires the same sign to appear 3 consecutive times before it is committed to the text output. This prevents accidental or partial gestures from being written.
  5. Word gestures are added with surrounding spaces; letters are appended directly.

Text Output Controls

ButtonAction
SpaceManually insert a space between words
DeleteRemove the last character
ClearClear the entire text buffer
CopyCopy the full text to clipboard

Best Practices

  • Ensure your hand is clearly visible and well-lit against a contrasting background.
  • Hold each sign steady for at least 1 second for reliable detection.
  • Position your hand roughly in the centre of the camera frame.
  • For letter-by-letter spelling, pause briefly between each letter to allow the stability counter to reset.

Limitations

  • Only one hand is tracked at a time.
  • Signs requiring motion (J, Z) are not supported.
  • Accuracy may decrease in low-light conditions or with fast hand movement.

On this page