Layer 1: Input

The Input Layer is the entry point for all video sources into the VisionLog system. It handles video capture from webcams, video files, and image files.

Overview

Purpose: Capture video/image input for face processing
Input Types: Webcam, Video Files, Images
Library: OpenCV

Supported Input Sources

Webcam

Direct-connected cameras for real-time processing.

Feature	Specification
Connection	USB
Resolution	1280x720 (default)
Frame Rate	30 FPS
Use Case	Real-time attendance, enrollment

Video Files

Offline processing of recorded footage (CCTV, recordings).

Format	Extension	Description
MP4	.mp4	Most common, H.264/H.265
AVI	.avi	Windows standard
MKV	.mkv	High quality container
MOV	.mov	Apple format
WMV	.wmv	Windows Media

Image Files

Single image processing for enrollment or recognition.

Format	Extensions
JPEG	.jpg, .jpeg
PNG	.png
BMP	.bmp
WebP	.webp

Input Modes

Mode	Description
Camera	Real-time recognition from webcam
Video	Process recorded video files
Image	Single image analysis
Folder	Batch process folder of images

Frame Skipping

For video processing, frames can be skipped for performance optimization.

Skip Value	Use Case
1	Process all frames (maximum accuracy)
5	Balanced (default)
10-30	Fast scan mode

Best Practices

Camera Placement

Height: Face level (1.5-1.8m)
Distance: 0.5-3 meters for clear faces
Lighting: Even, front-facing light
Angle: Direct face view preferred
Background: Avoid busy patterns

Image Quality

Factor	Recommendation
Minimum Face Size	50x50 pixels
Resolution	720p or higher
Lighting	>300 lux
Focus	Sharp, not blurry

Video Processing

Scenario	Frame Skip	Notes
Real-time	1	Process all frames
Standard	5	Balanced (default)
Fast scan	10-30	Quick overview

Configuration

Setting	Description	Default
Camera Index	Default camera device	0
Frame Width	Capture resolution width	1280
Frame Height	Capture resolution height	720
FPS	Target frame rate	30
Frame Skip	Process every Nth frame	5

Layer 1: Input

On this page