StreamInsight

Table of Contents

Scalable video analysis pipeline that processes uploaded videos through an event-driven architecture on AWS. An S3 upload triggers Lambda, which orchestrates Step Functions to coordinate Fargate containers and SageMaker Serverless inference, with results stored across two DynamoDB tables.

GitHub: github.com/RutanshS/stream-insight

Languages: Python, Bash

Infrastructure: AWS CDK, S3, Lambda, Step Functions, Batch (Fargate), SageMaker Serverless, DynamoDB

ML: OpenAI CLIP (ViT-B/32), PySceneDetect, FFmpeg

How It Works
#

S3 upload triggers Lambda → Step Functions orchestrate the full pipeline
A containerized video processing job uses FFmpeg and PySceneDetect to extract keyframes at scene boundaries, dynamically scaling extraction volume with video duration to reduce downstream inference load by ~95%
OpenAI CLIP (ViT-B/32) runs on SageMaker Serverless endpoints for zero-shot frame classification, with scale-to-zero to eliminate idle compute costs

How It Works#

Related

How It Works
#