Files
media-analyzer/README.md
2025-08-25 04:03:28 -03:00

64 lines
2.4 KiB
Markdown

# Real-Time Video AI Analysis Platform
![Control Panel Overview](def/panel_capture.png)
A production-ready video streaming platform with real-time AI logo detection, demonstrating scalable microservices architecture and modern web technologies.
## Quick Demo
```bash
docker compose up
```
**Test the system:**
1. Open http://localhost:3000 (frontend)
2. Start webcam stream or use RTMP from OBS
3. Show logos from `/logos/` folder to camera for real-time detection
4. Watch live detection results and visual overlays
## Architecture Overview
![System Architecture](def/architecture/architecture_diagram.svg)
**Key Design Patterns:**
- **Source Adapters** (`streaming/source_adapters.py`) - Abstract webcam vs RTMP input
- **Execution Strategies** (`ai_processing/execution_strategies/`) - Local vs distributed processing
- **Analysis Adapters** (`ai_processing/adapters/`) - Pluggable AI models (CLIP, GCP Vision)
- **Queue Segregation** - Separate Celery workers for different analysis types
## Code Organization
```
├── backend/
│ ├── streaming/ # Video ingestion (RTMP/Webcam)
│ ├── ai_processing/ # AI analysis pipeline
│ │ ├── adapters/ # Pluggable AI models
│ │ ├── execution_strategies/ # Local/cloud/distributed
│ │ └── tasks.py # Celery workers
│ └── effects/ # Real-time video effects (future)
├── frontend/ # Angular 17+ SPA
├── k8s/ # Kubernetes manifests
└── logos/ # Test images (Apple, Nike, etc.)
```
## Tech Stack
- **Backend**: Django + Channels, Celery, PostgreSQL, Redis
- **AI/ML**: PyTorch + CLIP, OpenCV, GCP Vision API
- **Frontend**: Angular 17, WebSockets, HLS.js
- **Infrastructure**: Docker, Kubernetes, NGINX
## Features Implemented
**Real-time logo detection** (CLIP + GCP Vision)
**Live video streaming** (webcam/RTMP → HLS)
**WebSocket overlays** (detection boxes, confidence scores)
**Kubernetes deployment** (auto-scaling, health checks)
**Modular architecture** (adapters, strategies, queues)
🔄 **In progress**: Visual properties, audio transcription, distributed processing
---
*This project demonstrates full-stack capabilities: AI/ML integration, real-time systems, cloud-native architecture, and modern web development.*