System Architecture Map

Smartphone + TouchDesigner + Cloud APIs + Projector Output

INPUT

Smartphone

Camera

Rear camera feed

Camo App

video in

Microphone

Headset / built-in mic

Camo App

sound in

Cloud

Typeform

6 questions survey
Browser interface

Web

Other

MOV File

Pre-recorded video input

.mov

TRANSFORM

TouchDesigner

Video Pipeline

Camera In

Video capture

TOP

Pixelation

Effect chain

TOP

to image transform

Image In

Still image source

TOP

Image Preview

Pre-diffusion frame

TOP

prompt + image

StreamDiffusion Node

Real-time img2img generation
Prompt from transcript

TOP

generated image

Audio Pipeline

Audio Device In

Mic capture 44100Hz

CHOP

Audio File Out

Record toggle by script

CHOP

Analysis CHOP

RMS Power — voice trigger

CHOP

Audio Filter

Bandpass 4–8kHz
Clap detection

CHOP

rms value

Script Pipeline

CHOP Execute

RMS trigger
onValueChange()

DAT

CHOP Execute

Button override
onOffToOn()

DAT

call API

Python Script

urllib.request → OpenAI Whisper
multipart/form-data WAV upload

DAT

transcribed text

Text + Compositing

Transcript Output

Latest speech text
Resets each take

DAT

Text Container

Renders text overlay

TOP

Current Question

Clap-navigated
loops at 6

DAT

Image Transform

Scale / position
Color adjust

TOP

composite

Movie File Out

Snapshot on silence
take_TIMESTAMP.jpg

TOP

Cloud APIs

OpenAI Whisper

/v1/audio/transcriptions
whisper-1 model

API

DeepDream / SD

Image generation
prompt + image in

API

OUTPUT

Display

Merged Video Out

Final composite frame
Text + generated image

TOP

HDMI

Projector

Physical projection
Room-scale output

Hardware

Browser

Typeform Display

Questions shown
to participant

Web

Local Storage

take_TIMESTAMP.wav

Raw audio per take

.wav

take_TIMESTAMP.txt

Transcription text

.txt

take_TIMESTAMP.jpg

Visual snapshot

.jpg

CHOP Channel Operator

DAT Data Operator

TOP Texture Operator

COMP Component

API External Service

FILE Disk Storage

APP Application