Multi-tenant, talent-centric social-media orchestrator built for marketing agencies that manage many independent talents across IG · TikTok · LinkedIn (+ future). White-label persona agents act in each talent's voice, owner-approved content queues gate every post, and channel-isolated execution keeps accounts safe.
Status: V1 design phase, narrowed for fastest ship. Several capabilities are deliberately deferred to V2 to keep the first slice small — Browser channel, the iproxy fleet, fingerprints, audience identity resolution, account signup automation, email post-request path, per-talent tick cadence, Supabase Realtime, and a React frontend. See ARCHITECTURE_V2.html for the deferred set with the trigger conditions that bring each one back.
Live mockups: Dev — operator dashboard ↗ · Dev — talent portal ↗ · Prod — operator dashboard ↗ · Prod — talent portal ↗
A single system that can create, read, update, and delete on any major social platform — starting with Instagram, TikTok, and LinkedIn, and easily extending to Twitter / X, YouTube, Threads, and Facebook.
Dr.Social is built for marketing agencies. A single agency is one tenant and manages many independent talents (not a group — each talent is an autonomous, real human personality with their own goals, voice, look, and audience). Every talent runs multiple accounts across multiple platforms, each with their own following, and is represented end-to-end in the system by a rich talent profile stored in the database.
The profile is the source of truth for everything the system does on behalf of a talent:
Dr.Social is a multi-tenant SaaS: every record in the system is owned by a tenant (a client of ours — an agency, a creator team, a brand). Inside a tenant, the unit of "who" is a talent — a real human personality whose social presence we manage. One talent can run many accounts, across many platforms, on any channel. This shape is the spine of the data model and shows up in every agent, every query, every dashboard view.
- Every record carries tenant_id NOT NULL; a tenant_users table maps Supabase users to roles (owner / editor / viewer).
- Accounts carry FK → talents.id and FK → tenants.id — one row per (platform, talent, channel) — and channel is immutable (Device or MCP in V1).
- Audience identity resolution (audience_members + audience_identities + merge UI) is deferred. See V2 §6.
(1) RLS policies — USING (tenant_id = auth.jwt() ->> 'tenant_id') — for operator-facing queries.
(2) Service-role queries made by the worker always include WHERE tenant_id = $job.tenant_id — verified in code review and via composite FKs (tenant_id, id) so the database refuses cross-tenant attachments.
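A minimal sketch of that worker-side guard, assuming a hypothetical db.fetchrow helper (the real call goes through supabase_client.py):

# Sketch only — db.fetchrow is an assumed helper, not a final API.
async def load_account_for_job(db, job):
    row = await db.fetchrow(
        "select * from accounts where tenant_id = $1 and id = $2",  # tenant filter is mandatory
        job.tenant_id, job.account_id,
    )
    if row is None:
        # missing or owned by another tenant — either way, refuse to act
        raise RuntimeError(f"account {job.account_id} not visible in tenant {job.tenant_id}")
    return row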
The system stacks into six layers. Each upper layer only knows about the layer immediately below it through a stable interface, so the lower layers are individually replaceable.
Two server-rendered web UIs: operator dashboard (multi-talent supervision) and talent portal (/me, single-talent self-service with the approval queue). Python templates + htmx for partials. Polling for "live" panels at 5s.
4 LLM-driven agents (PersonaAgent, ContentAgent, InboxAgent, ModerationAgent) — all routing through the abfs.tech LLM router. Plus the cron Workflow handler and a handful of plain Python services (AccountImporter, RequestAgent for raw-media preprocessing, Analytics SQL).
One transparent SocialPlatform contract. IG, TikTok, LinkedIn implement it. Adding Twitter or YouTube means writing one more adapter — agents above don't change.
Device (XCUITest / W3C / ADB) and MCP (Composio.dev). Each account is permanently bound to exactly one backend — no cross-channel fallback, no migration. See channel isolation rule. Browser channel arrives in V2 §1.
Supabase Vault for per-account session cookies (Device) and OAuth refresh tokens (MCP). No fingerprint pool, no proxy registry, no signup automation, no 2FA relay — those are V2. V1 talents bring existing logged-in accounts.
Postgres for state (tenant-scoped via RLS), Supabase Storage for media, Supabase Vault for secrets, Auth for operator login, pg_cron for the workflow tick. No Redis, no Vaultwarden, no separate object store, no Realtime in V1 (polling is enough; Realtime arrives in V2 §9).
V1 has exactly four agents that need an LLM in the loop — anything reasoning, generating, or classifying. Everything else is a plain Python module (no agent ceremony). Don't commit to a heavyweight agent framework (e.g. Google ADK A2A) until V2 actually needs peer-to-peer agent topology; V1's orchestration is a star (Workflow → agent) and a simple async function-call graph is enough.
A stateless function: persona_decide(talent_id, context) → action[]. On every call it loads the talent's profile (personality, voice, stories, goals, media library, content pillars, "do not say" list, connected accounts) and decides what to post / reply to / wait on, in that talent's voice. No daemon, no long-lived instance — each tick fires a fresh call.
Owns: "what to post next," "reply to this DM in voice," "draft this brief."
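As a shape sketch — the Action dataclass and the context fields are illustrative, not final types:

from dataclasses import dataclass

@dataclass
class Action:
    kind: str                  # "post" | "reply" | "wait"
    account_id: str
    brief: dict | None = None  # present when kind == "post"

async def persona_decide(talent_id: str, context: dict) -> list[Action]:
    # context carries the freshly loaded profile, recent posts, inbox and queue —
    # nothing is cached between ticks; every call is a fresh decision via the LLM router.
    ...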
Generates videos, photos, audio, and text — always conditioned on the target talent's profile (face refs, voice clone, brand kit, style guide). Same talent → consistent face/voice/look across every account on every platform. Brief in, asset out. LLM for captions/hashtags/scripts; media models for image/video.
Reads DMs, comments, mentions, replies. Classifies (lead / fan / spam / escalation). Drafts in-character replies via PersonaAgent for the talent or operator to approve. Polls on a cadence; MCP adapters with webhook support skip polling.
Filters generated content and inbound messages for brand-rule / sensitivity / "do not say" violations before either is acted on. Rules-based first pass (literal "do not say" hits), LLM second pass for tone/brand judgement calls.
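A sketch of that two-pass check — the helper names (load_profile, llm.messages) and the model id are assumptions, not final APIs:

async def moderation_approve(draft, talent_id) -> bool:
    profile = await load_profile(talent_id)                    # assumed helper
    text = f"{draft.caption} {' '.join(draft.hashtags)}"
    # pass 1 — rules: any literal "do not say" hit is an immediate block
    if any(term.lower() in text.lower() for term in profile.do_not_say):
        return False
    # pass 2 — LLM judgement call on tone / brand fit (Haiku per the V1 defaults);
    # `llm` is the shared router client from §5
    verdict = await llm.messages(
        model="/haiku/i",                                      # regex form, see §5
        system=f"Brand rules for this talent:\n{profile.style_guide_md}",
        messages=[{"role": "user",
                   "content": f"Answer OK or BLOCK. Does this draft violate the rules?\n\n{text}"}],
    )
    return "OK" in verdict["content"][0]["text"]               # Anthropic Messages shape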
pg_cron-driven function. On each tick, iterates active talents and fans out work to the LLM agents. No reasoning of its own.
Imports existing accounts: paste in session cookies (Device channel) or complete an OAuth grant (MCP). Validates by issuing a health-check call. No signup automation, no 2FA relay — those are V2.
Picks an account, picks a piece of approved content, picks the bound backend, publishes. Records the post ID. Folded into the platform adapter — not its own LLM agent in V1.
Pre-processes raw media attached to a post request: scene detection, OCR, transcription. Then hands off to ContentAgent for per-platform variant generation. No LLM directly — it's a pipeline of media-analysis calls.
Aggregates post performance, account health, inbox SLAs. SQL queries, not an LLM agent. Feeds the dashboard and informs ContentAgent's next-brief decisions.
# workflow.tick() — one pass
for talent in tenants.active_talents(due_now):
    ctx = load_talent_context(talent)                        # profile, recent posts, inbox, queue
    for account in ctx.accounts:
        if persona_decide_should_post(talent, account, ctx):
            brief = persona_decide_brief(talent, account, ctx)
            draft = content_generate(talent, brief)          # ContentAgent
            if moderation_approve(draft):
                content_queue.enqueue(talent, account, draft,
                                      status='awaiting_owner_approval')

        # publish only what the owner has already approved
        for piece in content_queue.due_and_approved(talent, account):
            post_publish(account, piece)                     # PostAgent / adapter

        for msg in inbox_fetch(account):                     # InboxAgent
            if moderation_approve_inbound(msg):
                reply = persona_draft_reply(talent, msg)
                inbox_send(account, msg.thread, reply,
                           status='awaiting_owner_approval')
Two invariants to notice:
(1) no backend is selected at runtime — the dispatcher reads accounts.channel and refuses any other.
(2) nothing publishes without owner approval — every draft and reply lands in the queue and only the pieces the human owner has approved are eligible.
All four LLM agents call a single internal provider: the abfs.tech LLM router at
https://www.abfs.tech/v1/. The router is already operational and serves Anthropic Claude
models through Anthropic-compatible and OpenAI-compatible endpoints, with subscription rotation
and per-key logging. Dr.Social doesn't talk to Anthropic / OpenAI / any model provider directly — only
to this router.
- POST /v1/messages — Anthropic Messages API shape
- POST /v1/chat/completions — OpenAI Chat Completions shape
- GET /v1/models — list available model IDs
- GET /health — router health
One API key per Dr.Social environment (dev, prod). Sent as either
Authorization: Bearer <key> or x-api-key: <key>.
Keys are issued from the abfs.tech admin dashboard at /admin and stored in Supabase Vault on our side.
Server-Sent Events on both endpoints. Anthropic streams pass through verbatim (message_start, content_block_delta, message_stop). For ContentAgent caption work we use buffered responses; for InboxAgent reply drafting we stream so the operator sees the draft as it composes.
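A sketch of how InboxAgent could consume that stream, assuming httpx; the event names follow the Anthropic stream shape the router passes through:

import json
import os
import httpx

async def stream_reply_draft(payload: dict):
    # yields text deltas as they arrive so the dashboard can render the draft live
    headers = {"x-api-key": os.environ["ABFS_LLM_API_KEY"]}
    async with httpx.AsyncClient(base_url="https://www.abfs.tech", timeout=None) as client:
        async with client.stream("POST", "/v1/messages",
                                 headers=headers, json={**payload, "stream": True}) as resp:
            async for line in resp.aiter_lines():
                if not line.startswith("data: "):
                    continue
                event = json.loads(line[len("data: "):])
                if event.get("type") == "content_block_delta":
                    yield event["delta"].get("text", "")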
Literal model IDs pass through; /regex/flags syntax matches against the available model list and picks the best fit. V1 defaults: Sonnet for content generation + inbox drafting, Haiku for moderation classification, Opus for the rare expensive PersonaAgent calls (long-context, multi-account decisions).
# Pseudocode — every LLM call in Dr.Social goes through this client.
import os

class LlmRouterClient:
    def __init__(self, base_url="https://www.abfs.tech",
                 api_key=os.environ["ABFS_LLM_API_KEY"]):
        self.base_url = base_url
        self.api_key = api_key

    async def messages(self, *, model, system, messages, stream=False):
        # POST /v1/messages — Anthropic-shaped
        ...

    async def chat(self, *, model, messages, stream=False):
        # POST /v1/chat/completions — OpenAI-shaped
        ...
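A usage sketch for a buffered ContentAgent caption call — the model id uses the regex form described above, brief.summary is an illustrative field, and the response parsing assumes the Anthropic Messages shape:

llm = LlmRouterClient()

async def generate_caption(talent, brief):
    resp = await llm.messages(
        model="/sonnet/i",                      # "best available Sonnet" via the router's regex matching
        system=f"You write captions in {talent.display_name}'s voice.",
        messages=[{"role": "user", "content": brief.summary}],
        stream=False,
    )
    return resp["content"][0]["text"]           # Anthropic Messages response shape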
Every social network is reduced to one Python interface. Agents above call this interface — they never know if they're driving Instagram or LinkedIn.
from datetime import datetime
from typing import Protocol

class SocialPlatform(Protocol):
    name: str  # "instagram" | "tiktok" | "linkedin" | ...

    # account lifecycle (V1 = import-only)
    async def import_account(self, profile: ImportedProfile, backend: Backend) -> Account: ...
    async def login(self, account: Account, backend: Backend) -> Session: ...
    async def logout(self, account: Account) -> None: ...
    async def health_check(self, account: Account) -> AccountHealth: ...

    # create / update / delete
    async def publish_post(self, account: Account, content: Content) -> PostRef: ...
    async def edit_post(self, account: Account, post: PostRef, patch: ContentPatch) -> PostRef: ...
    async def delete_post(self, account: Account, post: PostRef) -> None: ...

    # read
    async def read_feed(self, account: Account, limit: int) -> list[Post]: ...
    async def read_post(self, account: Account, post: PostRef) -> Post: ...
    async def read_dms(self, account: Account, since: datetime) -> list[Message]: ...
    async def read_comments(self, account: Account, post: PostRef) -> list[Comment]: ...
    async def read_mentions(self, account: Account, since: datetime) -> list[Mention]: ...

    # respond
    async def send_dm(self, account: Account, thread: ThreadRef, body: str, media: list | None = None) -> None: ...
    async def reply_comment(self, account: Account, comment: CommentRef, body: str) -> None: ...
    async def react(self, account: Account, target: Ref, reaction: str) -> None: ...
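How the layer above resolves an adapter, as a sketch — the concrete adapter classes and the record_post_ref helper are illustrative:

PLATFORMS: dict[str, SocialPlatform] = {
    "instagram": InstagramAdapter(),   # platforms/instagram.py
    "tiktok": TikTokAdapter(),         # platforms/tiktok.py
    "linkedin": LinkedInAdapter(),     # platforms/linkedin.py
}

async def post_publish(account, piece):
    platform = PLATFORMS[account.platform]            # agents never import a concrete adapter
    ref = await platform.publish_post(account, piece.content)
    await record_post_ref(account, piece, ref)        # assumed persistence helper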
Posts, reels, stories, DMs, comments. V1: Device only (existing accounts imported via session-cookie paste). Browser-channel IG arrives in V2 §1.
Video upload, replies, DMs. V1: Device only. (MCP coverage is weak; Browser is V2.)
Articles, posts, DMs, comments. V1: MCP via Composio. Cleanest API surface — no Device needed.
Drop-in. Agents and dashboard pick them up automatically once registered.
V1 ships with two backends. The Browser channel is documented as V2 §1 — its absence is what makes V1 tractable.
| Operation | Device | MCP |
|---|---|---|
| Post text | ✓ | ✓ |
| Post video / reel | ✓ (best) | ~ |
| Read feed | ✓ | ✓ |
| Read DMs | ✓ | ~ |
| Send DM | ✓ | ~ |
| Reply to comment | ✓ | ✓ |
| Delete post | ✓ | ✓ |
| Account signup | → V2 §5 | → V2 §5 |
The matrix above is descriptive (what's possible). It is not a runtime fallback path — see the isolation rule below.
Every account is permanently bound to exactly one channel at enrollment (accounts.channel ∈ {device, mcp}; V2 expands to include browser). That binding is the account's identity. The orchestrator will refuse to act on an account through any other channel. There is no fallback, no failover, no "try the other backend." Ever.
- The channel column is set on enrollment and is immutable at the DB level (CHECK + trigger that rejects updates).
- The dispatcher reads account.channel at job dispatch and loads only that backend. No "required_channel" duplicate column on jobs — the account row is the source of truth.
- A device account has a device binding + cookies; an mcp account has an OAuth grant only. These never cross.

-- accounts.channel is set once and never changes
alter table accounts
add column channel text not null
check (channel in ('device', 'mcp')); -- V2: add 'browser'
create or replace function lock_channel() returns trigger as $$
begin
if NEW.channel is distinct from OLD.channel then
raise exception 'channel is immutable on account %', OLD.id;
end if;
return NEW;
end $$ language plpgsql;
create trigger accounts_channel_immutable
before update on accounts
for each row execute function lock_channel();
-- worker dispatch (no `required_channel` on jobs; the account row is truth)
job = claim_next_job()
account = load(job.account_id)
backend = load_backend(account.channel) -- ONLY this backend, no fallback
backend.execute(job)
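The load_backend step can stay a strict dict lookup — a sketch, assuming hypothetical DeviceBackend / McpBackend classes in backends/:

BACKENDS = {
    "device": DeviceBackend,   # Appium / XCUITest / ADB
    "mcp": McpBackend,         # Composio
}

def load_backend(channel: str):
    if channel not in BACKENDS:
        # unknown channel: hard error, never a fallback
        raise RuntimeError(f"no backend for channel {channel!r} — refusing to dispatch")
    return BACKENDS[channel]()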
V1 needs the absolute minimum to act on existing accounts. Anti-bot stealth, signup automation, 2FA relay, fingerprint pools, proxy fleets — all V2.
Supabase Vault — encrypted credentials stored in the same Postgres. Holds session cookies (Device channel) and OAuth refresh tokens (MCP channel). RLS restricts which service role can decrypt what; never decrypted into logs.
Talents (or operators with talent permission) paste in cookies / complete an OAuth grant once per account. Validated by a health-check call. No automated signup — that's V2.
Adapters report outcome (ok / soft-block / hard-block) on every call. V1 reaction: pause the account, alert the operator. No fingerprint rotation, no proxy rotation (no fingerprints or proxies in V1). Full burn-detection loop is V2 §4.
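The V1 reaction fits in a few lines — a sketch, assuming hypothetical pause_account / notify_operator helpers:

async def handle_adapter_outcome(account, outcome: str):
    if outcome == "ok":
        return
    # soft-block or hard-block: stop acting on this account and tell a human
    await pause_account(account.id, reason=outcome)
    await notify_operator(account.tenant_id,
                          f"{account.platform}/{account.handle} reported {outcome}; account paused")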
- Generated assets land in the content table.
- Each target account gets a content_queue row with appropriate aspect, length, hashtags.

ContentAgent decides what to make. The vendors below actually make it. V1 picks the smallest set that produces images, voiceover, subtitles, and edited video — without the V2-grade cost and quality penalty of text-to-video generation.
Image generation. Talent face consistency via per-talent LoRA fine-tunes seeded from talent_assets[kind='face']. One API key (REPLICATE_API_TOKEN) covers the FLUX family + Whisper + many other models, so swapping image generators is a config change, not a vendor change.
Cost shape: ~$0.03/image, pay-per-use. ~$1–3/mo per talent at V1 volumes.
Voice cloning (1–5 min of clean reference audio per talent → cloned voice profile) and streaming TTS. Opt-in per talent — Brand & Voice section in /me manages consent + reference uploads. Only vendor in the stack for which "the talent's own voice" is non-substitutable; the rest are commodity model markets.
Cost shape: $22–99/mo tiered subscription; minutes-of-output budgeted per tenant.
Voice memos in talent post requests get transcribed and used as the prompt. Long-form video uploads get a transcript that feeds caption generation + subtitle burn-in. Same Replicate key as image gen — no second vendor.
Cost shape: ~$0.005/min audio, pay-per-use.
Runs inside the Dr.Social worker process. No vendor, no fees. Handles cuts, aspect reformat, subtitle burn-in, BGM mixing, and still-image pan/zoom for composed reels.
Runway Gen-3 · Luma Dream Machine · Google Veo · Sora. All slow (30–90s per clip), expensive ($0.40–1.00 per 5s reel), and weak at talent-face consistency. V1 sidesteps the problem: talents either upload raw video that we edit (FFmpeg), or get "video reels" composed from FLUX-generated stills + ElevenLabs voiceover + FFmpeg pan/zoom. We add text-to-video when one of those falls short.
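A sketch of that stills-plus-voiceover path; generate_image / tts / compose_reel follow the stable media/ shape described later in this section, and the brief fields and keyword arguments are illustrative:

async def compose_reel_without_text_to_video(brief, talent):
    stills = [await generate_image(p, talent) for p in brief.scene_prompts]   # Replicate FLUX + face LoRA
    voiceover = await tts(brief.narration_text, talent)                       # ElevenLabs clone (opt-in)
    return await compose_reel(                                                # FFmpeg: pan/zoom, subtitles, BGM
        stills=stills,
        audio=voiceover,
        subtitles=brief.narration_text,
        aspect="9:16",
    )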
Suno and Udio sound great but don't have stable production APIs yet. V1 uses royalty-free libraries (Pixabay-style) or each talent's pre-uploaded brand-kit BGM in talent_assets[kind='bgm'].
When a talent submits a prompt-only request targeting TikTok, the pipeline composes a reel without text-to-video: FLUX-generated stills, ElevenLabs voiceover, subtitle burn-in, and FFmpeg pan/zoom. Vendor choices live in media/config.py.

The media/ module exposes a stable shape: generate_image(prompt, talent) → url, tts(text, talent) → url, transcribe(url) → text+timings, compose_reel(brief) → url. Swap vendors behind it without touching ContentAgent.

Content reaches the approval queue from two distinct sources — both treated equally downstream: AI-initiated drafts (PersonaAgent proposes a brief on the tick) and talent post requests (the talent asks for something specific via /me or email).
Post requests matter because most talents have specific things they want said or shown that the AI won't dream up on its own — a launch, a reaction, a behind-the-scenes moment. Media is optional: a request can be just a prompt + platforms (the system generates content from scratch in the talent's voice), just media + platforms (the system writes captions and cuts the assets for each platform), or both.
The talent's /me portal exposes a "Request a post" form with a prominent platform picker (chips per connected account), a prompt textarea, and an optional drag-and-drop area for media. Attachments upload directly to Supabase Storage (per-talent path, RLS-scoped). The submit creates one post_requests row.
Each talent gets a stable address <talent-slug>[email protected]
(e.g. [email protected]). DNS for dr-social.app is on Cloudflare; an Email Routing catch-all rule forwards every inbound message to one operator mailbox (currently [email protected] for the MVP). The Dr.Social worker IMAP-polls that mailbox every 30s with an App Password, parses the original To: header to identify the talent, drops attachments into Storage, transcribes any voice memos for the prompt, and creates the same post_requests row as the browser form. Processed messages get labeled drsocial/ingested and archived. Subject line is the prompt unless the body has one; platform tags #TT #IG #LI in the body override the talent's last-used platform selection.
- Cloudflare preserves the original To: header on forward — so we can still extract the talent slug from each message even though every email lands in one Gmail inbox.
- Anyone who emails the talent's intake address ([email protected]) can submit. That's fine when the address is shared only with the talent and their immediate collaborators (camera op, etc.). For tighter security, V2 adds a talent_request_senders allowlist (see V2 §7).
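A minimal sketch of that poll, using the stdlib imaplib/email modules; find_talent_by_slug and create_post_request are assumed helpers, and the dedup on gmail_message_id happens inside the latter:

import imaplib
import os
from email import message_from_bytes
from email.utils import parseaddr

def poll_intake_mailbox():
    imap = imaplib.IMAP4_SSL("imap.gmail.com")
    imap.login(os.environ["GMAIL_INTAKE_USER"], os.environ["GMAIL_INTAKE_APP_PASSWORD"])
    imap.select("INBOX")
    _, data = imap.search(None, "UNSEEN")
    for num in data[0].split():
        _, msg_data = imap.fetch(num, "(RFC822)")
        msg = message_from_bytes(msg_data[0][1])
        # the original To: survives the forward; its local part is the talent slug
        slug = parseaddr(msg["To"])[1].split("@")[0]
        talent = find_talent_by_slug(slug)            # assumed lookup; skip unknown slugs
        if talent is None:
            continue
        create_post_request(                          # assumed; dedups on gmail_message_id
            talent_id=talent.id,
            source="email",
            prompt_text=msg["Subject"],
            sender_email=parseaddr(msg["From"])[1],
            gmail_message_id=msg["Message-ID"],
        )
    imap.logout()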
- One content_queue row per target account is created, grouped under the same post_request_id in the talent's approval queue UI. The talent sees the request as a single "post idea" with stacked per-platform variants.
- Publishing only ever acts on approved rows.
- If the talent has opted into voice cloning (Brand & Voice in /me), transcribed voice memos can come back as polished narration in their voice.

post_requests (id, tenant_id, talent_id,
source CHECK in ('browser', 'email'),
prompt_text NULL, -- empty = let AI pick from pillars
target_account_ids[], -- one variant generated per
schedule_hint NULL,
sender_email NULL, -- populated for source='email'
gmail_message_id NULL UNIQUE, -- populated for source='email'; dedup key
status in ('received','analyzing','drafting','complete','failed'),
created_at, processed_at)
post_request_attachments (id, post_request_id, storage_path, mime, bytes, duration_s,
analysis_json NULL, -- scenes, faces, hooks, OCR …
transcript_text NULL)
-- zero rows if the request had no media
content_queue.origin enum: 'ai_initiated' | 'request_browser' | 'request_email'
content_queue.post_request_id NULL unless origin is 'request_*'
content_queue.rejection_reason captured at owner rejection — fed into talent style guide
Talents carry a slug column (e.g. aurora-lee). The catch-all email forwarding model means slugs collide across tenants, so uniqueness is enforced globally, not per tenant, for this one column.

The bare <slug>[email protected] address is used for post requests. Reserve +post, +settings, +inbox as future intents — the IMAP parser refuses unknown suffixes for now.

~14 tables. Every tenant-scoped table carries tenant_id NOT NULL and is gated by an RLS policy. Composite FKs (tenant_id, id) on every cross-table reference make the database refuse cross-tenant attachments.
tenants (id, name, plan, status,
default_tick_interval_seconds,
created_at)
-- one row per agency. Per-talent override → V2 §8
tenant_users (id, tenant_id, supabase_user_id,
role CHECK in ('owner','editor','viewer'))
talents (id, tenant_id, display_name, real_name,
bio_short, bio_long, timezone, language, status,
owner_user_id, -- the real human who must approve
primary_email, contact_emails[], phone,
created_at)
talent_profile (talent_id PK,
personality_md, stories_md,
goals_json,
face_ref_urls[], voice_clone_ref NULL,
brand_kit_json, style_guide_md,
content_pillars[], hashtag_preferences[], do_not_say[],
updated_at)
-- 1:1 with talents; split out only for size
talent_assets (id, talent_id, kind in ('face','voice','logo','wardrobe','bgm',
'picture','video','document'),
storage_url, hash, metadata_json, created_at)
accounts (id, tenant_id, talent_id, platform, handle, status,
channel CHECK (channel in ('device','mcp')) IMMUTABLE,
vault_ref, -- session cookies OR OAuth refresh
posting_schedule_json,
created_at)
-- one row per (platform, talent, channel)
Not in V1: fingerprints, iproxy_connections, devices, oauth_grants, sessions, ip_rotations. V1 stuffs both session cookies and OAuth refresh tokens into accounts.vault_ref (a single Vault entry per account; shape varies by channel).
content (id, tenant_id, talent_id, hash, type, asset_url,
caption, hashtags, generated_by, moderated, created_at)
content_queue (id, tenant_id, talent_id, account_id, content_id,
origin enum ('ai_initiated','request_browser','request_email'),
post_request_id NULL,
scheduled_for,
state CHECK in
('draft','awaiting_owner_approval','approved','rejected',
'queued','published','failed'),
approved_by_user_id, approved_at,
rejected_reason,
created_at, updated_at)
-- per-talent publish queue shown on /me
post_requests (id, tenant_id, talent_id,
prompt_text NULL,
target_account_ids[],
schedule_hint NULL,
status in ('received','analyzing','drafting','complete','failed'),
created_at, processed_at)
post_request_attachments (id, post_request_id, storage_path, mime, bytes, duration_s,
analysis_json NULL, transcript_text NULL)
threads (id, tenant_id, account_id, peer_handle, last_message_at)
-- V2 adds audience_identity_id (cross-platform identity resolution)
messages (id, tenant_id, thread_id, direction, body, media_urls,
classified_as, status, created_at)
jobs (id, tenant_id, account_id, kind, payload_json, status, run_after)
-- no required_channel column; account.channel is the source of truth
events (id, tenant_id, ts, kind, account_id, payload_json)
-- audit + activity stream
Invariants enforced in Postgres:
(1) accounts.channel is immutable (trigger blocks UPDATE).
(2) tenant_id consistency — every FK across tables uses composite (tenant_id, id), so the database refuses to attach an account in tenant A to a talent in tenant B.
(3) RLS policies on every tenant-scoped table.
audience_members, audience_identities, audience_interactions, identity_link_signals (V2 §6); fingerprints, iproxy_connections, devices, oauth_grants, sessions, ip_rotations (V2 §1–§4); talent_request_senders (V2 §7); talents.tick_interval_seconds (V2 §8).
Two server-rendered web UIs, two different audiences, one shared backend. Both are tenant-scoped — Supabase RLS enforces tenant isolation on every query. Python templates + htmx for partials; polling at ~5s for "live" panels. (React + Vite frontend → V2 §10. Supabase Realtime → V2 §9.)
Top-level navigation pivots around talents, not accounts. Agency staff see all talents under their tenant, run overrides, supervise inboxes, and watch system health.
Live activity stream, accounts grouped by talent, queue-next-24h snapshot, inbox-awaiting-operator, channel health, plus an analytics rollup panel (folded in — no standalone Analytics page in V1).
Every talent in the tenant, with status, accounts grouped, queue volume, profile completeness. Click into a talent to see their workspace.
Every managed account across every platform, with health, last-active, current backend, vault status. Per-talent filter.
All scheduled posts across the tenant: draft → moderated → awaiting owner approval → approved → queued → published. Moderation flags shown inline (no standalone Moderation page in V1). Operators can reorder, pull a piece, or force-approve (only when the talent has opted into operator-override).
In-character reply drafts from PersonaAgent pending approval, escalations, conversation timeline per peer, per-account inbox.
Integrations (Supabase, abfs.tech LLM router key, Composio), team & roles, defaults (tenant tick cadence, content language, posting cap), billing, danger zone.
Talent portal (/me)
A standalone single-page workspace for the talent themselves. Scoped to one talent — the one whose
owner_user_id matches the logged-in Supabase Auth user. Different chrome from the operator dashboard
(warmer accent, no tenant switcher, no cross-talent navigation). Mockup: dashboard/me.html.
Pending content piece-by-piece, grouped by post idea with per-platform variants stacked under each idea. Tagged AI-initiated or from your request. The talent can approve each platform independently, or click "Approve all N variants" on the parent. Reject with a reason (fed into the style guide), edit, or reschedule.
Form with a prominent platform picker (chips per connected account), a prompt textarea, and an optional drag-and-drop area for media. Feeds the Post Request Pipeline. A "recent requests" list shows each request's status (analyzing → drafting → in queue).
Display name, tagline, short bio, personality traits, goals this quarter, content pillars, target audience. These fields drive every PersonaAgent draft.
Brand colors, logo/wordmark, face reference images, voice samples (for opt-in audio cloning), tone-of-voice positive examples, never-say list. ModerationAgent hard-blocks anything in the never-say list.
Per-platform handles with health and channel binding visible. Re-authenticate on demand. Adding a new account in V1 = importing existing credentials (Device cookies OR MCP OAuth); automated signup is V2.
Metadata of every stored credential — secrets are never displayed, only "last verified" timestamps and expiry. A red "revoke everything & pause my persona" panic switch.
Daily posting cap, blackout windows, timezone, default content language, two-factor on the portal, and the critical operator-override toggle (off by default). Per-talent tick interval → V2 §8; V1 inherits from the tenant.
The talent's own analytics — total followers and 7d/30d growth, per-platform breakdown, top posts, audience composition + geographies, what content patterns are above/below their own benchmark. Aggregated only across their accounts.
Scheduling lives inside Supabase via pg_cron. A row in a jobs table is inserted on every tick;
the worker process polls jobs WHERE status='pending' using SELECT … FOR UPDATE SKIP LOCKED — a clean
Postgres-native queue with no Redis or Celery.
V1 has a single cadence per tenant — tenants.default_tick_interval_seconds — driving every talent in that tenant.
The Workflow handler iterates the tenant's active talents on each tick and fans out per talent.
Per-talent override and dynamic adjustment (warm-up, blackouts, burn back-off) → V2 §8.
- pg_cron inserts a tick job at the tenant's configured interval → Workflow picks it up and fans out.
- Per-account posting times live in accounts.posting_schedule_json. The tick checks "is it time?" and enqueues a publish job — but only for items already approved in content_queue.

-- pg_cron entry — tenant cadence configurable, not hard-coded
select cron.schedule(
'drsocial-baseline-tick',
'* * * * *', -- once per minute baseline
$$ insert into jobs (kind, payload) values ('tick', '{}') $$
);
-- Workflow handler (per tick, per tenant)
for tenant in tenants.active():
if not tenant.is_tick_due(now):
continue
for talent in tenant.active_talents():
fan_out(talent, now)
-- Atomic job pick
select id, kind, payload
from jobs
where status = 'pending' and run_after <= now()
order by run_after
limit 1
for update skip locked;
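A sketch of the worker loop around that query — asyncpg and the SUPABASE_DB_URL connection string are assumptions here, and handle_job is a hypothetical dispatcher:

import asyncio
import os
import asyncpg

CLAIM_SQL = """
update jobs set status = 'running'
 where id = (select id from jobs
              where status = 'pending' and run_after <= now()
              order by run_after
              limit 1
              for update skip locked)
returning id, kind, payload_json;
"""

async def worker_loop():
    conn = await asyncpg.connect(os.environ["SUPABASE_DB_URL"])   # assumed env var
    while True:
        job = await conn.fetchrow(CLAIM_SQL)
        if job is None:
            await asyncio.sleep(1)                                # nothing due — brief back-off
            continue
        await handle_job(job)                                     # hypothetical dispatcher
        await conn.execute("update jobs set status = 'done' where id = $1", job["id"])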
One Railway service, one managed backend (Supabase), one external LLM provider (abfs.tech router). That's the whole infrastructure for V1.
Python FastAPI serving REST + the htmx-rendered dashboards, plus the asyncio job loop running as a parallel task in the same process. One container, one port, one restart loop. Splitting into two services → V2 §12 when load demands it.
Postgres + Auth + Storage + Vault + pg_cron. The only database. The only secret store. The only object store. The only scheduler. Realtime + Edge Functions reserved for V2 features.
All LLM calls go to https://www.abfs.tech/v1/ — see §5. Bearer-auth API key per environment, stored in Supabase Vault. One billing surface for every model call Dr.Social makes.
Python 3.12 · asyncio · uv
FastAPI
Server-rendered Python templates + htmx. No Node, no Vite, no React. React + Vite → V2 §10.
abfs.tech LLM router — Anthropic + OpenAI-compatible endpoints
Appium / XCUITest / W3C / ADB (off-cloud, on a local Mac/PC; reaches API via outbound HTTPS)
Composio.dev MCP client
Replicate — FLUX 1.1 Pro (images), Whisper-large (transcription), face-LoRA fine-tunes per talent. One key, swappable models.
ElevenLabs — per-talent voice clone (opt-in) + streaming TTS for narration on generated reels.
FFmpeg in-process. Cuts, aspect reformat, subtitle burn-in, BGM. Text-to-video gen → V2.
Cloudflare Email Routing catch-all on dr-social.app → operator Gmail mailbox → IMAP poll by the worker every 30s. No SES/Postmark in V1.
Supabase Postgres (schema in plain SQL or one tiny migration tool)
Supabase Vault (no Vaultwarden)
Supabase Storage (no S3 / GCS / Drive)
Postgres jobs table + pg_cron (no Redis, no Celery)
htmx polling at 5s. Supabase Realtime → V2 §9.
# Auto-set by Railway
PORT
RAILWAY_ENVIRONMENT_NAME
# Supabase (one set per env)
SUPABASE_URL
SUPABASE_SERVICE_KEY
SUPABASE_ANON_KEY
# LLM provider (abfs.tech router)
ABFS_LLM_API_KEY # bearer for https://www.abfs.tech/v1/
# Media generation
REPLICATE_API_TOKEN # FLUX, Whisper, etc.
ELEVENLABS_API_KEY # voice clone + TTS
# MCP backend (LinkedIn, etc.)
COMPOSIO_API_KEY
# Device backend (on-prem hub)
DEVICE_HUB_URL # e.g. https://devicehub.yourbiz.com:4723
DEVICE_HUB_TOKEN # bearer for the device hub
# Email post-request intake (Gmail IMAP poll, V1 MVP shape)
GMAIL_INTAKE_USER # e.g. [email protected]
GMAIL_INTAKE_APP_PASSWORD # Gmail App Password (requires 2FA on the account)
Two Railway environments host the design-phase mockups via serve.py (a small stdlib HTTP server that templates the operator-dashboard fragments and serves the talent portal + this doc). The real FastAPI service arrives when implementation begins.
| Environment | URL | Deploys from | Gate |
|---|---|---|---|
| Production | drsocial-production.up.railway.app ↗ | master only | (planned: CI check once tests exist) |
| Dev | drsocial-dev.up.railway.app ↗ | any branch (auto on push) | None |
Dr.Social/
├── docs/
│ ├── ARCHITECTURE.html # this document (V1)
│ └── ARCHITECTURE_V2.html # deferred features
├── dashboard/
│ ├── _layout.html # operator-portal shell ({{TITLE}}, {{CONTENT}}, …)
│ ├── _overview.html # operator-portal page fragments (5)
│ ├── _talents.html
│ ├── _accounts.html
│ ├── _queue.html
│ ├── _inbox.html
│ ├── _settings.html
│ └── me.html # standalone talent self-service portal
├── serve.py # tiny stdlib HTTP server that renders the fragments
├── Procfile # web: python serve.py (Railway start command)
├── pyproject.toml # minimal — signals Python project to Railway nixpacks
└── README.md
Dr.Social/
├── src/drsocial/
│ ├── agents/ # the 4 LLM agents — each thin, all call llm_router
│ │ ├── persona.py # persona_decide_* — stateless functions
│ │ ├── content.py # content_generate
│ │ ├── inbox.py # inbox_fetch + classify + reply-draft
│ │ └── moderation.py # moderation_approve, moderation_approve_inbound
│ ├── services/ # plain Python — no agent ceremony
│ │ ├── workflow.py # pg_cron-driven tick handler
│ │ ├── account_importer.py # session-cookie / OAuth import + health check
│ │ ├── request_agent.py # raw-media preprocessing (scenes/OCR/transcript)
│ │ ├── post_publisher.py # publishes approved content via platform adapter
│ │ └── analytics.py # SQL rollups
│ ├── media/ # media generation + editing — stable API, swappable vendors
│ │ ├── images.py # generate_image(prompt, talent) — Replicate FLUX + face LoRA
│ │ ├── audio.py # tts(text, talent), clone_voice(refs) — ElevenLabs
│ │ ├── transcribe.py # Whisper via Replicate
│ │ └── ffmpeg.py # reel composer, aspect reformat, subtitle burn-in, BGM
│ ├── intake/ # post-request ingress
│ │ ├── browser.py # /me form handler — multipart upload to Storage
│ │ └── gmail_imap.py # IMAP poller; parses To: → talent_slug; dedup via gmail_message_id
│ ├── platforms/ # adapter layer
│ │ ├── base.py # SocialPlatform protocol
│ │ ├── instagram.py
│ │ ├── tiktok.py
│ │ └── linkedin.py
│ ├── backends/ # execution
│ │ ├── device.py # Appium / XCUITest / ADB client
│ │ └── mcp.py # Composio client
│ ├── llm_router.py # thin client for https://www.abfs.tech/v1/
│ ├── supabase_client.py # one client (DB + Storage + Vault + Auth)
│ ├── jobs.py # pg_cron-driven queue helpers
│ ├── api.py # FastAPI — REST + serves dashboard + serves /me
│ ├── settings.py
│ └── main.py # entrypoint: runs api + worker in one asyncio loop
├── dashboard/ # htmx + Jinja templates, served by api.py
├── supabase/
│ ├── migrations/ # plain .sql files
│ └── seed.sql
├── tests/
├── Dockerfile # single image — runs api + worker
├── railway.json # single-service Railway config
└── pyproject.toml
In V2: agents/ stays at 4; services/ grows with fingerprints.py, iproxy.py, burn_detection.py, signup.py, audience_resolver.py;
backends/browser.py appears;
supabase/functions/inbound_email/ appears;
dashboard/ swaps htmx for React + Vite if V2 §10 fires;
the single Dockerfile splits into Dockerfile.api + Dockerfile.worker if V2 §12 fires.