What we buildWorkAboutHostingContactStart a project
← Back to work
Automation · AI

Aptos Content Pipeline

Trending topic in, publish-ready short-form video out — fully automated, across a whole network of accounts.

0Videos / day target
0Connected accounts
0+Docs in the RAG base
0Platforms, auto-published
The problem

Why this had to exist.

Producing a single short-form video the right way takes a content researcher, a scriptwriter, a video editor, a motion-graphics artist, and a social-media manager — then multiply that by a network of accounts and a 60-video-a-day target and it simply doesn’t scale by hand. Trends move in hours; manual research, scripting, editing, captioning and cross-platform publishing move in days. The challenge was to compress that entire chain into an automated system that still looks hand-crafted, stays factually grounded in real Aptos research, and produces genuinely distinct content per account rather than the same clip reposted thirty times.

What we built

An end-to-end content pipeline

The Content Pipeline transforms trending social topics into publish-ready short-form video for the Aptos blockchain ecosystem. It monitors social platforms for momentum, generates research-backed scripts grounded in a curated knowledge base, personalizes them to distinct account voices, produces full videos with AI presenters and supporting visuals, layers on branded logo overlays and animated captions, and distributes across TikTok, YouTube Shorts, Instagram Reels and Facebook Reels. It is not a single-account tool — it runs a network of accounts, each with its own personality, voice, presenter and style, all coordinated by a resumable Python orchestration layer over 7+ external services.

How it works

Seven stages, fully automated

One trending topic flows through seven stages and comes out the other side as multiple unique, publish-ready videos — one per account, each authentically its own.

1

Topic Discovery

Scans TikTok for videos gaining traction across Aptos-relevant hashtags and keywords, applies engagement filters, then uses AI to extract and categorize the real topics — regulation, adoption, payments, institutional moves — with confidence scoring. One intentional human checkpoint keeps strategy aligned. Hours of trend monitoring compressed into minutes.

2

Script Generation

Produces a research-backed narration script tuned for 30–40s. A RAG architecture queries a knowledge base of 47 technical articles, 5 academic papers, 17 narrative documents and 136 Aptos Improvement Proposals, tracks previously written content to vary the narrative angle, and integrates live market data. Output is structured as hook / body / CTA.

3

Script Personalization

Rewrites the base script to match a specific account’s voice, tone and style — defined by a Personality Profile and a Video Style Guide. A full rewrite of sentence structure, word choice, energy and pacing, not token substitution. This is the core enabler for multi-account scaling: same facts, genuinely distinct delivery.

4

Video Production

The most complex stage — a 12-step sub-pipeline in four phases: script prep with AI placing 10–13 visual cut-away markers; per-account TTS narration with word-level transcription for frame-accurate timing; parallel asset generation of presenter footage and b-roll, where intelligent rendering only generates presenter footage for moments the presenter is actually visible; then final assembly compositing every asset with transitions. Coordinates 4+ services and dozens of intermediate files.

5

Visual Overlays

Detects mentions of known entities — cryptos, companies, platforms — and overlays animated branded logos at the exact moment each is named. Alias resolution maps “Ripple” / “XRP” / “$XRP” to one entity, word-level timestamps place the overlay, and AI selects the animation style (fade, swing, bounce, zoom, shake) by context. Makes automated content look hand-crafted.

6

Captions

Animated word-by-word captions precisely synced to narration. Word-level transcription is grouped into 1–2 word display units matching speech rhythm, rendered in bold high-contrast fonts with a dark stroke, lower-third positioning, current-word highlighting and scale-in animation. Broadcast-quality captions in minutes instead of 30–60 minutes by hand.

7

Publishing

Distributes across TikTok, YouTube Shorts, Instagram Reels and Facebook Reels through a unified integration, with one dashboard tracking followers, views and engagement across all 30 accounts. Tuned for 60 videos/day with staggered publish windows and rate-limit balancing.

Why it scales

Multi-account by configuration, not by code

Each account is a configuration, not a code fork — an account config (name, platform, voice, avatar) plus a personality profile plus a video style guide. Base research and the source script happen once per topic; personalization, production and enhancement then run per account, with isolated outputs and full run history.

Adding a new account requires zero code changes — just configuration.
A 205+ document knowledge base with category-aware retrieval grounds every script in real Aptos research.
Fully resumable execution — if any stage fails, the run picks up exactly where it left off without re-running earlier stages.
Intelligent rendering optimization only generates AI presenter footage for visible segments — custom cost-engineering on the single most expensive operation.
Frame-accurate synchronization: word-level audio transcription drives timing for cut-aways, logo overlays and captions alike.
Tech & architecture

What it's made of.

A Python orchestration layer coordinating 7+ external services — parallelized throughout and fully resumable from any stage.

Python orchestrationAI / LLMsRAG retrievalText-to-speechAI avatar videoWord-level transcriptionMotion graphicsVideo compositingMulti-platform distributionParallel + resumable jobs
Your move

Want something like this?

We design, build and run custom software end-to-end. Tell us the problem — we'll build the system that solves it.