vFlank-webapp — UI/UX & backend design¶

Status: proposed (2026-06-15). The detailed design for the separate vFlank-webapp repo, now that the library prerequisites shipped in 0.5.0 (vflank.pipeline.run_small/run_fusion, buffer input). Scope is v1 per web-app-and-hosting.md: single-variant / tiny-batch, modes A/B (reference + gnomAD, no BAM, no PHI), UCSC + gnomAD APIs, FastAPI on Render. BAM/WASM is v2.

What the library now gives the webapp¶

The webapp is thin because 0.5.0 did the work: one call does everything.

from vflank.pipeline import run_small      # or run_fusion
result = run_small(
    uploaded_buffer, genome_build="hg19",
    ref_source="api", pop_source="api",     # no local files
    flank=200, emit_primer3=want_primer3,
)
# result: records, rows, skip_messages, skip_breakdown, n_processed/n_skipped,
#         ref_api_requests, api_requests

The webapp adds only: HTTP, a form, a record cap, error→status mapping, and rendering. All input validation is inherited (bad build/options → VflankError; bad rows → per-row skips).

Backend¶

Stack¶

FastAPI + uvicorn; Jinja2 for the one server-rendered page; Pydantic for the request/response models; no database. pip install "vflank>=0.5". Stateless → matches Render's ephemeral filesystem.

Endpoints¶

Method · path	Purpose
`GET /`	the single-page app (grid + upload, server-rendered shell)
`POST /api/validate`	parse-only (no sequence fetches): per-row skip reasons for live grid validation
`POST /api/run`	run the pipeline → JSON `{records, rows, skips, stats}`
`GET /api/run.fasta` (or `?format=fasta`)	same, returned as a FASTA download
`GET /healthz`	liveness (for keep-warm pings)

Request / response (Pydantic)¶

class RunRequest(BaseModel):
    mode: Literal["small", "fusion"]
    genome_build: Literal["hg19", "hg38"]
    text: str                       # pasted table OR assembled from the structured form
    flank: int = Field(200, ge=10, le=400)
    af_threshold: float = Field(0.001, ge=0, le=1)
    pop_data: Literal["genome", "exome", "both"] = "genome"
    emit_primer3: bool = False
    dedup: bool = True

class RunResponse(BaseModel):
    records: list[str]              # FASTA records (raw + masked)
    rows: list[dict]                # per-variant detail
    skips: dict[str, int]           # categorised skip breakdown
    skip_examples: list[str]
    stats: dict                     # processed/skipped, api requests, elapsed
    primer3: str | None             # Boulder-IO text if requested

The handler writes text to an in-memory io.StringIO and calls run_small/run_fusion(buf, ref_source="api", pop_source="api", …).

Policy the service owns (not the library)¶

≤ 10-record cap after parse → 422 with "the hosted tool accepts ≤10 records; use the CLI/PyPI package for bulk." Protects the shared instance and the gnomAD rate limit. Deliberately not a cap in run_small (it would regress local batch users).
Max upload / body size (e.g. 256 KB) at the ASGI layer.
Error mapping: VflankError/MafError/SvError → 400/422 with the message; per-row skips are data, returned in the body, never 500s.

Concurrency, rate-limit, cache — the one real subtlety¶

run_small is synchronous (urllib + pandas), so run it in a threadpool (await run_in_threadpool(run_small, …)) to keep the event loop free.

The catch: run_small builds a fresh ReferenceApiSource per call, so its ~1 req/s UCSC throttle is per-request, not global. Under concurrency, N requests can each hit UCSC at 1/s → N req/s total, which UCSC may block (it has blocked apps before). Options, cheapest first:

v1, low traffic: accept it; add a small per-process response cache (LRU keyed on (mode, build, sha256(text), params)) so repeats are free, and a per-IP request limiter (slowapi). Good enough for a demo/internal tool.
Shared source / global limiter: have the app hold one long-lived ReferenceApiSource (one throttle + one window cache for everyone) and drive the pipeline via the streaming primitives iter_small/collect with that injected source — instead of run_small building its own. This is the clean answer and motivates a tiny future library change: let run_small/ run_fusion accept optional pre-built reference=/gnomad= sources (inject over build). Recommended once traffic is non-trivial.

Recommendation: ship v1 with option 1 + a response cache; add the inject-sources enhancement to vflank when option 2 is needed (it's a small, non-breaking addition with a real caller — not speculative then).

Deployment (Render free tier)¶

render.yaml: a web service, pip install -e ., uvicorn app:app --host 0.0.0.0 --port $PORT. Plan for the free-tier traits already noted in the hosting note: cold-start after idle (acceptable; optional keep-warm cron hitting /healthz), single instance (the natural home for the cache + limiter), latency dominated by the external APIs.

Security¶

No auth (public, no PHI). HTTPS via Render. Same-origin server-rendered page → no CORS needed. Strict input size + record caps. No secrets (the APIs are unauthenticated). Add slowapi per-IP limiting to be polite to UCSC/gnomAD.

Frontend — UI/UX¶

One page, three moments: describe → run → read the result¶

sequenceDiagram
    participant U as User
    participant B as Browser (form + htmx)
    participant API as FastAPI /api/run
    participant V as vflank.run_small
    U->>B: pick mode/build, enter variant(s)
    B->>API: POST form (htmx, no full reload)
    API->>V: run_small(buffer, ref_source=api, pop_source=api)
    V-->>API: RunResult
    API-->>B: swap in the results panel
    B-->>U: masked flanks, downloads, skips

Use mouse to pan and zoom

Layout (top to bottom)¶

Header — name + one-line "Mask variant flanks for ddPCR assay design — no install, no upload of patient data." Slate+amber "Highlighter" palette, reused from the docs for brand continuity.
Mode — a segmented toggle: Small variant ↔ Fusion.
Build — GRCh37/hg19 ↔ GRCh38/hg38.
Input — an editable grid + file upload (one unified component). An Tabulator grid (MIT, one CDN <script>, no build step) is the primary input — one row = a single variant, a handful of rows = a small batch, so it subsumes the old "structured form / + add row".
Three ways in, all landing in the same grid: type directly; paste a block from Excel (Tabulator's clipboard module); or upload a MAF/TSV file, which populates the grid so the user can review and edit it before running. A "Load example" button seeds a known variant (BRAF V600E).
Columns switch by mode: small → Chromosome · Start · End · Ref · Alt · Gene; fusion → chr1 · pos1 · str1 · chr2 · pos2 · str2 · name.
Validate before submit (no duplicated logic): Tabulator does cheap client-side hints (numeric position, ACGT/- allele, non-empty chrom) for instant per-cell feedback; a debounced POST /api/validate runs vflank's own load_maf + parse_variant_row (parse-only — no UCSC/ gnomAD fetches) and returns authoritative per-row reasons, which the grid paints onto the offending cells. So validation is vflank-correct, feels instant, and is not reimplemented in JS. On Run, table.validate() gates the POST; the server's skips remain the final word.
The browser assembles the MAF/breakpoint TSV from the grid and posts it — no make_variant library change needed.
Advanced (collapsed): flank size (slider 10–400, default 200), AF threshold, pop-data (genome/exome/both), "also emit Primer3", dedup.
Run button → loading state ("Querying reference + gnomAD…", a spinner; latency is the external APIs).
Results panel (swapped in via htmx):
Per variant: the masked flank rendered monospace, the variant shown as [REF/ALT] and masked Ns highlighted in amber — the same "Reading a record" treatment as the docs. A Raw ↔ Masked toggle (the --records idea).
Downloads: FASTA, Primer3 (if emitted), TSV report.
Skips surfaced inline as a friendly list ("2 rows skipped — non-numeric position"), never as an error.
A quiet stats line: processed / skipped · reference+gnomAD API calls · time.

Principles¶

Progressive enhancement. The GET / form works without JS (full POST → rendered results). htmx enhances it: POST the form, swap only the results panel — no SPA build step, tiny payload, matches the "no JS toolchain for v1" decision.
No patient-data framing. Copy makes clear only public reference + gnomAD data flow through the tool (v1 has no BAM); reinforces trust.
Accessibility. Keyboard-navigable, ARIA labels, never color-only (masked regions get a mark and an underline); respects reduced-motion.
Responsive. The form is small; works on a phone.

Complete repo layout (vFlank-webapp)¶

Built to the same quality bar as vflank — git-flow, a ruff/mypy/pytest gate, CI, docs, containerised — right-sized for an app (no PyPI publish; "release" = a Render deploy).

vFlank-webapp/
├── app/
│   ├── __init__.py        __version__
│   ├── main.py            FastAPI app + routes (/, /api/validate, /api/run, /healthz)
│   ├── models.py          RunRequest / RunResponse / ValidateResponse (Pydantic)
│   ├── service.py         thin layer over vflank.run_small/run_fusion: cap, cache, error map
│   ├── validate.py        parse-only validation (vflank load_maf + parse_*_row, no fetches)
│   ├── grid.py            grid-rows <-> MAF/TSV text assembly (+ column schema per mode)
│   ├── settings.py        env config (max records, body size, cache size, rate limit)
│   ├── templates/         index.html (Jinja2 shell) + results/_partial.html (htmx swap)
│   └── static/            style.css (slate+amber), app.js (Tabulator init + htmx glue),
│                          vendored tabulator.min.js/css, htmx.min.js
├── tests/
│   ├── test_api.py        TestClient: happy path, ≤10 cap (422), bad input, skips surfaced
│   ├── test_validate.py   /api/validate row reasons match vflank's parse output
│   ├── test_grid.py       grid<->TSV assembly round-trips (unit, no server)
│   └── conftest.py        a tiny indexed FASTA fixture / mock the API sources
├── docs/                  (MkDocs Material — mirrors vflank, or a strong README only)
│   ├── index.md  ·  deploy.md  ·  api.md  ·  developer.md
├── .github/workflows/
│   ├── ci.yml             ruff + mypy + pytest (push to main/develop + PRs)
│   └── docs.yml           (optional) build/deploy the docs
├── pyproject.toml         deps: fastapi, uvicorn[standard], jinja2, python-multipart,
│                          vflank>=0.5,<0.6, slowapi ; dev: pytest, httpx, ruff, mypy
├── render.yaml            Render web service (IaC)
├── Dockerfile             container (local parity; optional GHCR image)
├── .dockerignore · .gitignore
├── CLAUDE.md              working guide for this repo (gate, git-flow, deploy)
├── CONTRIBUTING.md · CHANGELOG.md · LICENSE (Apache-2.0, matches vflank)
└── README.md              what it is, run locally, deploy, link to the docs

Conventions carried over from vflank: git-flow (main = deployed, develop = integration), the ruff + mypy + pytest gate before any change, typed code, __version__ + a CHANGELOG, the slate+amber palette. Dropped (not applicable to an app): PyPI/OIDC publish and versioned mike docs — deploys are continuous via Render on push to main; a Docker image to GHCR is optional for portability. vflank is pinned to a compatible range (>=0.5,<0.6) and bumped deliberately.

Phasing¶

v1 — the above: structured/paste input, modes A/B, API sources, downloads.
v1.1 — if a richer single-variant form is wanted server-side, add vflank.core.make_variant/make_fusion (the deferred builders) so structured input can skip the client-side table assembly; add the inject-sources option to run_small for a shared throttle/cache under load.
v2 — BAM via client-side WASM (biowasm/Aioli) + a JS port of the pure overlay; PHI stays in the browser. Likely a static/hybrid front-end then.

Open questions¶

Keep-warm vs. accept cold starts on the free tier (start by accepting).
Per-IP limit thresholds (be conservative toward UCSC/gnomAD).
Whether to show a small sequence-logo / position ruler in the result (nice, not v1).