AI transcription, processed exclusively in Europe.
Klangskript transcribes and translates your audio and video data with state-of-the-art AI — no US cloud, no permanent storage, no compromises on data protection. Built for those who know exactly where their data needs to be.
Klangskript
Sicher. Europäisch. Verständlich.
AI transcription is powerful. But where does it process your data?
The leading transcription and translation services are built on US cloud infrastructure. This means: even when European servers are used, operators like Amazon, Microsoft, Google or OpenAI are subject to the US CLOUD Act. US authorities can demand data handover — regardless of physical storage location.
For doctors and medical staff
Transcribing patient conversations or dictations via US cloud is incompatible with medical confidentiality under § 203 StGB. A violation is a criminal offence — the practical consequence: AI transcription is effectively blocked in most practices.
For lawyers, notaries and tax consultants
Client privilege and attorney confidentiality effectively preclude the use of US-based cloud services for sensitive conversation content. The legal uncertainty is too high, the liability risks too great.
For content creators, students and freelancers
Even without legal confidentiality obligations, the question remains: why should your lecture recordings, interview footage or research materials become training data for US models?
Klangskript is the answer to this dilemma.
Our architecture is uncompromisingly European — and technically verifiable.
Klangskript was built from the ground up as a sovereign alternative. Three principles define our infrastructure.
100% European infrastructure
Not a single byte leaves European soil.
Our orchestrator server runs with a German provider. The AI compute nodes are booted exclusively in German and European data centres — with providers operated entirely within Europe. Neither our infrastructure nor our subcontractors are subject to US law or the CLOUD Act.
Zero Data Retention through ephemeral processing
Your audio data never touches a hard drive.
A dedicated GPU instance is spun up for each session. AI model processing takes place exclusively in the volatile memory (RAM) of this instance. After the session ends, the memory is actively overwritten and the instance is irrevocably destroyed. No copy, no log file, no training dataset exists.
End-to-end encrypted transport
Military-grade encryption — across the entire chain.
Data transfer between your device, our orchestrator and the AI compute nodes uses dynamically generated WireGuard VPN tunnels. Each session uses its own keys. No third party — not even us at transport level — can view the transmission.
What we do — and what we never do.
At Klangskript, privacy isn't a setting. It's the architecture. Two sides of the same decision.
What we do
- Processing exclusively in European data centres
- § 203-suitable — contractually assured in the DPA
- Data processing under Art. 28 GDPR
- End-to-end encrypted between all components
- Zero data retention — volatile memory only
What we never do
- No US cloud, no CLOUD Act access
- No US AI services
- No tracking, no analytics
- No selling or sharing your data
- No AI training on your data
Klangskript serves two worlds with the same infrastructure.
Whether you are a student transcribing a lecture or a doctor conducting a patient consultation with a non-German-speaking family: the underlying security architecture is identical. What differs is the feature set and billing model.
For students, content creators and freelancers
Fast, precise transcription and translation without legal compliance requirements — but with the same level of data protection as our enterprise solution.
- File upload: Upload audio and video files, process in the background, retrieve results
- High transcription quality: faster-whisper large-v3 as the standard model
- Hallucination-free translation: Our base variant uses classic translation models (Opus-MT) instead of generative LLMs — your transcript is translated, not rewritten
- Premium option: Structured summaries, chapter marking and filler word removal using European-licensed open-weight models
- Pre-paid model: Buy credit in advance, billed per audio second, no surprises
- No subscription, no minimum term
Subtitling YouTube videos and podcasts · Transcribing lectures, seminars and interviews · Dictation and voice note transcription
For doctors, law firms and companies with legal confidentiality obligations
Booked sessions with a dedicated GPU instance, highest model quality and contractually guaranteed § 203 StGB compliance.
- Dedicated session booking: Reserve timeslots with 10 minutes lead time — exclusive GPU instance for your session only
- Two-phase processing: Live transcription during the conversation (latency under 2 seconds), high-precision post-processing with full conversation context after the session
- Two modes: Precision (recommended, post-processing with maximum quality) or Simultaneous (live translation for real-time communication)
- Structured outputs: Automatic generation of medical letter drafts, case notes or interview protocols from templates
- Premium AI models: Gemma 4 26B MoE for translation and structuring, faster-whisper large-v3 for transcription, pyannote 3.1 for speaker recognition
- Contractual guarantees: Data processing agreement under GDPR Art. 28, § 203 StGB commitment declarations for the entire subcontractor chain
- SLA-backed subscriptions: For practices and law firms with regular demand
Doctor-patient consultations with non-German-speaking patients · Dictations for medical letters, surgical reports, medical histories · Client meetings, witness statements, court transcripts · Confidential board meetings
The B2B pipeline is currently in pilot operation. We onboard selected practices and law firms personally.
Meetings in a foreign language? No more comprehension gaps.
Anyone who regularly participates in English-language meetings without being a native speaker knows the situation: an unfamiliar technical term comes up, a moment of uncertainty arises — and while you are still thinking about the meaning, the speaker is already three sentences ahead. The gap grows, the context is lost.
Available today: Post-meeting transcription and translation
Internal meetings are already routinely recorded in many companies. Klangskript enables you to upload these recordings afterwards and have them fully processed:
- Transcription of the entire meeting using faster-whisper large-v3 (90+ languages)
- Speaker recognition: Automatic attribution of who spoke when
- Translation into your own language — either hallucination-free (basic) or context-aware (premium)
- Result: A complete, searchable protocol in your own language
Coming soon: Live translation during the meeting
For situations where comprehension gaps cannot wait until after the meeting, we are developing live translation via the browser: microphone access on your device, real-time streaming to Klangskript, translation with under two seconds latency — read along on screen while the meeting is happening.
Availability will be announced separately.How Klangskript works technically.
We believe trust is built through transparency. That is why we describe our architecture openly.
The Orchestrator
- Location: Germany (currently Nuremberg)
- Technology stack: React, FastAPI, PostgreSQL, Redis
- Function: User management, billing, cloud resource orchestration
- Persistent data: exclusively metadata (account balance, session IDs, billing data) — never audio or transcription content
The AI compute nodes
- Location: German and European data centres only
- Hardware: NVIDIA H200 (141 GB VRAM) for B2B, NVIDIA L4 (24 GB VRAM) for B2C
- Lifetime: only for the duration of the respective session
- Storage: exclusively volatile RAM, no persistent disk for user data
- Network isolation: WireGuard tunnel as the only connection to the orchestrator
AI models in use
| Model | Function | Licence | Execution |
|---|---|---|---|
| faster-whisper large-v3 | Transcription | MIT | European GPU |
| pyannote 3.1 | Speaker recognition (diarization) | MIT | European GPU |
| Opus-MT (Helsinki-NLP) | Translation (B2C basic) | CC-BY | European GPU |
| Gemma 4 E4B | Structuring (B2C premium) | Gemma Licence | European GPU |
| Gemma 4 26B MoE | Premium translation & structuring (B2B) | Gemma Licence | European GPU |
All models run as self-hosted instances in our infrastructure. No external API calls to third parties. No telemetry to model providers.
Compliance chain
- Orchestrator provider: German company, standard GDPR contract
- GPU provider: operated entirely within Europe, no US parent
- Data processing: Contractual chain under GDPR Art. 28 across all subcontractors
- Special obligations for B2B: § 203 StGB commitment declarations for all relevant sub-processors
What we do not do
- No use of your data for model training
- No disclosure to third parties — except where legally required
- No advertising or analytics trackers, no cookie banner on this website
- No use of US-based CDNs for sensitive content
Frequently asked questions
How does Klangskript differ from providers like Otter.ai, Descript or Trint?
Is my data used to train AI models?
What happens technically when a session ends?
Can Klangskript be compelled by US authorities to hand over data?
How fast is live transcription?
Which languages are supported?
Can I use Klangskript already?
Who is behind Klangskript?
Can I use Klangskript for transcribing and translating company meetings?
Can I transcribe YouTube videos or podcast episodes via a link?
Interested in Klangskript?
Klangskript is still before market launch. For questions about the project, the architecture or future collaboration, you can reach us by email.
Get in touch