Question 1

What's included in a node?

Accepted Answer

Everything your team needs to run AI in production — hardware-native model builds, the intelligent runtime that handles routing, caching, and scaling, plus monitoring, access controls, and integrations. One node, one price, nothing extra.

Question 2

Why open-weight models instead of proprietary ones?

Accepted Answer

Open-weight models trail proprietary ones by roughly three months on average — and for some use cases they even exceed them. For the vast majority of enterprises, that gap is irrelevant. What you gain is full auditability, no API dependency, no vendor roadmap risk, and the ability to run on your own infrastructure with no data leaving your perimeter. Proprietary models require internet connectivity and route your data through third-party servers. Open-weight models don't. That's why every model on DiscreteStack is open-weight.

Question 3

What does "built in Europe" mean for my deployment?

Accepted Answer

DiscreteStack is incorporated and operated in the EU. That means no exposure to the US CLOUD Act — jurisdiction follows the company, not the data centre. Your deployment runs on infrastructure you control, in a jurisdiction you choose. There is no US-headquartered parent company that can be compelled to hand over data. For enterprises subject to GDPR or the EU AI Act, this is a structural guarantee, not a contractual promise.

Question 4

Can DiscreteStack run air-gapped with no internet connection?

Accepted Answer

Yes. DiscreteStack runs fully air-gapped — no internet connectivity required during operation. Models, inference runtime, scheduling, and the enterprise layer all run locally on your hardware. Updates are delivered through secure offline packages. No telemetry, no external API calls, no cloud dependencies. This is the deployment model used by organisations in defence, financial services, and critical infrastructure where data cannot leave the physical perimeter under any circumstances.

Question 5

What hardware do I need?

Accepted Answer

We support the full range of enterprise NVIDIA GPU architectures: Ampere (A100), Hopper (H100/200), Blackwell (B100, B200, B300, RTX 5000/6000 Pro). We'll design the right configuration based on your team size, model requirements, and workload profile.

Question 6

Can you provide the hardware as well?

Accepted Answer

Yes. We can deliver a fully configured node — hardware and software — ready to deploy in your server room or data center. We also offer a hardware lease option, so you can get started without upfront capital expenditure.

Question 7

Can I start with one node and scale later?

Accepted Answer

Yes. You can expand capacity horizontally (adding more machines) or vertically (replacing with more capable ones). Our GPUe model covers both.

Question 8

How does flat-rate licensing work compared to per-token pricing?

Accepted Answer

With DiscreteStack, you pay a flat-rate license per execution node per year. No token metering, no usage-based billing, no surprise invoices. Your costs are fixed regardless of how much your teams use AI. Per-token pricing works differently — every prompt, every response, every agent action is a billing event. As adoption grows, so does the bill. With a flat-rate license, growing adoption means lower cost per task, not a higher invoice.

Question 9

How does this compare to what we're spending on OpenAI/Anthropic today?

Accepted Answer

For a team of 50 power users, hyperscaler API costs typically run €150K–€250K per year depending on usage. One DiscreteStack node covers that same team for €50K per year, plus approximately €30K for a hardware lease. Run your own numbers in our cost comparison calculator.

Question 10

What models do you run?

Accepted Answer

We run the most powerful open models available — hundreds of billions to trillions of parameters. Models like Kimi, GLM, and Mistral among others in their most capable forms. Each one is compiled specifically for designated hardware, so you get maximum performance without managing model ops yourself.

Question 11

Who handles updates and maintenance?

Accepted Answer

We do. New models and updates on existing ones are evaluated and delivered quarterly. Runtime patches and security fixes are included in the license. Your team focuses on using AI, not operating it.

Question 12

How fast can we actually be live?

Accepted Answer

24 hours for full platform access (in shared environment). On-premise with existing hardware, typically within a week. If we're sourcing the hardware, expect 4–8 weeks depending on configuration and availability.

Question 13

Do you support our compliance requirements (SOC 2, GDPR, etc.)?

Accepted Answer

The platform runs entirely on your infrastructure — your data never leaves your environment. That simplifies most compliance requirements by design.

Feature	DiscreteStack	OpenAI / Anthropic	DIY	Hyperscalers (Azure/AWS)
Model Intelligence	Frontier − 3 months	Baseline	Mixed	Baseline
Cross-system Integration		Vendor specific	Partially	Vendor specific
Predictability	Yearly Contract	Vendor Roadmap	Self-managed	Vendor Roadmap
Operational Complexity	Low	Mid	High	Mid
US CLOUD Act exposure	None (EU-incorporated)	Subject	None	Subject
EU AI Act readiness	Full (August 2026 ready)	Partial	Self-managed	Partial
Cost model	Flat-rate license	Per-token	CapEx + engineering team	Per-token

Private AI infrastructure for your business

What European AI means in practice

Transparency

Control

Predictability

AI for every team — ready on day 0

AI for Software Engineering

AI for Finance & Operations

AI for Executive Teams

Single-server AI

Your GPUs don't stop when your developers go home.

From Model to Metal

Hardware-Native Builds

Intelligent Execution Runtime

Air-gapped. Auditable. Yours.

Data Sovereignty

Governance & Audit

EU AI Act Ready

DiscreteStack vs Cloud AI — Why Infrastructure Ownership Wins

Simple. Predictable. No surprises.

Insights & updates

How open models intelligence fuel sovereign AI in 2026

The True Cost of Enterprise AI: Why Token Metering is Killing Your ROI