Thox.ai Edge Device
The most powerful edge AI device for professionals. Run any Ollama-compatible model locally with blazing-fast inference. For healthcare, legal, research, development, and beyond.
Available Colors
Midnight Black
Arctic White
Space Gray
Technical Specifications
Enterprise-grade hardware engineered for any AI workload. Built for professionals who demand performance and privacy.
Dimensions
- Height: 6 inches (152.4 mm)
- Width: 4 inches (101.6 mm)
- Depth: 1.2 inches (30.5 mm)
- Weight: 450g (0.99 lbs)
Compute
- CPU: 8-core Arm Cortex-A78AE @ 2.84 GHz
- GPU: NVIDIA Ampere architecture (Jetson Orin NX 16GB module)
- NPU: 100 TOPS AI accelerator
- RAM: 16GB LPDDR5 @ 6400 MT/s
- Storage: 2TB NVMe SSD
AI Performance
- INT8 Inference: 100 TOPS
- FP16 Performance: 25 TFLOPS
- 7B Model (Ollama): 45-72 tokens/s
- 14B Model (TensorRT-LLM): 45-56 tokens/s (+60% vs. Ollama)
- 32B Model (TensorRT-LLM): 20-24 tokens/s (+100% vs. Ollama)
- Max Context: 128K tokens
Connectivity
- Ethernet: 2.5 Gigabit
- Wi-Fi: Wi-Fi 6E (802.11ax)
- Bluetooth: 5.3 LE
- USB: 2x USB-C 3.2, 1x USB-A 3.0
- HDMI: HDMI 2.1 (4K @ 60 Hz)
Power
- Input: 12V DC / USB-C PD 65W
- Typical Load: 25W
- Max Power: 45W
- Idle Power: 5W
MagStack™ Clustering
- Stacking Interface: Magnetic alignment (8x N52 magnets)
- NFC Discovery: ST25DV64K (30mm range)
- Data Connection: 12-pin pogo connector (10 Gbps USB 3.2)
- Power Passthrough: USB-PD up to 100W
- Alignment Accuracy: ±0.5mm, self-centering
- Cluster Formation: ~10 seconds, automatic
- Max Stack Height: 8 devices
- Cluster Interconnect: Wi-Fi 6E / 2.5GbE
- Auto-Discovery: mDNS + NFC handshake
- Combined RAM (8x): Up to 128GB
- Combined Compute (8x): Up to 800 TOPS
MagStack™ Cluster Configurations
Stack multiple devices to combine RAM and compute power. Run larger AI models than ever before.
| Devices | Combined RAM | Total Compute | Max Model Size | Performance |
|---|---|---|---|---|
| 1x | 16GB | 100 TOPS | 32B | 20-72 tok/s |
| 2x | 32GB | 200 TOPS | 70B | 25-45 tok/s |
| 4x | 64GB | 400 TOPS | 100B+ | 15-30 tok/s |
| 8x | 128GB | 800 TOPS | 200B+ | 10-20 tok/s |
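As a rough cross-check of the cluster sizing above, a quantized model's memory footprint can be estimated from its parameter count. This sketch assumes 4-bit quantization (~0.5 bytes/parameter), ~20% overhead for KV cache and activations, and ~14GB usable RAM per 16GB device; these are illustrative assumptions, not vendor figures, so the results are a ballpark and may differ from the table above.

```python
import math

# Assumed headroom per 16GB device after OS and runtime overhead.
USABLE_GB_PER_DEVICE = 14.0

def devices_needed(params_billions: float, bytes_per_param: float = 0.5,
                   overhead: float = 1.2) -> int:
    """Estimate how many stacked devices a quantized model needs.

    bytes_per_param ~0.5 corresponds to 4-bit quantization; overhead
    covers KV cache and activations. Rounds up to whole devices.
    """
    model_gb = params_billions * bytes_per_param * overhead
    return max(1, math.ceil(model_gb / USABLE_GB_PER_DEVICE))
```

For example, a 7B model fits on a single device, while a 405B model at 4-bit needs well over a dozen devices under these assumptions.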
How MagStack™ Works
1. Approach: NFC antennas detect proximity at 30mm and initiate a handshake.
2. Align & Connect: N52 magnets self-align and the pogo pins establish a 10 Gbps data link.
3. Form Cluster: a leader is elected in ~10 seconds and models are auto-partitioned.
4. Run Models: pipeline parallelism splits model layers across devices over the 10 Gbps link.
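The auto-partitioning step can be sketched as follows: for pipeline parallelism, the model's transformer layers are split into contiguous, near-equal blocks, one per device in the stack. This is a minimal illustration of the general technique, not Thox OS's actual partitioner.

```python
# Sketch: assign each device a contiguous block of layer indices,
# balancing block sizes to within one layer.
def partition_layers(n_layers: int, n_devices: int) -> list[range]:
    """Return one contiguous range of layer indices per device."""
    base, extra = divmod(n_layers, n_devices)
    ranges, start = [], 0
    for d in range(n_devices):
        size = base + (1 if d < extra else 0)  # spread the remainder
        ranges.append(range(start, start + size))
        start += size
    return ranges

# e.g. an 80-layer model on a 3-device stack → blocks of 27 / 27 / 26
```

Each device then runs only its own block, streaming activations to the next device over the 10 Gbps link.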
Thox.ai™, Thox OS™, and MagStack™ are trademarks of Thox.ai LLC. MagStack magnetic stacking technology is Patent Pending.
Cluster AI Models
Models designed for distributed inference across MagStack™ clusters. Available on Ollama.
thox-cluster-nano
RECOMMENDED: Our recommended cluster model, featuring a 1 million token context window based on NVIDIA Nemotron-3-Nano. Process entire codebases in a single context, with no chunking or summarization needed.
Cluster Nano
Long-context model with 1 million token window for processing entire documents, datasets, and complex analyses. MoE architecture with 128 experts.
- Parameters: 30B
- Context: 1M tokens
- Min Devices: 2x
- Speed: 80-120 tok/s
- Base Model: Nemotron-3-Nano
Cluster Code
Elite software engineering model with performance competitive with GPT-4o. Supports 92 programming languages with repository-level analysis, code generation, debugging, and collaborative code review.
- Parameters: 32B
- Context: 128K tokens
- Min Devices: 4x
- Speed: 100-150 tok/s
- Base Model: Qwen2.5-Coder
Cluster Swift
Speed-optimized model for high-volume, real-time applications. Handles 30-50+ concurrent users with <100ms latency. Ideal for customer support, call centers, and interactive applications.
- Parameters: 8B
- Context: 32K tokens
- Min Devices: 2x
- Speed: 50+ tok/s
- Base Model: Ministral-3
Cluster Deep
Frontier reasoning model with state-of-the-art capabilities. Largest openly available model for research institutions, strategic consulting, financial modeling, legal research, and complex quantitative analysis.
- Parameters: 405B
- Context: 128K tokens
- Min Devices: 12x
- Speed: 120-180 tok/s
- Base Model: Llama 3.1
Cluster Secure
Government/defense-grade model with maximum security. Supports UNCLASSIFIED through SECRET workloads with N+2 redundancy, air-gap deployment, ITAR compliance, and FedRAMP High authorization.
- Parameters: 72B
- Context: 128K tokens
- Min Devices: 6x
- Speed: 60-90 tok/s
- Base Model: Qwen2.5
Cluster Scout
Professional multimodal model with vision capabilities and industry-leading 10M token context. Native image understanding for healthcare, legal, and finance.
- Parameters: 109B
- Context: 10M tokens
- Min Devices: 4x
- Speed: 60-90 tok/s
- Base Model: Llama 4 Scout
Cluster Maverick
Enterprise flagship model with frontier multimodal intelligence. For Fortune 500, hospitals, universities, and government.
- Parameters: 400B
- Context: 1M tokens
- Min Devices: 12x
- Speed: 30-50 tok/s
- Base Model: Llama 4 Maverick
Cluster 70B
Enterprise-grade model for complex reasoning, analysis, and professional workflows.
- Parameters: 72B
- Context: 64K tokens
- Min Devices: 2x
- Speed: 25-45 tok/s
- Base Model: Qwen 3
Cluster 100B
Expert-level model for enterprise, research, healthcare, and legal workloads.
- Parameters: 110B
- Context: 96K tokens
- Min Devices: 4x
- Speed: 15-30 tok/s
- Base Model: Qwen 3
Cluster 200B
Frontier-class model matching cloud AI capabilities for any industry application.
- Parameters: 405B
- Context: 128K tokens
- Min Devices: 8x
- Speed: 10-20 tok/s
- Base Model: Llama 3.3
Which Model Should I Use?
| Use Case | Recommended Model | Why |
|---|---|---|
| Large document analysis | thox-cluster-nano | 1M context for full documents and datasets |
| Research & complex reasoning | thox-cluster-70b | 72B params for advanced analysis |
| Healthcare, legal, enterprise | thox-cluster-100b | Expert-level professional workloads |
| Frontier-class AI tasks | thox-cluster-200b | Matches cloud AI capabilities locally |
Latest Compatible Models
The newest Ollama models from 2024-2025, optimized for Thox.ai devices. Vision-enabled, multilingual, and professional-grade.
View the complete model catalog and compatibility guide.
Ministral-3 8B
Vision, 32+ languages, edge AI
- Speed: 40-60 tokens/s
- Backend: Ollama
- Context: 256K tokens
Llama 4 Scout
Frontier multimodal, 12 languages
- Speed: 35-50 tokens/s
- Backend: Hybrid
- Context: 10M tokens
- Min Devices: 2x
Qwen 3 14B
Advanced reasoning, vision
- Speed: 30-45 tokens/s
- Backend: TensorRT-LLM
- Context: 128K tokens
Phi-4 Mini (3.8B)
Ultra-fast, multilingual, tools
- Speed: 70-95 tokens/s
- Backend: Ollama
- Context: 128K tokens
Qwen 2.5 Coder 14B
Code specialist, reasoning
- Speed: 28-42 tokens/s
- Backend: TensorRT-LLM
- Context: 128K tokens
Gemma 3 8B
Vision, single GPU optimized
- Speed: 38-55 tokens/s
- Backend: Ollama
- Context: 128K tokens
Latest 2024-2025 models with vision, multilingual (32+ languages), and thinking capabilities. Hybrid Ollama + TensorRT-LLM inference delivers 60-100% faster performance on 14B+ models. Compatible with 100+ Ollama models.
What's in the Box
Everything you need to get started.
- Thox.ai Edge Device
- 65W GaN USB-C Power Adapter
- USB-C to USB-C Cable (1m)
- Quick Start Guide
- Ethernet Cable (CAT6, 1m)
- Mounting Bracket Kit
- Thermal Pad Set
Powered by Thox OS™
A custom operating system purpose-built for AI inference at the edge.
TensorRT-LLM Acceleration
60-100% faster inference on 14B+ models via TensorRT-LLM
Hybrid Smart Routing
Auto-routes to optimal backend: Ollama or TensorRT-LLM
Native Jetson Execution
Runs directly on device with JetPack 6.x integration
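The smart-routing idea can be sketched as a simple rule: since TensorRT-LLM delivers its 60-100% gains on 14B+ models, route larger models there and smaller ones to Ollama. The backend names and the exact 14B threshold in this sketch are illustrative assumptions, not the actual Thox OS router logic.

```python
# Sketch: choose an inference backend from a model's parameter count.
# The 14B cutoff mirrors the "faster on 14B+ models" claim; the real
# router may also weigh quantization, context length, and load.
def pick_backend(params_billions: float) -> str:
    """Return the backend name to route a request to."""
    return "tensorrt-llm" if params_billions >= 14 else "ollama"
```

For example, a 7B model would be served by Ollama, while a 32B model would be routed to TensorRT-LLM.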
Hybrid AI Runtime
- Ollama + TensorRT-LLM backends
- Thox.ai Coder models (7B/14B/32B)
- Smart router with auto-backend
- OpenAI-compatible API
- 60-100% faster on 14B+ models
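Because the runtime exposes an OpenAI-compatible API, any OpenAI client can talk to the device by pointing at its local endpoint. The hostname, port, and path below are assumptions (port 11434 with a `/v1` prefix is Ollama's default OpenAI-compatible endpoint); check your device's dashboard for the real address.

```python
import json

# Assumed local endpoint; replace with your device's actual address.
API_URL = "http://thox.local:11434/v1/chat/completions"

def chat_request(model: str, prompt: str) -> str:
    """Build the JSON body for an OpenAI-style chat completion call."""
    return json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    })
```

The returned body can be POSTed to `API_URL` with any HTTP client, or you can skip the manual request entirely and configure an OpenAI SDK with the device URL as its `base_url`.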
Ready for Any Workflow
- Intuitive web dashboard
- API access for any application
- CLI tools for power users
- Automatic updates (OTA)
Thox OS™ is a trademark of Thox.ai LLC. All rights reserved.
Frequently Asked Questions
Got questions? We've got answers.