Data Protection lawyers with 50+ years of experience

/insights

Updated Thursday, February 27, 2025

Private On-Premise AI Cloud: A Step-by-Step Guide

Organizations face the challenge of using AI without losing control over their data and data sovereignty. In this article, you will learn how you as a company or even as an individual can set up and operate your own AI cloud.

Steffen Groß

Partner (Attorney-at-law)

What is a private AI cloud?

Why a private AI cloud?

Our journey to our own AI cloud

Technical setup

Conclusion and outlook

Interested in a customized AI solution?

Get assistance from our lawyers

Data Protection compliance can be complicated. Let our experienced team simplify it for you.

Free initial consultation

Data sovereignty is more than just a buzzword. If data is the fuel of the 21st century, then control over cloud infrastructure is the engine that drives it. Relying on third-party providers increases the risk of losing control over your own data—whether due to regulatory uncertainties, technical changes imposed by the provider, or potential outages. For businesses, this can have not only legal and security implications but also direct commercial risks, ranging from rising costs to the loss of strategic independence.

What is a private AI cloud?

According to our understanding, a private AI cloud is a cloud infrastructure where only you as the key holder have access to the data. This means that there is no third party standing between you and your data that can prevent or control access. Neither authorities nor third parties can access your data unless they can crack 256 AES encryption. At the same time, you have access to your data and AI services from anywhere via the internet through a publicly accessible website.

Unlike public cloud services such as ChatGPT, Google Cloud, or Microsoft Azure, where your data is stored on third-party servers and providers potentially have access, with a private AI cloud, you retain complete control over your data and the AI models used.

Why a private AI cloud?

Advantages of a private AI cloud

The main advantages of a private AI cloud are:

Full data sovereignty: You retain complete control over your data.
Data protection and data security: Sensitive information does not leave your infrastructure.
Compliance: Easier compliance with data protection regulations such as the GDPR.
Independence: No dependence on external providers and their price changes.
Customizability: Tailored solutions for your specific requirements.

Use cases

In Simpliant's early days, we invested in the best speech recognition model on the market. The license cost was around €1,400 plus a microphone with high-end recording capabilities for about €350 per workstation. This investment paid off because it significantly accelerated our writing of audit reports, memos, and legal analyses.

When ChatGPT came along, especially the ChatGPT phone app, I was amazed at the superior quality of the speech recognition feature. Using my iPhone's regular microphone, it was far more accurate and faster than the expensive solution I had used before.

However, for our use case, it was not a practical day-to-day work tool, as we needed full control over our data, especially sensitive personal and customer information.

The private AI cloud can be used for various use cases:

Cloud Storage: Similar to Dropbox or OneDrive, but without access by third parties.
Speech recognition and transcription: Converting speech to text for meetings, interviews, or dictations.
Text generation and analysis: Creating reports, summaries, or analyses.
Document processing: Extracting information from documents.
Local AI assistants: Chatbots or assistants tailored to your specific needs.

Our journey to our own AI cloud

In the early days of Simpliant, we invested in the best speech recognition model on the market. The licensing costs were around €1,400, plus a high-end microphone with premium recording capabilities for about €350 per workstation. This investment paid off as it significantly accelerated our process of drafting audit reports, memos, and legal analyses.

When ChatGPT emerged, particularly the ChatGPT app for mobile devices, we were impressed by the superior quality of its speech recognition feature. Even with a standard iPhone microphone, it was far more accurate and faster than the expensive solution we had previously used.

However, for our use case, it was not a practical daily work tool, as we required full control over our data.

Technical setup

In our setup, the data is located on a computer in our office. This is, of course, just one possibility. You could also place your computer with a hosting provider (co-hosting) or operate a virtual computer on a hosting platform.

Hardware requirements

To run AI applications on your computer with sufficient performance, you need:

Powerful graphics card: A GPU with enough VRAM, such as an NVIDIA GeForce RTX 4070 or better.
Sufficient RAM: At least 16 GB, ideally 32 GB or more.
Fast processor: A modern multi-core processor.
Sufficient storage space: SSD storage for fast access to data and models.

Software components

Our solution is based on the following software components:

Operating system: Linux Ubuntu Server
AI models:
- Whisper for speech recognition
- Llama 8B and Qwen 2.5 as local LLMs
Container technology: Docker for easy deployment and scaling.
Backend: Flask app for the API interface.
Authentication: Custom authentication solution for secure access.

Access and encryption

In our setup, data access works via a Virtual Private Server (VPS) with a third-party provider. This provides the so-called access layer and is accessible via a public IP or a public website over the internet.

The connection between the VPS and the local computer is established via a Virtual Private Network (VPN). This means that an encrypted connection is established between the VPS and the local computer or server. We use WireGuard for this.

Security measures

You should harden your VPS with basic security measures:

Installation of a firewall
Installation of Fail2Ban
Changing your SSH port
Deactivation of password access
Changing the root user

Additionally, you can protect your domain by using third-party services such as Cloudflare to protect your domain against DDoS attacks. However, we advise against using the Cloudflare Tunnel, as Cloudflare would then have access to your data.

Your access layer should be protected by a service such as Authentik. You should ensure that access is only possible with two-factor authentication. Additionally, you can block IP addresses that do not belong to your country region.

Network isolation

Ideally, your private AI cloud should run isolated from your other devices in a Virtual LAN. This can be done through a physical device with WLAN functionality, such as a router or a switch.

Implementation steps

Prepare hardware

Ensure that your computer meets the necessary hardware requirements, especially a powerful GPU for AI workloads.

Install operating system

Install a Linux-based operating system such as Ubuntu Server for maximum control and security.

Set up AI models

Install the desired AI models such as Whisper for speech recognition and Llama or Qwen for text generation.

Configure network

Set up a VPS and configure WireGuard for a secure VPN connection between your local server and the VPS.

Implement security measures

Harden your VPS and your local server with the security measures mentioned above.

Develop user interface

Develop a user-friendly interface for accessing your AI services, either as a web app or as a native application.

Implement backup strategy

Set up regular backups to avoid data loss. Ideally, you can create virtual machines using free services such as Proxmox, on which your services run. You can mirror, copy, and create backup routines for these virtual machines.

Advantages in detail

Full data sovereignty: You have complete control over your data and do not need to enter into contracts with third parties, apart from the data processing agreement with your VPS hosting provider.
Data protection and data security: Your sensitive data does not leave your infrastructure, which is particularly important for companies working with confidential information.
Compliance: Easier compliance with data protection regulations such as the GDPR, as you know exactly where your data is stored and who has access to it.
Independence: No dependence on external providers and their price changes or service discontinuations.
Customizability: You can adapt the AI models and services exactly to your needs.

Challenges and limitations

Technical complexity: Setting up and maintaining a private AI cloud requires technical expertise.
Initial costs: The acquisition of the necessary hardware and the setup can be initially costly.
Maintenance effort: Your private AI cloud should ideally have redundancies, i.e., you should ideally have two computers at two different locations with different internet connections and various access points via VPS.
Power consumption: Powerful hardware for AI workloads consumes correspondingly high amounts of electricity.
Limited computing power: Compared to large cloud providers, the available computing power is limited.

Conclusion and outlook

Gathering data and strategically planning AI implementation in your organization will be crucial to gaining a competitive advantage in the years to come. In particular, we believe that it is not only the capacities of the large language models that matter as a foundation, but the data that you generate and intelligently integrate based on your specific business model.

To maintain a high level of data security, intellectual property protection, data protection, and compliance while keeping costs low and not relying on big tech monopolists, this could be the most crucial time for your company to invest in AI.

We use our private AI cloud daily to transcribe texts and use AI in a secure and legally compliant way.

Interested in a customized AI solution?

Together with our technology partners, we offer turnkey on-premises AI cloud solutions tailored precisely to your requirements—whether in your own data center or a public cloud environment.

As a law firm and consultancy, we ensure that all regulatory requirements, particularly GDPR compliance, are fully met. Contact us for a non-binding consultation!

Legal advice

Simpliant Legal - Wittig, Bressner, Groß Rechtsanwälte Partnerschaftsgesellschaft mbB

Consulting

Simpliant GmbH

Technology

Simpliant Technologies GmbH

+49 (0) 30 75438990

info@simpliant.eu

Fasanenstr. 12, 10623 Berlin

Why Simpliant?

Insights

Trainings

Careers

Data protection

We will support you in implementing all data protection requirements with the GDPR.

DPO

DPMS

Data protection consulting

Data protection audits

Information security

We support you in setting up a holistic ISMS such as ISO 27001.

ISO

ISMS

ISO 27001 audits

C5 attestation

Artificial intelligence

We advise you on the integration of AI and develop legally compliant usage concepts.

Business models with AI

Data protection compliant use

ChatGPT and GPT-API

AI system classification tool

Legal notice

Privacy statement