Why My Mac Mini M4 Outperforms Dual RTX 3090s for LLM Inference
I built a dual RTX 3090 server for local LLM inference. A Mac Mini M4 turned out to be 27% faster and 22× more efficient. Here's why memory bandwidth beats raw GPU power.
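The intuition in one line of math (a back-of-envelope framing of the claim, not the post's actual benchmark numbers): single-stream decoding has to stream essentially all model weights through memory for every generated token, so throughput is capped by bandwidth rather than FLOPs:

$$\text{tokens/s} \;\lesssim\; \frac{\text{memory bandwidth}}{\text{bytes of weights read per token}} \;\approx\; \frac{\text{bandwidth (GB/s)}}{\text{quantized model size (GB)}}$$

For example, a ~4 GB 4-bit quantized model on a ~120 GB/s unified-memory bus tops out around 30 tokens/s, no matter how much raw compute sits behind it (illustrative figures, not measurements from the post).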
Fine-tuning open-source AI models for deep domain expertise in enterprise networking
SamanthAI, my artificial intelligence learning lab: all about LLMs and personal assistants
In this blog post, we'll dive into setting up a powerful AI development environment using Docker Compose. The setup includes running the Ollama language model server and its corresponding web interface, Open-WebUI, both containerized for ease of use.
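As a taste of what the post covers, here is a minimal docker-compose.yml sketch (port mappings and the volume name are illustrative assumptions; the post's actual file may differ):

```yaml
services:
  ollama:
    image: ollama/ollama:latest        # Ollama API server, listens on 11434
    volumes:
      - ollama-data:/root/.ollama      # persist downloaded models across restarts
    ports:
      - "11434:11434"

  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    environment:
      - OLLAMA_BASE_URL=http://ollama:11434   # reach Ollama over the compose network
    ports:
      - "3000:8080"                    # UI served at http://localhost:3000
    depends_on:
      - ollama

volumes:
  ollama-data:
```

Run `docker compose up -d` and open http://localhost:3000; GPU passthrough (e.g. the NVIDIA `deploy.resources` stanza) is left out here for brevity.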
Language processing has come a long way, thanks to the rise of large language models (LLMs). However, leveraging these advanced technologies often requires significant computational resources or reliance on cloud services.
In this post, we'll demonstrate how automation can make a complex tool like Oobabooga accessible to a wider audience by providing an auto-install script. Let's do this!
Let's build an AI character that will act like Samantha in the movie HER...
Exploring TTS options with an AI further; the result is mesmerising
Text-to-speech with an AI: what could go wrong?
AI, LLMs, models... a very interesting topic. I'm exploring, and the more I learn, the more I want to go further... Am I an AI? 100% flesh and blood here :)