Morgann
Riu

Infrastructure & security engineer.
I build, I break, I document.

Cybersecurity, Linux & DevOps — my best articles, straight to your inbox. No spam.

Professional photo of Morgann Riu, developer and cybersecurity expert, smiling and confident
66+ Tutorials
79+ Articles
13+ Years of experience
5 Training courses
8+ CyberScan scans

About me

Discover my background and my passion for technology

Expert in cybersecurity, Linux administration and full stack development. I design secure infrastructures and share my knowledge through free tutorials and training courses.

13+
Years of experience
10+
Projects delivered
15+
Technologies mastered
Discover my journey

Work

Projects shipped to production, not demos — application security, local AI and automation

CyberScan SaaS

Multi-module web vulnerability scanner built in Clean Architecture PHP. 35 active scanners, AI-assisted analysis via the NVIDIA Build API, exports to HackerOne / Bugcrowd / Intigriti.

  • 35 active security scanners
  • Clean Architecture & SOLID
  • Multi-platform bug bounty reports
PHP 8.3 Clean Architecture SQLite NVIDIA API

Local AI platform

Self-hosted AI infrastructure: Qwen3-Coder 30B LLM inference on CPU at 15.9 tokens/s, Kokoro text-to-speech, Whisper transcription and Phoenix observability on Proxmox.

  • 30B model at 15.9 tokens/s with no GPU
  • Built-in text-to-speech and transcription
  • Observability and power monitoring
ik_llama.cpp Qwen3-Coder Whisper Proxmox

Autonomous development agent

Iterative code-generation agent validated end to end at 74s per iteration, with triple fallback: NVIDIA API, local model, then degraded mode.

  • Autonomous generation loop
  • 74s per iteration, validated
  • Triple fallback for resilience
Python NVIDIA API LiteLLM Bash

My Services

Tailored solutions to automate and digitize your business

AI automation for businesses

Automate your business workflows with artificial intelligence. LLM integration, autonomous agents, document processing, internal chatbots and smart data pipelines.

  • Custom conversational AI agents
  • Automation of repetitive tasks
  • Claude, OpenAI and Mistral API integration
  • Document processing and analysis
  • Smart workflows and orchestration
Python Node.js Claude API LangChain RAG

Micro-SaaS development

Design and development of lightweight, focused SaaS applications. A simple tool that solves one precise problem, with monthly subscription and recurring revenue.

  • Dashboards and monitoring tools
  • API wrappers and third-party integrations
  • Tailored management platforms
  • Stripe payment systems
  • Hosting and deployment included
PHP JavaScript PostgreSQL Docker Stripe

Services for Schools & High Schools

Digital support for schools in Perpignan and the surrounding area. Technical support, network infrastructure, security and IT fleet management.

  • School network audit and hardening
  • IT fleet deployment and management
  • Introduction to coding and programming
  • Custom software development for schools
Cybersécurité Réseau Linux Pédagogie Perpignan

AI guidance

Building an AI PC, renting a GPU, choosing your tools or setting up your own private assistant: independent, no-nonsense guides from a practitioner who runs this stack every day.

Explore the guides

Training catalog

Programs, bundles and formats available in Linux cybersecurity and DevOps.

View training courses

Latest blog posts

Cybersecurity news, Linux tips and field-tested lessons learned

AI

GGUF Quantization: Q4_K_M, Q5_K_M, Q6_K or Q8_0 — How to Choose Without Wrecking Quality

The practical guide to picking your GGUF quant in 2026: bits per weight, perplexity impact, imatrix, and a VRAM/quality table. A Llama 3.1 8B drops from 32 GB in F32 to 4.9 GB in Q4_K_M.

AI

Local RAG with Ollama: an assistant that reads YOUR documents, 100% offline

Build a privacy-first RAG assistant on your own documents: embeddings, vector DB, chunking and a local LLM. With qwen3-embedding hitting 70.58 on multilingual MTEB, local finally rivals commercial APIs.

AI

Local LLM Runtimes in 2026: llama.cpp, Ollama, vLLM, LM Studio, TGI, Which One to Pick?

An honest comparison of local LLM inference engines in 2026: vLLM hits ~793 tok/s under concurrent load versus ~41 for Ollama, yet at a single user the gap drops below 10%. When to use each.

Get in Touch

A question or a project to discuss? Send me a message.

Contact form

For any question or collaboration, send me a message through the contact form.

Contact me

Or via LinkedIn