giacolees - Tech Blog

giacolees - Tech Bloghttps://giacolees.github.io/Recent content on giacolees - Tech BlogHugoenThu, 19 Mar 2026 22:58:56 +0100Tokenizers are easy!https://giacolees.github.io/posts/tokenizers/Thu, 19 Mar 2026 22:58:56 +0100https://giacolees.github.io/posts/tokenizers/TL;DR Your LLM has never read a single word. It reads tokens — and the way text gets chopped up matters more than you'd think. Splitting on spaces explodes the vocabulary and chokes on anything outside English. The fix? Byte-Pair Encoding: start from raw bytes, greedily merge the most frequent pairs, repeat. Simple idea, nasty bottleneck — the naive version scans every word on every merge, costing O(V × M).Hardware-Aware Programming for Dummies!https://giacolees.github.io/posts/hardware-aware-programming-for-dummies/Sat, 14 Mar 2026 19:30:19 +0100https://giacolees.github.io/posts/hardware-aware-programming-for-dummies/TL;DR Hardware-aware programming requires matching your computational task to the right processor architecture while aggressively minimizing data movement bottlenecks. While CPUs use large caches and complex logic to minimize latency for sequential tasks, GPUs use massive parallel arrays to maximize throughput for parallel workloads. However, the ultimate performance killer is data movement latency across the PCIe bus between the CPU and GPU; for small workloads, this transfer time completely eclipses the actual compute speed.Abouthttps://giacolees.github.io/about/Mon, 01 Jan 0001 00:00:00 +0000https://giacolees.github.io/about/I’m Giacomo — an AI Research Engineer who somehow convinced some universities that he deserved degrees, then used them to make computers look at things and get confused more than before. I’m what people charitably call a T-shaped engineer. The vertical bar goes deep into computer vision, sensor fusion, and autonomous driving perception — enough to know which of my own opinions are wrong. The horizontal bar stretches across LLMs, agentic AI, and the workflows that try to make all of it actually useful in production.Open Source Projectshttps://giacolees.github.io/projects/Mon, 01 Jan 0001 00:00:00 +0000https://giacolees.github.io/projects/ obsidian-math-convert ↗ Obsidian plugin that converts photos or screenshots of equations to LaTeX locally — no cloud, no subscription, runs fully offline via WebAssembly. JavaScript · Obsidian · WebAssembly