Jovan Chan - DEV Community

Jovan Chan

Jun 29

Ollama 'llama runner process has terminated'? Read the Exit Code, Then Fix It (2026)

#ollama #troubleshooting #localllm #cuda

6 min read

Jovan Chan

Jun 29

NVIDIA Skipping New Consumer GPUs in 2026: What the GDDR7 Shortage Means for Your Home Lab Budget

#gpu #nvidia #rtx3090 #rtx4090

6 min read

Jovan Chan

Jun 29

NVIDIA Nemotron 3 Ultra for Local AI in 2026: 550B/55B-Active MoE, 1M Context, NVFP4 — Which Consumer GPU Can Actually Run It

#nemotron #nvidia #localllm #moe

6 min read

Jovan Chan

Jun 28

NPU vs Discrete GPU for Local LLMs in 2026: Why Computex Laptops Lose on Tokens/Second Despite the TOPS Claims

#npu #gpu #localllm #rtx3090

6 min read

Jovan Chan

Jun 28

MOSS-TTS in ComfyUI 2026: Zero-Shot Voice Cloning From a 10-Second Clip on Your RTX or Mac

#comfyui #mosstts #voicecloning #tts

6 min read

Jovan Chan

Jun 28

MiniMax M3 Local AI Hardware Guide 2026: The 428B Open-Weight Model You (Probably) Can't Run at Home

#minimaxm3 #localllm #vram #moe

6 min read

Jovan Chan

Jun 27

LM Studio Locally + LM Link 2026: Control Your Home GPU Rig From Your iPhone

#lmstudio #lmlink #locally #iphone

6 min read

Jovan Chan

Jun 27

Kimi K2.7 Code for Local AI in 2026: VRAM Requirements, the 1T-Parameter Reality, and Which GPU Crosses Into Usable Speed

#kimik2 #localllm #moe #hardwareguide

6 min read

Jovan Chan

Jun 27

GLM 5.2 for Local AI in 2026: 744B MoE, MIT License, and Why It's Effectively Cloud-Only at Home

#glm #localllm #moe #vram

6 min read

Jovan Chan

Jun 27

MOSS-TTS 1.5 Review 2026: Apache Voice Cloning on 8GB

#tts #voicecloning #selfhosted #ai

6 min read

Jovan Chan

Jun 27

MiniMax M3 Review 2026: Open-Weight 1M-Context Frontier

#minimax #llm #moe #localllm

5 min read

Jovan Chan

Jun 27

GPTQ vs AWQ vs GGUF for vLLM 2026: Which 4-Bit Wins

#vllm #quantization #gptq #awq

5 min read

Jovan Chan

Jun 27

Agentjacking 2026: How a Fake Sentry Error Hijacks Cursor, Claude Code, and Cline — and the Settings That Cut Your Exposure

#security #cursor #claudecode #cline

5 min read

Jovan Chan

Jun 27

Goose AI Agent Review 2026: Apache 2.0, Any LLM, and the Best Free Local Coding Agent?

#goose #cline #aider #claudecode

5 min read

Jovan Chan

Jun 26

Gemma 4 QAT for Local AI in 2026: How Google's June 5 Checkpoints Put the 26B in 15GB

#gemma #google #qat #quantization

6 min read

Jovan Chan

Jun 26

EXO Framework in 2026: Can You Pool RTX 3090s to Beat a DGX Spark? The Honest Distributed-Inference Reality

#distributedinference #localllm #gpu #rtx3090

6 min read

Jovan Chan

Jun 26

DiffusionGemma 26B for Local AI in 2026: 18GB VRAM, 4 Faster Generation, and Which Consumer GPUs Actually Saturate the 1,000 tok/s Ceiling

#google #diffusiongemma #localllm #gpu

6 min read

Jovan Chan

Jun 26

Google Colab CLI: Run AI Agents on Cloud GPUs 2026

#googlecolab #gpu #aider #openinterpreter

6 min read

Jovan Chan

Jun 26

DeepSeek V4 Pro Review 2026: MIT 1.6T MoE for Self-Hosters

#deepseek #llm #moe #localllm

5 min read

Jovan Chan

Jun 26

Bonsai Image 4B Review 2026: 1-Bit Local Image Gen

#bonsaiimage #imagegeneration #flux #quantization

6 min read

Jovan Chan

Jun 26

Google Colab CLI review 2026: free and cheap GPUs for Claude Code, Codex, and Cursor agents — from your terminal

#claudecode #codex #localllm #setupguide

6 min read

Jovan Chan

Jun 26

GLM 5.2 as your Cursor and Cline backend in 2026: MIT-licensed open-weight coding model, the config that works, and the honest cost math

#glm #cursor #cline #continuedev

6 min read

Jovan Chan

Jun 26

GitHub Copilot Max $100/Month: Is the New Heavy-Use Tier Worth It vs Cursor Pro and Claude Code?

#githubcopilot #pricing #comparison #cursor

5 min read

Jovan Chan

Jun 25

Qualcomm's $10B Tenstorrent Bid: What RISC-V AI Cards Mean for Home Labs in 2026

#tenstorrent #riscv #qualcomm #aiaccelerator

6 min read

Jovan Chan

Jun 25

GMKtec EVO-X2 Review 2026: A Sub-$2,000 Mini PC That Runs 235B Models on Ryzen AI Max+ 395

#amd #ryzenaimax #strixhalo #minipc

6 min read

Jovan Chan

Jun 25

AMD Ryzen AI Halo vs NVIDIA DGX Spark 2026: Which 128GB AI Dev Kit Actually Pays Off

#amd #nvidia #ryzenaimax #dgxspark

6 min read

Jovan Chan

Jun 25

Qwen3.6-35B-A3B Local Setup 2026: Ollama and 24GB VRAM

#ollama #llm #coding #selfhosted

6 min read

Jovan Chan

Jun 25

Qwen3-Coder-Next Local Setup Guide 2026: Ollama and GGUF

#ollama #llm #coding #selfhosted

5 min read

Jovan Chan

Jun 25

OpenHands Review 2026: The 76K-Star Coding Agent

#openhands #codingagents #ai #opensource

5 min read

Jovan Chan

Jun 25

Gemma 4 QAT + Cline and Continue.dev in 2026: Which Quantized Coding Model Runs in 7GB, 15GB, or 18GB VRAM

#gemma #localllm #cline #continuedev

6 min read

Jovan Chan

Jun 25

Cursor 3.7 Canvas Design Mode: what the June 2026 visual UI update changes for frontend developers

#cursor #frontend #designmode #workflow

6 min read

Jovan Chan

Jun 25

Codestral 2 as your Cursor and Cline backend in 2026: Apache 2.0, $0.30/M tokens, 256K context, and whether it beats Gemini 3.5 Flash for daily coding

#mistral #cursor #cline #continuedev

5 min read

Jovan Chan

Jun 24

CUDA Out of Memory on Local AI? Every Fix That Works for Ollama, llama.cpp, ComfyUI, and vLLM (2026)

#cuda #gpu #localllm #troubleshooting

6 min read

Jovan Chan

Jun 24

Computex 2026 AI Hardware Reality Check: RTX Spark Laptops, NPU Desktops, and Whether the 'Agentic PC Era' Changes Your Home Lab Math

#computex2026 #localai #rtxspark #npu

6 min read

Jovan Chan

Jun 24

ComfyUI 'Torch not compiled with CUDA enabled'? Every Fix That Works on Windows, Linux, and Mac (2026)

#comfyui #cuda #pytorch #gpu

5 min read

Jovan Chan

Jun 24

Open-Source AI Security 2026: The OSSRA Wake-Up Call

#security #selfhosted #ollama #vllm

5 min read

Jovan Chan

Jun 24

Ollama MLX Backend Setup 2026: 2x Faster on Apple Silicon

#ollama #mlx #applesilicon #selfhosted

6 min read

Jovan Chan

Jun 24

Odysseus Review 2026: PewDiePie's Self-Hosted AI Workspace

#odysseus #selfhosted #aiworkspace #ollama

5 min read

Jovan Chan

Jun 24

Claude Fable 5 Is Now Credit-Only: What a Real Coding Session Costs After June 22

#claude #pricing #cursor #cline

6 min read

Jovan Chan

Jun 24

Claude Fable 5 for AI Coding in 2026: Tested as a Cursor and Cline Backend

#claude #review #cursor #cline

6 min read

Jovan Chan

Jun 24

AI Coding Tools on Windows AI PCs in 2026: Cursor, Claude Code, and Copilot on RTX Spark and Copilot+ Devices

#localllm #cursor #claudecode #githubcopilot

6 min read

Jovan Chan

Jun 23

ComfyUI Custom Node \"IMPORT FAILED\"? Read the Traceback, Then Fix It (2026)

#comfyui #troubleshooting #customnodes #python

5 min read

Jovan Chan

Jun 23

ComfyUI Black Image Output? Fix NaN Latents, VAE Precision, and the GTX 16-Series Trap (2026)

#comfyui #stablediffusion #troubleshooting #vae

6 min read

Jovan Chan

Jun 23

Codestral 2 for Local AI in 2026: Apache 2.0, 22B Params, 256K Context — Which GPU Runs It Best

#codestral #mistral #localllm #coding

6 min read

Jovan Chan

Jun 23

vLLM-ATOM Setup Guide 2026: AMD Instinct Native Backend

#vllm #amd #rocm #atom

5 min read

Jovan Chan

Jun 23

Zed Parallel Agents 2026: Open-Source Multi-Agent Editor

#zed #codingagents #ai #opensource

6 min read

Jovan Chan

Jun 23

ZAYA1-8B Review 2026: Apache 2.0 Reasoning MoE on AMD

#zaya1 #llm #reasoning #localllm

5 min read

Jovan Chan

Jun 23

WSL 3 for AI Coding on Windows 2026: GPU Passthrough, Claude Code, Aider, and Cline Without Dual-Booting

#wsl #localllm #ollama #claudecode

6 min read

Jovan Chan

Jun 23

MiMo Code Review 2026: Xiaomi's Open-Source Claude Code Challenger and Whether the 200-Step Benchmark Claims Hold Up

#mimo #claudecode #opencode #review

5 min read

Jovan Chan

Jun 23

Kimi K2.7 Code Review 2026: 1T Open-Weight Coding Model as a Cursor and Cline Backend

#kimi #localllm #cline #cursor

6 min read

Jovan Chan

Jun 16

Intel Arc B580 12GB for Local AI in 2026: Real Benchmarks and the CUDA-Free Reality

#gpu #localai #intelarc #llm

6 min read

Jovan Chan

Jun 15

FLUX.1 Kontext Dev for Local AI in 2026: Image Editing on Consumer GPUs Without the API Bills

#flux #comfyui #imageediting #localai

6 min read

Jovan Chan

Jun 15

WWDC 2026 Preview: Apple Foundation Models and Core AI — What On-Device AI Actually Means for Home Lab Builders

#apple #wwdc2026 #foundationmodels #ondeviceai

5 min read

Jovan Chan

Jun 15

Wan 2.1, 2.2, and 2.7 for Local AI Video Generation: Which GPU Can Actually Run It (2026 Guide)

#localai #gpu #videogeneration #wan

6 min read

Jovan Chan

Jun 14

AMD Ryzen AI Max+ 395 (Strix Halo) for Local LLMs in 2026: 128GB Unified Memory, 100 t/s on 30B Models, and Whether It Beats a Discrete GPU

#amd #ryzenaimax #strixhalo #localllm

6 min read

Jovan Chan

Jun 14

ROCm 7.2 on Ubuntu 24.04 for Local LLMs in 2026: Full Setup Guide for AMD GPUs

#amd #rocm #ubuntu #localllm

6 min read

Jovan Chan

Jun 14

Intel Arc B770 vs RTX 5060 for Local AI in 2026: The 16GB Budget War That Never Happened

#gpu #intelarc #rtx5060 #localai

6 min read

Jovan Chan

Jun 14

ComfyUI API Tutorial 2026: Automate Image Generation

#comfyui #api #python #stablediffusion

5 min read

Jovan Chan

Jun 14

AMD Lemonade Review 2026: GPU, NPU, and Multi-Modal

#amd #llm #selfhosted #npu

5 min read

Jovan Chan

Jun 13

DeepSeek V4 vs Qwen3 for Local AI in 2026: Which Model Family Fits Your GPU?

#localllm #deepseek #qwen3 #gpu

6 min read

Writing Debut