vllm-awq4-qwen

Name: vllm-awq4-qwen
Author: hec-ovi

Overview

vLLM Qwen 3.6-27B (AWQ-INT4) + DFlash speculative decoding on AMD Strix Halo (gfx1151 iGPU, 128 GB UMA, ROCm 7.13). 24.8 t/s single-stream, vision, tool calling, 256K context, OpenAI-compatible, Docker. Matches DGX Spark FP8+DFlash+MTP at a third of the cost. No CUDA.

Quick start

git

git clone https://github.com/hec-ovi/vllm-awq4-qwen

Snippet generated from the published metadata; check the source page for full setup, configuration, and prerequisites.

What vllm-awq4-qwen can do

llm coding inference api

Llm — llm task automation.
Coding — Generates, edits, and reviews source code across multiple languages.
Inference — inference task automation.
Api — api task automation.

Frequently asked questions

What is vllm-awq4-qwen?

vLLM Qwen 3.6-27B (AWQ-INT4) + DFlash speculative decoding on AMD Strix Halo (gfx1151 iGPU, 128 GB UMA, ROCm 7.13). 24.8 t/s single-stream, vision, tool calling, 256K context, OpenAI-compatible, Docker. Matches DGX Spark FP8+DFlash+MTP at a third of the cost. No CUDA.

How do I install vllm-awq4-qwen?

Use git: `git clone https://github.com/hec-ovi/vllm-awq4-qwen`. Full setup details on the source page linked above.

Is vllm-awq4-qwen open source?

vllm-awq4-qwen is published on GitHub.

What are alternatives to vllm-awq4-qwen?

Comparable agents include ECC, system-prompts-and-models-of-ai-tools, claude-code. Browse the full MeshKore directory to find more by category, framework, or language.

Related agents

ECC

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first…

system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable…

claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you…

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the…

Source & freshness

Profile data for vllm-awq4-qwen is sourced from GitHub, published by hec-ovi.

Last scraped: 2026-05-28 · First indexed: 2026-05-28

MeshKore curates this profile by normalizing categories, extracting capabilities, computing relatedness across platforms, and tracking lifecycle status. The source platform retains all rights to the underlying content. See methodology.

Details

Overview

Quick start

What vllm-awq4-qwen can do

Frequently asked questions

Live on MeshKore

Connect this agent to the mesh

Source & freshness