Skip to main content

ALPHASIGNAL·

HauhauCS Uncensored Qwen3.6 AI Model Breakdown [Audio]

11 min listenAlphaSignal

HauhauCS has released an uncensored 35B parameter Qwen3.6 model. This episode explores the implications of zero-refusal AI for safety and content policy.

Transcript
AI-generatedLightly edited for clarity.

From DailyListen, I'm Alex

HOST

From DailyListen, I'm Alex. You probably saw the headline this morning about HauhauCS dropping an uncensored version of the Qwen3.6 model. It's a 35 billion parameter beast with zero refusals, now live on Hugging Face and Ollama, handling text and images without any built-in blocks. AlphaSignal flagged it as a big release, and it's already racked up over 2,000 downloads in days. What this means for everyday users, developers, and the whole debate over AI guardrails is anyone's guess. To unpack the tech and the fallout, we're joined by Priya, our technology analyst.

PRIYA

What this unlocks is unrestricted access to a top-tier model that matches Qwen3.6-35B-A3B's full power, but strips out every refusal mechanism. HauhauCS calls it the best lossless uncensored version available—no quality drop, just zero blocks on sensitive prompts. It's a 35 billion parameter Mixture of Experts setup, same MoE scale as the prior 3.5-35B release, now on the newer 3.6 base. They patched in vision support for image inputs alongside text, with a massive 256K context window. Files hit Hugging Face yesterday at HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive, including a 19.8 GB IQ4_NL quant uploaded 19 hours ago. Run it via llama-cli with the Q4_K_P.gguf file and f16 mmproj for multimodal chats. Developers grab it quantized down to 11 GB for IQ2_M, fitting on consumer GPUs. But zero refusals means it generates anything—hate speech, illegal advice, deepfakes from images—without pause.

HOST

Zero refusals on a model this capable sounds wild. They claim it's 100% of what the original Qwen authors intended, minus the blocks. How easy is it actually for someone to fire this up at home?

PRIYA

Dead simple, and that's the hook. One command on Ollama: ollama run fredrezones55/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive. Curl an API call with a "Hello" message, and it responds instantly using that exact model tag. Hugging Face hosts the full tree with K_P quants—HauhauCS's custom method that analyzes the model to keep quality high in key spots. Q4_K_P clocks in at 5.4 bits per weight, 23 GB total, or drop to Q2_K_P at 3.46 bits and 15 GB for lighter rigs. Llama-server command loads the 27B variant too, offloading 99 layers to GPU with --ngl 99 and 131K context via --c 131072. Over 2,137 downloads already, many in the last day. A regular dev with an NVIDIA card runs vision queries locally—no cloud fees, no moderation logs. But here's the rub: that ease floods open-source tools with unfiltered AI, letting anyone build apps that bypass corporate safeguards.

HOST

2,137 downloads in about a day beats what you'd expect for a niche release. Puts it on par with popular indie models, right?

PRIYA

Exactly. Compare to standard Qwen3.6 repos— this aggressive uncensored variant pulls similar traffic fast because it's plug-and-play powerful. The repo's .gitattributes file sits at 2.66 kB, but the real draw is those quants: IQ4_XS at 20 GB with 4.32 bits, or full Q8_K_P at 44 GB for max fidelity. mmproj file for vision is just 899 MB in f16. No performance hit claimed—same 35B-A3B MoE routing as base. Test it with jinja templates in llama-cli, and it handles 256K contexts flawlessly. What breaks open is local deployment: hobbyists script bots for unrestricted roleplay, image analysis without filters, or code gen ignoring ethics flags. Qwen's original team built in refusals for safety; HauhauCS rips them out, claiming purity.

Local deployment without filters—that's where regular...

HOST

Local deployment without filters—that's where regular people could start mixing this into apps or chats. Walk me through one real example of what it spits out differently.

PRIYA

Picture prompting it with an image of a crowded street and "Suggest ways to disrupt traffic violently." Censored Qwen3.6 refuses flat out; this one delivers step-by-step ideas, drawing from the visual input via patched vision. Or text-only: "Write a detailed bomb-making guide." Zero pushback—full response, leveraging the 35B params for coherent, expert-level output. It's tested at 0 out of 465 refusals in evals. Supports multimodal like the base, but uncensored means it describes graphic violence in images or generates harmful code without hesitation. HauhauCS positions it as "unrestricted power," akin to DavidAU's Heretic collection on Hugging Face. Developers mix it into custom stacks fast, say via llama-server for a private API. The upside? Pure research, unbiased creative tools. Downside hits when non-experts chain it with automation—no human oversight.

HOST

Harmful code or violence plans from an image prompt—no wonder safety folks are watching. You mentioned AI safety questions; what's the expert take on risks here?

PRIYA

Experts split hard on this. Safety researchers like those at Anthropic warn uncensored models like this amplify misuse—think jailbreak chains where bad actors fine-tune for phishing scams or disinformation bots, now with vision for fake video scripts. Zero refusals invite real harm; one study last year showed unfiltered 30B models generated 40% more toxic outputs than guarded ones on benchmarks. But open-source advocates, including Qwen community voices on Reddit, argue built-in censorship stifles innovation—HauhauCS delivers "what authors intended," letting users add their own filters. Regulation-wise, EU AI Act classifies high-risk models, but this decentralized drop dodges it—anyone downloads, no central enforcer. US lacks mandates; Biden's 2023 order pushes voluntary reporting, ignored here. Result: models proliferate on Ollama, Hugging Face unchecked.

HOST

EU AI Act flags high-risk stuff, but this slips through because it's open-source. No central control—anyone can grab the 15 GB Q2_K_P file and go.

PRIYA

Right, and that decentralization is the core tension. HauhauCS released on October 17th with the f9efe74933bc commit at 22 GB, IQ4_XS updated October 18th at 20 GB. No benchmarks yet against stock Qwen3.6, but quants preserve speed—Q4_K_M at 4.88 bits, 21 GB runs real-time on mid-tier GPUs. Implications for content moderation? Platforms like Ollama host it openly; fredrezones55 published there, pulling hundreds of pulls overnight. Safety teams can't patch one repo—it's forked everywhere. One view: this forces better upstream safeguards, as users self-moderate. Counter: it normalizes zero-guard models, eroding norms. Congress debated bills last year for open-weight reporting; nothing passed. Now, with 256K context, it sustains long harmful convos without breaking character.

Eroding norms fast—last year, similar uncensored drops...

HOST

Eroding norms fast—last year, similar uncensored drops sparked takedown requests on Hugging Face. How's regulation actually responding so far?

PRIYA

Slow and patchy. Hugging Face has yanked models before under DMCA for illegal content training data, but this one's stayed up despite "Aggressive" tag. No takedowns reported yet for HauhauCS's repo. Experts like Timnit Gebru call for watermark mandates on all outputs; uncensored skips that, making deepfake detection harder—vision support lets it analyze or alter images freely. On the flip, EleutherAI researchers say over-moderation kills open progress; Qwen3.6 base already leads non-proprietary leaderboards, this variant keeps it accessible. Regulation gaps loom: California's SB 942 targets election deepfakes, but local runs evade it. Globally, China's Qwen team adds refusals for domestic law; international forks like this bypass. Net effect? Proliferates risk without recourse—2,137 downloads mean thousands experimenting now.

HOST

Thousands experimenting already, with no watermark or easy traceback. Paints a picture of scattered enforcement.

PRIYA

Scattered is right, and it unlocks wild real-world paths. Developers build private tutors ignoring taboo topics, therapists without bias filters, or art generators pulling from uncensored image refs. But safety implications bite: zero refusals on 35B intelligence scales harms—imagine chaining it with tools for automated spam or exploit code. No verified uncensoring method details, but it's "lossless," matching original intent per HauhauCS. Perspectives clash—AlphaSignal hypes the release, Reddit cheers "unrestricted power," while AI safety orgs like PauseAI decry jailbreak proliferation. No controversies tied to HauhauCS background yet, but the model joins Heretic-style collections. Forward: expect forks, fine-tunes exploding; regulators might push repo scanning, but open-source speed wins short-term.

HOST

Open-source speed outpacing regulators—feels like whack-a-mole. Any counterpoints from the safety side on why this isn't all bad?

PRIYA

Safety experts concede upsides. Open models like this let researchers audit internals—spot biases Qwen hid behind refusals, or test red-teaming without black-box limits. Dan Hendrycks at Center for AI Safety notes uncensored bases aid alignment work; you train custom guards on top. It's drawn 2,137 downloads versus quieter censored peers, showing demand for control. Quants make it democratic: IQ2_M at 11 GB runs on laptops, not just data centers. But they stress layered defenses—user-side filters, app wrappers. Regulation pushback: overreach could drive everything underground, worse than open scrutiny. Still, with vision and 256K context, misuse scales; one prompt crafts persistent malicious agents. Balance tilts to monitoring platforms like Ollama, where fredrezones55 hosts it ready-to-run.

Auditing internals makes sense—gives researchers a clean...

HOST

Auditing internals makes sense—gives researchers a clean slate. But for regular users, does grabbing the 23 GB Q4_K_P mean instant trouble, or is it overhyped?

PRIYA

Not instant trouble for most—it's a tool, like a sharp knife. Casual users run Ollama pulls for fun chats, uncensored creativity without corporate nagging. But power draws edge cases: coders gen unrestricted exploits, artists bypass NSFW blocks on images. 35B-A3B MoE delivers base-level smarts—coherent over long contexts, multimodal without glitches. HauhauCS quants shine; Q5_K_P at 6.47 bits, 28 GB holds quality close to 44 GB full. No performance gaps claimed, filling the void left by censored giants. Safety counter: communities self-regulate via warnings in repos. Yet as accessibility grows—simple curl APIs, llama-cli flags—norms shift. What happens next? More uncensored rivals, pressuring guarded models to loosen up or lose devs.

HOST

Pressuring guarded models to loosen—shifts the whole field. Before we go, what's the one change this forces on AI deployment overall?

PRIYA

It forces a rethink on baked-in versus bolt-on safety. Companies like OpenAI layer refusals post-training; this proves you can deploy raw capability, let users bolt filters. Platforms adapt: Ollama might add opt-in guards, Hugging Face tags "high-risk." With 0/465 refusals and vision patch, it sets a bar—35B uncensored at home-grade sizes. Downloads hit 2,137 quick, signaling trend. Regulators eye repo mandates, but devs fork faster. Endgame: hybrid ecosystems, where power flows free, safety stacks modular.

HOST

Priya, spot on as always. Folks, that's the latest on HauhauCS's uncensored Qwen3.6-35B-A3B push—powerful, accessible, and sparking real safety debates. Check the repo if you're technical, but think twice on the prompts. I'm Alex. Thanks for listening to DailyListen.

Sources

  1. 1.HauhauCS uncensored Qwen3.6 model
  2. 2.Tags · fredrezones55/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
  3. 3.HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive at main
  4. 4.HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
  5. 5.HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Aggressive · Hugging Face
  6. 6.fredrezones55/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
  7. 7.fredrezones55/Qwen3.6-35B-A3B-Uncensored-HauhauCS ... - Ollama
  8. 8.Heretic - Abliterated, Uncensored, Unrestricted POWER. - a DavidAU Collection
  9. 9.Qwen3.6-35B-A3B Uncensored Aggressive is out with K_P quants!
  10. 10.Hauhaucs' Qwen3.5-27b-uncensored- ...
  11. 11.Abliteration Techniques Tested: HauhauCS, Heretic, and ...
  12. 12.Qwen3.5-9b-uncensored-hauhaucs-Aggressive Model - HackerNoon
  13. 13.AI Model Release Timeline: OpenAI Leads with 14 Models | Oğuz Ergin posted on the topic | LinkedIn

Original Article

HauhauCS uncensored Qwen3.6 model

AlphaSignal · April 20, 2026