How it works¶
UnBias-Plus is a thin, deliberate pipeline around a fine-tuned language model. The model isn't the interesting part on its own; what matters is that the model is prompted, parsed, and validated to return a structured, auditable result instead of free-form prose. This page walks through what happens between calling analyze() and getting a BiasResult back.
The pipeline at a glance¶
Input
Raw text — a sentence, paragraph, or full article. Passed via CLI, REST, or pipe.analyze(text).
Prompt
A system prompt fixes the bias categories, segment rules, and the JSON output schema.
Model
Fine-tuned Qwen3-8B-UnBias-Plus-SFT-Instruct generates the structured JSON. 4-bit quantization is optional.
Parse and validate
The JSON is extracted, validated with Pydantic, and character offsets are attached to each segment.
BiasResult
A typed object: render it, log it, persist it, pipe it anywhere.
What the model is asked to do¶
The system prompt defines a tight contract. Severity scales, segment rules, and output shape are all explicit, so the parsing step has something stable to verify against. A condensed view:
## BIAS TYPES
- loaded language : words with strong emotional connotations
- dehumanizing framing : language that strips dignity from groups
- false generalizations : sweeping statements ("they always", "all of them")
- framing bias : selective wording that implies a viewpoint
- euphemism/dysphemism : softening or hardening language
- politically charged terminology : labels used to provoke rather than describe
- sensationalism : exaggerated language to evoke emotional responses
## SEGMENT RULES
- "original" MUST be the EXACT substring as it appears in the input.
- Prefer fewer, longer segments over many short overlapping ones.
- Only modify phrases listed in biased_segments; preserve all factual content.
## OUTPUT SCHEMA (return ONLY valid JSON, no extra text)
{
  "binary_label": "biased" | "unbiased",
  "severity": 1-5,
  "bias_found": true | false,
  "biased_segments": [...],
  "unbiased_text": "Full rewritten neutral article"
}
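The parse-and-validate step can be sketched in a few lines. The real pipeline validates with Pydantic; this stdlib-only version (the function name `extract_result` and the tolerant regex are illustrative, not the library's code) shows the shape of the check:

```python
import json
import re

REQUIRED_KEYS = {"binary_label", "severity", "bias_found",
                 "biased_segments", "unbiased_text"}

def extract_result(model_output: str) -> dict:
    """Pull the first JSON object out of the raw model output and
    sanity-check its keys (the library itself validates with Pydantic)."""
    # The prompt demands JSON only, but tolerate stray text around the object.
    match = re.search(r"\{.*\}", model_output, re.DOTALL)
    if match is None:
        raise ValueError("no JSON object found in model output")
    data = json.loads(match.group(0))
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    return data
```

Anything that fails the check is rejected before it can reach a caller, which is what makes the output contract enforceable rather than aspirational.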
Two design choices follow from this contract:
Exact substrings, not paraphrases
The model is instructed to return the exact original substring for each biased segment. In practice, offset matching first tries direct substring lookup, then a small normalization fallback (trimmed text plus quote and dash normalization). If no match is found, the segment is still returned, and start and end remain None.
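The normalization fallback can be sketched like this; the character table and the function names are illustrative assumptions, not the library's actual implementation:

```python
def fold_chars(s: str) -> str:
    """Map typographic quotes and dashes to their ASCII equivalents.
    translate() preserves string length, so an offset found in the
    folded text maps straight back onto the original."""
    return s.translate(str.maketrans({
        "\u2018": "'", "\u2019": "'",   # curly single quotes
        "\u201c": '"', "\u201d": '"',   # curly double quotes
        "\u2013": "-", "\u2014": "-",   # en dash, em dash
    }))

def find_segment(text: str, phrase: str) -> int:
    """Exact substring lookup first, then the normalized fallback;
    returns -1 if neither matches (offsets then stay None)."""
    pos = text.find(phrase)
    if pos != -1:
        return pos
    return fold_chars(text).find(fold_chars(phrase).strip())
```

This catches the common failure where the model emits straight quotes for a phrase the source rendered with curly ones.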
Preserve facts, change framing
The rewriter is instructed to modify only the segments it flagged. Names, numbers, places, and quotes stay; loaded framing around them changes. The unbiased text should not read as a different article — just a calmer one.
The structured output is the point¶
Every public method ultimately returns a BiasResult. The schema is small enough to memorize and rich enough to drive a UI:
class BiasedSegment(BaseModel):
    original: str       # exact substring from input
    replacement: str    # neutral alternative
    severity: str       # "low" | "medium" | "high"
    bias_type: str      # usually one core category; may be merged (e.g. "A / B")
    reasoning: str      # 1-2 sentence explanation
    start: int | None   # character offset (computed)
    end: int | None     # character offset (computed)

class BiasResult(BaseModel):
    binary_label: str   # "biased" | "unbiased"
    severity: int       # 1-5 (article-level)
    bias_found: bool
    biased_segments: list[BiasedSegment]
    unbiased_text: str  # full neutral rewrite
    original_text: str | None
That shape is what makes practical applications possible. With offsets and reasoning attached:
- A frontend can render the original text with biased phrases highlighted in place, and show the rationale on hover.
- A moderation queue can sort items by severity and route only severity >= 3 to a human reviewer.
- An annotation pipeline can persist segments as labeled spans for training or analysis.
- A diff view can show the original and the rewritten text side by side, segment by segment.
A free-form "this text is biased because..." paragraph from a stock LLM does not enable any of that without bespoke parsing.
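The highlighting case is a few lines once offsets exist. A minimal sketch, with plain dicts standing in for BiasedSegment and `[[...]]` markers standing in for whatever markup a frontend would emit:

```python
def highlight(text: str, segments: list[dict]) -> str:
    """Wrap each matched segment in [[...]] markers, working back to
    front so earlier offsets stay valid as the string grows."""
    out = text
    placed = [s for s in segments if s["start"] is not None]
    for seg in sorted(placed, key=lambda s: s["start"], reverse=True):
        out = (out[:seg["start"]] + "[["
               + out[seg["start"]:seg["end"]] + "]]"
               + out[seg["end"]:])
    return out

text = "Politicians always lie about budgets."
segments = [{"original": "always lie", "start": 12, "end": 22}]
# highlight(text, segments) -> "Politicians [[always lie]] about budgets."
```

Segments whose offsets are None are simply skipped here; a real UI would list them separately rather than drop them.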
How offsets get computed¶
The model returns the biased phrase as it appeared in the input, but it does not return character positions. schema.compute_offsets() walks the original text with a cursor:
- For each segment, search for the exact phrase starting from the cursor's current position.
- If found, attach start and end and advance the cursor past end. This handles repeated phrases: "always" appearing twice gets two distinct offset pairs in order of appearance.
- If not found from the cursor, fall back to a search from position 0. If still missing, log a warning and leave offsets unset (the segment is still returned).
- After processing all segments, sort by start.
The result is a list you can iterate in reading order, with positions that map cleanly onto the original text for highlighting.
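The steps above can be sketched as follows. Plain dicts stand in for BiasedSegment, and the real schema.compute_offsets() may differ in detail:

```python
import logging

def compute_offsets(text: str, segments: list[dict]) -> list[dict]:
    """Cursor-based offset attachment, mirroring the walk described
    above; mutates and returns the segment dicts."""
    cursor = 0
    for seg in segments:
        phrase = seg["original"]
        pos = text.find(phrase, cursor)       # search from the cursor first
        if pos == -1:
            pos = text.find(phrase)           # fall back to a search from 0
        if pos == -1:
            logging.warning("segment not found: %r", phrase)
            seg["start"] = seg["end"] = None  # keep the segment, offsets unset
        else:
            seg["start"], seg["end"] = pos, pos + len(phrase)
            cursor = seg["end"]               # advance past this match
    # reading order; segments without offsets sort last
    segments.sort(key=lambda s: (s["start"] is None, s["start"] or 0))
    return segments
```

Advancing the cursor past each match is what gives two occurrences of the same phrase two distinct offset pairs instead of the same one twice.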
Three entry points, one pipeline¶
CLI
unbias-plus --text "...", --file, --serve — quick experiments, scripts, batch runs.
REST API
unbias-plus --serve — POST /analyze, GET /health, plus a bundled demo UI.
Python API
from unbias_plus import UnBiasPlus — embed in pipelines, notebooks, services.
All three expose the same BiasResult shape. In local mode, CLI and Python call pipeline.UnBiasPlus.analyze() directly. REST uses the same local pipeline unless VLLM_BASE_URL is set, in which case it operates as a remote proxy.
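The routing decision reduces to one environment check. A minimal sketch of that logic (the function name is illustrative; the server's actual dispatch code may differ):

```python
import os

def backend_mode() -> str:
    """Pick the backend the way the docs describe: remote proxy when
    VLLM_BASE_URL is set, the local pipeline otherwise."""
    return "remote-proxy" if os.environ.get("VLLM_BASE_URL") else "local"
```

Because the decision is environment-driven, the same container image can serve both as a standalone GPU worker and as a thin proxy in front of a shared vLLM deployment.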
Performance notes¶
- Default model: vector-institute/Qwen3-8B-UnBias-Plus-SFT-Instruct, fine-tuned from Qwen3-8B for this task.
- VRAM: roughly 16 GB at BF16. Pass load_in_4bit=True (or --load-in-4bit on the CLI) to fit on smaller GPUs, at the cost of some replacement quality.
- Input limits: model context is 8,192 tokens. REST additionally enforces MAX_INPUT_CHARS (default 5000, configurable via environment variable). CLI and Python do not enforce a fixed character cap; practical limits come from model context and available memory.
- CPU: supported via device="cpu", but inference is slow. GPU is recommended for any real workload.
Next¶
- Install and run UnBias-Plus locally — CLI, REST, or Python.
- Source code, issues, releases, and full module reference in src/unbias_plus.