Skip to content

API Reference

Reference for the unbias_plus package: pipeline, model, schema, FastAPI server, CLI, prompt, parser, and formatters. All public classes and functions are listed below.

Package

unbias_plus

unbias-plus: Bias detection and debiasing using a single LLM.

UnBiasPlus

Main pipeline for bias detection and debiasing.

Loads a fine-tuned LLM and exposes a simple interface for analyzing text for bias. Combines prompt building, inference, JSON parsing, offset computation, and formatting.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the fine-tuned model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.

DEFAULT_MODEL
device str | None

Device to run on ('cuda' or 'cpu'). Auto-detected if None.

None
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
max_new_tokens int

Maximum tokens to generate. Default is 4096.

4096

Examples:

>>> from unbias_plus import UnBiasPlus
>>> pipe = UnBiasPlus()
>>> result = pipe.analyze("Women are too emotional to lead.")
>>> print(result.binary_label)
biased
Source code in src/unbias_plus/pipeline.py
class UnBiasPlus:
    """Main pipeline for bias detection and debiasing.

    Wraps a fine-tuned LLM behind a minimal interface: a single
    :meth:`analyze` call performs prompt construction, inference,
    JSON parsing, character-offset computation, and result assembly.
    Convenience wrappers return CLI, dict, or JSON renderings.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the fine-tuned
        model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.
    device : str | None, optional
        Device to run on ('cuda' or 'cpu'). Auto-detected if None.
    load_in_4bit : bool, optional
        Load model in 4-bit quantization. Default is False.
    max_new_tokens : int, optional
        Maximum tokens to generate. Default is 4096.

    Examples
    --------
    >>> from unbias_plus import UnBiasPlus  # doctest: +SKIP
    >>> pipe = UnBiasPlus()  # doctest: +SKIP
    >>> result = pipe.analyze("Women are too emotional to lead.")  # doctest: +SKIP
    >>> print(result.binary_label)  # doctest: +SKIP
    biased

    """

    def __init__(
        self,
        model_name_or_path: str | Path = DEFAULT_MODEL,
        device: str | None = None,
        load_in_4bit: bool = False,
        max_new_tokens: int = 4096,
    ) -> None:
        # All heavy lifting (weights, tokenizer, generation) lives in
        # UnBiasModel; this class only orchestrates the pipeline steps.
        self._model = UnBiasModel(
            model_name_or_path=model_name_or_path,
            device=device,
            load_in_4bit=load_in_4bit,
            max_new_tokens=max_new_tokens,
        )

    def analyze(self, text: str) -> BiasResult:
        """Analyze input text for bias.

        Runs the full pipeline: builds chat messages, runs inference,
        parses the JSON output, computes character offsets for each
        segment, and attaches the original text to the result.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        BiasResult
            Structured bias result with start/end offsets on each
            segment and original_text populated.

        Raises
        ------
        ValueError
            If the LLM output cannot be parsed into a valid BiasResult.

        Examples
        --------
        >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
        >>> result.bias_found  # doctest: +SKIP
        True

        """
        raw_output = self._model.generate(build_messages(text))
        parsed = parse_llm_output(raw_output)

        # Character-level offsets let the frontend highlight segments.
        enriched = compute_offsets(text, parsed.biased_segments)

        return parsed.model_copy(
            update={"biased_segments": enriched, "original_text": text}
        )

    def analyze_to_cli(self, text: str) -> str:
        """Analyze text and return a formatted CLI string.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Human-readable colored string for terminal display.

        """
        result = self.analyze(text)
        return format_cli(result)

    def analyze_to_dict(self, text: str) -> dict:
        """Analyze text and return result as a plain dictionary.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        dict
            Plain dictionary representation of the result.

        """
        result = self.analyze(text)
        return format_dict(result)

    def analyze_to_json(self, text: str) -> str:
        """Analyze text and return result as a JSON string.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Pretty-printed JSON string of the result.

        """
        result = self.analyze(text)
        return format_json(result)

analyze

analyze(text)

Analyze input text for bias.

Runs the full pipeline: builds chat messages, runs inference, parses JSON output, computes character offsets for each segment, and attaches the original text to the result.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
BiasResult

Structured bias result with start/end offsets on each segment and original_text populated.

Raises:

Type Description
ValueError

If the LLM output cannot be parsed into a valid BiasResult.

Examples:

>>> result = pipe.analyze("All politicians are liars.")
>>> result.bias_found
True
Source code in src/unbias_plus/pipeline.py
def analyze(self, text: str) -> BiasResult:
    """Analyze input text for bias.

    Runs the full pipeline: builds chat messages, runs inference,
    parses the JSON output, computes character offsets for each
    segment, and attaches the original text to the result.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    BiasResult
        Structured bias result with start/end offsets on each
        segment and original_text populated.

    Raises
    ------
    ValueError
        If the LLM output cannot be parsed into a valid BiasResult.

    Examples
    --------
    >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
    >>> result.bias_found  # doctest: +SKIP
    True

    """
    raw_output = self._model.generate(build_messages(text))
    parsed = parse_llm_output(raw_output)

    # Character-level offsets let the frontend highlight segments.
    enriched = compute_offsets(text, parsed.biased_segments)

    return parsed.model_copy(
        update={"biased_segments": enriched, "original_text": text}
    )

analyze_to_cli

analyze_to_cli(text)

Analyze text and return a formatted CLI string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Human-readable colored string for terminal display.

Source code in src/unbias_plus/pipeline.py
def analyze_to_cli(self, text: str) -> str:
    """Analyze text and return a formatted CLI string.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Human-readable colored string for terminal display.

    """
    result = self.analyze(text)
    return format_cli(result)

analyze_to_dict

analyze_to_dict(text)

Analyze text and return result as a plain dictionary.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
dict

Plain dictionary representation of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_dict(self, text: str) -> dict:
    """Analyze text and return result as a plain dictionary.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    dict
        Plain dictionary representation of the result.

    """
    result = self.analyze(text)
    return format_dict(result)

analyze_to_json

analyze_to_json(text)

Analyze text and return result as a JSON string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Pretty-printed JSON string of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_json(self, text: str) -> str:
    """Analyze text and return result as a JSON string.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Pretty-printed JSON string of the result.

    """
    result = self.analyze(text)
    return format_json(result)

BiasedSegment

Bases: BaseModel

A single biased segment detected in the text.

Attributes:

Name Type Description
original str

The original biased phrase from the input text.

replacement str

The suggested neutral replacement. Defaults to empty string if the model omits it (e.g. under 4-bit quantization).

severity str

Severity level: 'low', 'medium', or 'high'. Defaults to 'medium' if omitted by the model.

bias_type str

Type of bias (e.g. 'loaded language', 'framing bias').

reasoning str

Explanation of why this segment is considered biased.

start int | None

Character offset start in the original text. Computed by the pipeline after parsing.

end int | None

Character offset end in the original text. Computed by the pipeline after parsing.

Examples:

>>> seg = BiasedSegment(
...     original="Sharia-obsessed fanatics",
...     replacement="extremist groups",
...     severity="high",
...     bias_type="dehumanizing framing",
...     reasoning="Uses inflammatory religious language.",
... )
>>> seg.severity
'high'
Source code in src/unbias_plus/schema.py
class BiasedSegment(BaseModel):
    """A single biased segment detected in the text.

    Attributes
    ----------
    original : str
        The original biased phrase from the input text.
    replacement : str
        Suggested neutral replacement. Empty string when the model
        omits it (e.g. under 4-bit quantization).
    severity : str
        Severity level: 'low', 'medium', or 'high'. Falls back to
        'medium' when the model omits or mangles the value.
    bias_type : str
        Type of bias (e.g. 'loaded language', 'framing bias').
    reasoning : str
        Explanation of why this segment is considered biased.
    start : int | None
        Character offset start in the original text, filled in by
        the pipeline after parsing.
    end : int | None
        Character offset end in the original text, filled in by
        the pipeline after parsing.

    Examples
    --------
    >>> seg = BiasedSegment(
    ...     original="Sharia-obsessed fanatics",
    ...     replacement="extremist groups",
    ...     severity="high",
    ...     bias_type="dehumanizing framing",
    ...     reasoning="Uses inflammatory religious language.",
    ... )
    >>> seg.severity
    'high'

    """

    original: str
    replacement: str = ""  # may be absent in 4-bit quantized output
    severity: str = "medium"  # absent values default to medium
    bias_type: str = ""
    reasoning: str = ""
    start: int | None = None
    end: int | None = None

    @field_validator("severity")
    @classmethod
    def validate_severity(cls, v: str) -> str:
        """Validate and normalise segment severity to low/medium/high."""
        cleaned = v.lower().strip()
        if cleaned in ("low", "medium", "high"):
            return cleaned
        # Unknown value from the model — log it and fall back.
        logger.warning(
            "Unexpected segment severity '%s', defaulting to 'medium'", v
        )
        return "medium"

validate_severity classmethod

validate_severity(v)

Validate and normalise segment severity to low/medium/high.

Source code in src/unbias_plus/schema.py
@field_validator("severity")
@classmethod
def validate_severity(cls, v: str) -> str:
    """Validate and normalise segment severity to low/medium/high."""
    allowed = {"low", "medium", "high"}
    normalized = v.lower().strip()
    if normalized not in allowed:
        logger.warning(
            "Unexpected segment severity '%s', defaulting to 'medium'", v
        )
        return "medium"
    return normalized

BiasResult

Bases: BaseModel

Full bias analysis result for an input text.

Attributes:

Name Type Description
binary_label str

Overall label: 'biased' or 'unbiased'.

severity int

Overall severity score: 0 = neutral / no bias; 2 = recurring biased framing; 3 = strong persuasive tone; 4 = inflammatory rhetoric. If the model returns a string ('low', 'medium', 'high'), it is coerced to the nearest integer value.

bias_found bool

Whether any bias was detected in the text.

biased_segments list[BiasedSegment]

List of biased segments found in the text, each with character-level start/end offsets.

unbiased_text str

Full neutral rewrite of the input text.

original_text str | None

The original input text. Set by the pipeline.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="A neutral version of the text.",
... )
>>> result.binary_label
'biased'
Source code in src/unbias_plus/schema.py
class BiasResult(BaseModel):
    """Full bias analysis result for an input text.

    Attributes
    ----------
    binary_label : str
        Overall label: 'biased' or 'unbiased'.
    severity : int
        Overall severity score:
          0 = neutral / no bias
          2 = recurring biased framing
          3 = strong persuasive tone
          4 = inflammatory rhetoric
        If the model returns a string ('low', 'medium', 'high'),
        it is coerced to the nearest integer value.
    bias_found : bool
        Whether any bias was detected in the text.
    biased_segments : list[BiasedSegment]
        List of biased segments found in the text, each with
        character-level start/end offsets.
    unbiased_text : str
        Full neutral rewrite of the input text.
    original_text : str | None
        The original input text. Set by the pipeline.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="A neutral version of the text.",
    ... )
    >>> result.binary_label
    'biased'

    """

    binary_label: str
    severity: int
    bias_found: bool
    biased_segments: list[BiasedSegment]
    unbiased_text: str
    original_text: str | None = None

    @field_validator("binary_label")
    @classmethod
    def validate_binary_label(cls, v: str) -> str:
        """Validate binary_label is 'biased' or 'unbiased'."""
        allowed = {"biased", "unbiased"}
        label = v.lower().strip()
        if label in allowed:
            return label
        raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")

    @field_validator("severity", mode="before")
    @classmethod
    def validate_severity(cls, v: int | str) -> int:
        """Coerce and validate global severity.

        Accepts:
          - int 0, 2, 3, 4  (correct model output)
          - str 'low', 'medium', 'high', 'none'  (model confused scales)
          - any other int   (clamped to nearest valid value)
        """
        # String coercion — model confused global vs segment severity scale
        if isinstance(v, str):
            key = v.lower().strip()
            coerced = _STR_TO_INT_SEVERITY.get(key)
            if coerced is not None:
                logger.warning(
                    "Global severity returned as string '%s', coerced to %d",
                    v,
                    coerced,
                )
                return coerced
            # Not a known label — maybe an int rendered as a string ("3")
            try:
                v = int(v)
            except ValueError:
                logger.warning("Unrecognized severity '%s', defaulting to 2", v)
                return 2

        # Map any integer onto the valid {0, 2, 3, 4} scale.
        if v <= 0:
            return 0
        if v == 1:
            return 2
        return min(v, 4)

validate_binary_label classmethod

validate_binary_label(v)

Validate binary_label is 'biased' or 'unbiased'.

Source code in src/unbias_plus/schema.py
@field_validator("binary_label")
@classmethod
def validate_binary_label(cls, v: str) -> str:
    """Validate binary_label is 'biased' or 'unbiased'."""
    allowed = {"biased", "unbiased"}
    normalized = v.lower().strip()
    if normalized not in allowed:
        raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")
    return normalized

validate_severity classmethod

validate_severity(v)

Coerce and validate global severity.

Accepts: int 0, 2, 3, or 4 (correct model output); str 'low', 'medium', 'high', or 'none' (model confused the scales); any other int (clamped to the nearest valid value).

Source code in src/unbias_plus/schema.py
@field_validator("severity", mode="before")
@classmethod
def validate_severity(cls, v: int | str) -> int:
    """Coerce and validate global severity.

    Accepts:
      - int 0, 2, 3, 4  (correct model output)
      - str 'low', 'medium', 'high', 'none'  (model confused scales)
      - any other int   (clamped to nearest valid value)
    """
    # String coercion — model confused global vs segment severity scale
    if isinstance(v, str):
        normalized = v.lower().strip()
        if normalized in _STR_TO_INT_SEVERITY:
            coerced = _STR_TO_INT_SEVERITY[normalized]
            logger.warning(
                "Global severity returned as string '%s', coerced to %d",
                v,
                coerced,
            )
            return coerced
        # Try parsing as int string e.g. "3"
        try:
            v = int(v)
        except ValueError:
            logger.warning("Unrecognized severity '%s', defaulting to 2", v)
            return 2

    # Clamp out-of-range integer values gracefully
    if v <= 0:
        return 0
    if v in {2, 3, 4}:
        return v
    if v == 1:
        return 2
    return 4  # anything > 4

serve

serve(
    model_name_or_path=DEFAULT_MODEL,
    host="0.0.0.0",
    port=8000,
    load_in_4bit=False,
    reload=False,
)

Start the unbias-plus API server with the demo UI.

Loads the model and starts a uvicorn server. The demo UI is served at http://localhost:{port}/ and the API is at http://localhost:{port}/analyze.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the model.

DEFAULT_MODEL
host str

Host address to bind to. Default is '0.0.0.0'.

'0.0.0.0'
port int

Port to listen on. Default is 8000.

8000
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
reload bool

Enable auto-reload on code changes. Default is False.

False

Examples:

>>> from unbias_plus.api import serve
>>> serve("Qwen/Qwen3-4B", port=8000)
Source code in src/unbias_plus/api.py
def serve(
    model_name_or_path: str | Path = DEFAULT_MODEL,
    host: str = "0.0.0.0",
    port: int = 8000,
    load_in_4bit: bool = False,
    reload: bool = False,
) -> None:
    """Start the unbias-plus API server with the demo UI.

    Loads the model and starts a uvicorn server. The demo UI
    is served at http://localhost:{port}/ and the API is at
    http://localhost:{port}/analyze.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the model.
    host : str
        Host address to bind to. Default is '0.0.0.0'.
    port : int
        Port to listen on. Default is 8000.
    load_in_4bit : bool
        Load model in 4-bit quantization. Default is False.
    reload : bool
        Enable auto-reload on code changes. Default is False.

    Examples
    --------
    >>> from unbias_plus.api import serve
    >>> serve("Qwen/Qwen3-4B", port=8000)  # doctest: +SKIP
    """
    print(f"Starting unbias-plus server at http://localhost:{port}")
    # Stash configuration on app.state; the lifespan handler reads it
    # when uvicorn starts the app and loads the model accordingly.
    app.state.model_name_or_path = str(model_name_or_path)
    app.state.load_in_4bit = load_in_4bit
    uvicorn.run(app, host=host, port=port, reload=reload)

api

FastAPI server for unbias-plus.

AnalyzeRequest

Bases: BaseModel

Request body for the analyze endpoint.

Attributes:

Name Type Description
text str

The input text to analyze for bias.

Source code in src/unbias_plus/api.py
class AnalyzeRequest(BaseModel):
    """Request body for the analyze endpoint.

    Attributes
    ----------
    text : str
        The input text to analyze for bias.
    """

    # Single required field; validation (non-string rejection) is
    # handled by pydantic.
    text: str

HealthResponse

Bases: BaseModel

Response body for the health endpoint.

Attributes:

Name Type Description
status str

Server status string.

model str

Currently loaded model name or path.

Source code in src/unbias_plus/api.py
class HealthResponse(BaseModel):
    """Response body for the health endpoint.

    Attributes
    ----------
    status : str
        Server status string.
    model : str
        Currently loaded model name or path.
    """

    status: str
    model: str

lifespan async

lifespan(app)

Load the model on startup and release on shutdown.

Parameters:

Name Type Description Default
app FastAPI

The FastAPI application instance.

required

Yields:

Type Description
None
Source code in src/unbias_plus/api.py
@asynccontextmanager
async def lifespan(app: FastAPI) -> AsyncGenerator[None, None]:
    """Load the model on startup and release on shutdown.

    Parameters
    ----------
    app : FastAPI
        The FastAPI application instance.

    Yields
    ------
    None
    """
    # Configuration is written to app.state by serve(); fall back to
    # defaults when the app is started directly (e.g. under uvicorn CLI).
    pipe = UnBiasPlus(
        model_name_or_path=getattr(app.state, "model_name_or_path", DEFAULT_MODEL),
        load_in_4bit=getattr(app.state, "load_in_4bit", False),
    )
    app.state.pipe = pipe
    yield
    # Shutdown: drop the reference so the model can be garbage-collected.
    app.state.pipe = None

index

index()

Serve the demo UI.

Returns:

Type Description
str

HTML content of the demo page.

Raises:

Type Description
HTTPException

404 if the demo directory is not found.

Source code in src/unbias_plus/api.py
@app.get("/", response_class=HTMLResponse)
def index() -> str:
    """Serve the demo UI.

    Returns
    -------
    str
        HTML content of the demo page.

    Raises
    ------
    HTTPException
        404 if the demo directory is not found.
    """
    html_file = DEMO_DIR / "templates" / "index.html"
    if not html_file.exists():
        raise HTTPException(status_code=404, detail="Demo UI not found.")
    return html_file.read_text()

health

health(request)

Check if the server and model are ready.

Returns:

Type Description
HealthResponse

Server status and loaded model name.

Source code in src/unbias_plus/api.py
@app.get("/health", response_model=HealthResponse)
def health(request: Request) -> HealthResponse:
    """Check if the server and model are ready.

    Returns
    -------
    HealthResponse
        Server status and loaded model name.
    """
    pipe = getattr(request.app.state, "pipe", None)
    return HealthResponse(
        status="ok",
        model=str(pipe._model.model_name_or_path) if pipe else "not loaded",
    )

analyze

analyze(request, body)

Analyze input text for bias.

Parameters:

Name Type Description Default
request Request

FastAPI request (for app state).

required
body AnalyzeRequest

Request body containing the text to analyze.

required

Returns:

Type Description
BiasResult

Structured bias analysis result with character offsets.

Raises:

Type Description
HTTPException

500 if the model is not loaded or inference fails.

HTTPException

422 if the model output cannot be parsed.

Source code in src/unbias_plus/api.py
@app.post("/analyze", response_model=BiasResult)
def analyze(request: Request, body: AnalyzeRequest) -> BiasResult:
    """Analyze input text for bias.

    Parameters
    ----------
    request : Request
        FastAPI request (for app state).
    body : AnalyzeRequest
        Request body containing the text to analyze.

    Returns
    -------
    BiasResult
        Structured bias analysis result with character offsets.

    Raises
    ------
    HTTPException
        500 if the model is not loaded or inference fails.
    HTTPException
        422 if the model output cannot be parsed.
    """
    pipe = getattr(request.app.state, "pipe", None)
    if pipe is None:
        raise HTTPException(status_code=500, detail="Model not loaded.")
    try:
        return cast(BiasResult, pipe.analyze(body.text))
    except ValueError as e:
        raise HTTPException(status_code=422, detail=str(e)) from e

analyze_stream

analyze_stream(request, body)

Stream bias analysis tokens via SSE, then emit the final parsed result.

Runs model generation in a background thread via TextIteratorStreamer. Each SSE event is a JSON object:

  • {"t": "<token>"} — one chunk per model generation step.
  • {"result": {...}} — final event with the full BiasResult.
  • {"error": "<msg>"} — emitted if inference or parsing fails.

Parameters:

Name Type Description Default
request Request

FastAPI request (for app state).

required
body AnalyzeRequest

Request body containing the text to analyze.

required

Returns:

Type Description
StreamingResponse

Server-sent events stream with Content-Type text/event-stream.

Raises:

Type Description
HTTPException

500 if the model is not loaded.

Source code in src/unbias_plus/api.py
@app.post("/analyze/stream")
def analyze_stream(request: Request, body: AnalyzeRequest) -> StreamingResponse:
    """Stream bias analysis tokens via SSE, then emit the final parsed result.

    Runs model generation in a background thread via TextIteratorStreamer.
    Each SSE event is a JSON object:

    - ``{"t": "<token>"}``     — one chunk per model generation step.
    - ``{"result": {...}}``    — final event with the full BiasResult.
    - ``{"error": "<msg>"}``   — emitted if inference or parsing fails.

    Parameters
    ----------
    request : Request
        FastAPI request (for app state).
    body : AnalyzeRequest
        Request body containing the text to analyze.

    Returns
    -------
    StreamingResponse
        Server-sent events stream with Content-Type text/event-stream.

    Raises
    ------
    HTTPException
        500 if the model is not loaded.
    """
    pipe = getattr(request.app.state, "pipe", None)
    if pipe is None:
        raise HTTPException(status_code=500, detail="Model not loaded.")

    text = body.text

    def event_stream() -> Generator[str, None, None]:
        try:
            messages = build_messages(text)
            raw_output = ""

            # Stream tokens from the background generation thread
            for token in pipe._model.generate_stream(messages):
                raw_output += token
                yield "data: " + json.dumps({"t": token}) + "\n\n"

            # Full output accumulated — parse and compute offsets
            result = parse_llm_output(raw_output)
            segments = compute_offsets(text, result.biased_segments)
            final = result.model_copy(
                update={
                    "biased_segments": segments,
                    "original_text": text,
                }
            )
            yield (
                "data: "
                + json.dumps({"result": final.model_dump(mode="json")})
                + "\n\n"
            )

        except ValueError as e:
            yield "data: " + json.dumps({"error": str(e)}) + "\n\n"
        except Exception as e:
            yield "data: " + json.dumps({"error": str(e)}) + "\n\n"

    return StreamingResponse(
        event_stream(),
        media_type="text/event-stream",
        headers={
            "Cache-Control": "no-cache",
            "X-Accel-Buffering": "no",  # disable nginx buffering for SSE
        },
    )

serve

serve(
    model_name_or_path=DEFAULT_MODEL,
    host="0.0.0.0",
    port=8000,
    load_in_4bit=False,
    reload=False,
)

Start the unbias-plus API server with the demo UI.

Loads the model and starts a uvicorn server. The demo UI is served at http://localhost:{port}/ and the API is at http://localhost:{port}/analyze.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the model.

DEFAULT_MODEL
host str

Host address to bind to. Default is '0.0.0.0'.

'0.0.0.0'
port int

Port to listen on. Default is 8000.

8000
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
reload bool

Enable auto-reload on code changes. Default is False.

False

Examples:

>>> from unbias_plus.api import serve
>>> serve("Qwen/Qwen3-4B", port=8000)
Source code in src/unbias_plus/api.py
def serve(
    model_name_or_path: str | Path = DEFAULT_MODEL,
    host: str = "0.0.0.0",
    port: int = 8000,
    load_in_4bit: bool = False,
    reload: bool = False,
) -> None:
    """Start the unbias-plus API server with the demo UI.

    Loads the model and starts a uvicorn server. The demo UI
    is served at http://localhost:{port}/ and the API is at
    http://localhost:{port}/analyze.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the model.
    host : str
        Host address to bind to. Default is '0.0.0.0'.
    port : int
        Port to listen on. Default is 8000.
    load_in_4bit : bool
        Load model in 4-bit quantization. Default is False.
    reload : bool
        Enable auto-reload on code changes. Default is False.

    Examples
    --------
    >>> from unbias_plus.api import serve
    >>> serve("Qwen/Qwen3-4B", port=8000)  # doctest: +SKIP
    """
    print(f"Starting unbias-plus server at http://localhost:{port}")
    # Stash configuration on app.state; the lifespan handler reads it
    # when uvicorn starts the app and loads the model accordingly.
    app.state.model_name_or_path = str(model_name_or_path)
    app.state.load_in_4bit = load_in_4bit
    uvicorn.run(app, host=host, port=port, reload=reload)

cli

CLI entry point for unbias-plus.

parse_args

parse_args()

Parse CLI arguments.

Returns:

Type Description
Namespace

Parsed arguments.

Source code in src/unbias_plus/cli.py
def parse_args() -> argparse.Namespace:
    """Parse CLI arguments.

    Returns
    -------
    argparse.Namespace
        Parsed arguments.

    """
    parser = argparse.ArgumentParser(
        prog="unbias-plus",
        description="Detect and debias text using a single LLM.",
    )

    # --text, --file, and --serve are mutually exclusive input modes;
    # main() enforces that at least one of them was supplied.
    input_group = parser.add_mutually_exclusive_group()
    input_group.add_argument(
        "--text",
        type=str,
        help="Text string to analyze.",
    )
    input_group.add_argument(
        "--file",
        type=str,
        help="Path to a .txt file to analyze.",
    )
    input_group.add_argument(
        "--serve",
        action="store_true",
        default=False,
        help="Start the FastAPI server.",
    )

    parser.add_argument(
        "--model",
        type=str,
        default=DEFAULT_MODEL,
        help=f"HuggingFace model ID or local path. Default: {DEFAULT_MODEL}",
    )
    parser.add_argument(
        "--load-in-4bit",
        action="store_true",
        default=False,
        help="Load model in 4-bit quantization to reduce VRAM usage.",
    )
    parser.add_argument(
        "--json",
        action="store_true",
        default=False,
        help="Output result as raw JSON instead of formatted CLI display.",
    )
    parser.add_argument(
        "--max-new-tokens",
        type=int,
        default=2048,
        # Fix: help text previously claimed "Default: 1024" while the
        # actual default is 2048 — kept in sync here.
        help="Maximum number of tokens to generate. Default: 2048",
    )
    parser.add_argument(
        "--host",
        type=str,
        default="0.0.0.0",
        help="Host for the API server. Default: 0.0.0.0",
    )
    parser.add_argument(
        "--port",
        type=int,
        default=8000,
        help="Port for the API server. Default: 8000",
    )

    return parser.parse_args()

main

main()

Run the unbias-plus CLI.

Examples:

$ unbias-plus --text "Women are too emotional to lead."
$ unbias-plus --file article.txt --json
$ unbias-plus --serve --model path/to/model --port 8000
$ unbias-plus --serve --load-in-4bit

Source code in src/unbias_plus/cli.py
def main() -> None:
    """Run the unbias-plus CLI.

    Dispatches on the parsed arguments: --serve starts the API server,
    otherwise the given text or file is analyzed and printed either as
    JSON or as a formatted terminal report.

    Examples
    --------
    $ unbias-plus --text "Women are too emotional to lead."
    $ unbias-plus --file article.txt --json
    $ unbias-plus --serve --model path/to/model --port 8000
    $ unbias-plus --serve --load-in-4bit

    """
    args = parse_args()

    # Server mode is exclusive with text/file analysis (enforced by the
    # mutually exclusive argparse group) and returns without loading a
    # local pipeline here — serve() owns model loading.
    if args.serve:
        serve(
            model_name_or_path=args.model,
            host=args.host,
            port=args.port,
            load_in_4bit=args.load_in_4bit,
        )
        return

    if not args.text and not args.file:
        print(
            "Error: one of --text, --file, or --serve is required.",
            file=sys.stderr,
        )
        sys.exit(1)

    if args.file:
        try:
            # Read input explicitly as UTF-8 rather than the platform
            # default encoding (e.g. cp1252 on Windows), which would
            # corrupt or reject UTF-8 text files.
            with open(args.file, encoding="utf-8") as f:
                text = f.read()
        except FileNotFoundError:
            print(f"Error: file '{args.file}' not found.", file=sys.stderr)
            sys.exit(1)
    else:
        text = args.text

    pipe = UnBiasPlus(
        model_name_or_path=args.model,
        load_in_4bit=args.load_in_4bit,
        max_new_tokens=args.max_new_tokens,
    )

    if args.json:
        print(pipe.analyze_to_json(text))
    else:
        print(pipe.analyze_to_cli(text))

formatter

Formatters for displaying BiasResult output.

format_cli

format_cli(result)

Format a BiasResult for CLI terminal display.

Produces a human-readable, colored terminal output showing the bias label, severity, each biased segment with its replacement and reasoning, and the full unbiased rewrite.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to format.

required

Returns:

Type Description
str

A human-readable colored string for terminal output.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> output = format_cli(result)
>>> isinstance(output, str)
True
Source code in src/unbias_plus/formatter.py
def format_cli(result: BiasResult) -> str:
    """Format a BiasResult for CLI terminal display.

    Builds a human-readable, color-coded terminal report: a header
    with the number of segments found (or a "none detected" notice),
    one entry per biased segment (severity, original phrase,
    replacement, bias type, reasoning), and the full neutral rewrite.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to format.

    Returns
    -------
    str
        A human-readable colored string for terminal output.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> output = format_cli(result)
    >>> isinstance(output, str)
    True

    """
    heavy_rule = "=" * 60

    out: list[str] = [heavy_rule]
    if result.bias_found:
        out.append(f"Segments found: {len(result.biased_segments)}")
    if not result.biased_segments:
        out.append("\nNo biased segments detected.")
    out.append(heavy_rule)

    if result.biased_segments:
        out.append("\nBIASED SEGMENTS:")
        # The reset code is the same for every segment — look it up once.
        reset = _SEVERITY_COLORS["reset"]
        for idx, segment in enumerate(result.biased_segments, start=1):
            tint = _SEVERITY_COLORS.get(segment.severity, "")
            out.append(f"\n  [{idx}] {tint}{segment.severity.upper()}{reset}")
            out.append(f'  Original  : "{segment.original}"')
            out.append(f'  Replace   : "{segment.replacement}"')
            out.append(f"  Bias type : {segment.bias_type}")
            out.append(f"  Reasoning : {segment.reasoning}")

    out.append("\n" + "-" * 60)
    out.append("NEUTRAL REWRITE:")
    out.append(result.unbiased_text)
    out.append(heavy_rule)

    return "\n".join(out)

format_dict

format_dict(result)

Convert a BiasResult to a plain Python dictionary.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to convert.

required

Returns:

Type Description
dict

Plain dictionary representation of the result.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> d = format_dict(result)
>>> isinstance(d, dict)
True
Source code in src/unbias_plus/formatter.py
def format_dict(result: BiasResult) -> dict:
    """Convert a BiasResult to a plain Python dictionary.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to convert.

    Returns
    -------
    dict
        Plain dictionary representation of the result.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> d = format_dict(result)
    >>> isinstance(d, dict)
    True

    """
    # Delegate entirely to pydantic's model_dump() for the conversion.
    dumped = result.model_dump()
    return dumped

format_json

format_json(result)

Convert a BiasResult to a formatted JSON string.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to convert.

required

Returns:

Type Description
str

Pretty-printed JSON string representation of the result.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> json_str = format_json(result)
>>> isinstance(json_str, str)
True
Source code in src/unbias_plus/formatter.py
def format_json(result: BiasResult) -> str:
    """Convert a BiasResult to a formatted JSON string.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to convert.

    Returns
    -------
    str
        Pretty-printed JSON string representation of the result.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> json_str = format_json(result)
    >>> isinstance(json_str, str)
    True

    """
    # Serialize the pydantic model to a dict first, then pretty-print
    # with two-space indentation.
    payload = result.model_dump()
    return json.dumps(payload, indent=2)

model

LLM model loader and inference for unbias-plus.

UnBiasModel

Loads and runs the fine-tuned bias detection LLM.

Wraps a HuggingFace causal LM with a simple generate() interface. Compatible with any HuggingFace causal LM — thinking mode is opt-in for Qwen3 models only.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the model. Defaults to 'vector-institute/Qwen3-8B-UnBias-Plus-SFT'.

DEFAULT_MODEL
device str | None

Device to run on ('cuda' or 'cpu'). Auto-detects if not provided.

None
load_in_4bit bool

Load model in 4-bit quantization via bitsandbytes. Reduces VRAM to ~3GB (4B) or ~5GB (8B). Default is False.

False
max_new_tokens int

Maximum number of new tokens to generate. Default 2048.

2048
enable_thinking bool

Enable Qwen3 chain-of-thought thinking mode. Only supported by Qwen3 models — do not set for other models. Default is False.

False
thinking_budget int

Maximum tokens allocated to the thinking block when enable_thinking=True. Default is 512.

512

Examples:

>>> model = UnBiasModel()
>>> raw = model.generate([{"role": "user", "content": "..."}])
>>> isinstance(raw, str)
True
Source code in src/unbias_plus/model.py
class UnBiasModel:
    """Loads and runs the fine-tuned bias detection LLM.

    Wraps a HuggingFace causal LM with a simple generate()
    interface. Compatible with any HuggingFace causal LM —
    thinking mode is opt-in for Qwen3 models only.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the model.
        Defaults to 'vector-institute/Qwen3-8B-UnBias-Plus-SFT'.
    device : str | None, optional
        Device to run on ('cuda' or 'cpu').
        Auto-detects if not provided.
    load_in_4bit : bool, optional
        Load model in 4-bit quantization via bitsandbytes.
        Reduces VRAM to ~3GB (4B) or ~5GB (8B). Default is False.
    max_new_tokens : int, optional
        Maximum number of new tokens to generate. Default 2048.
    enable_thinking : bool, optional
        Enable Qwen3 chain-of-thought thinking mode. Only supported
        by Qwen3 models — do not set for other models. Default is False.
    thinking_budget : int, optional
        Maximum tokens allocated to the thinking block when
        enable_thinking=True. Default is 512.

    Examples
    --------
    >>> model = UnBiasModel()  # doctest: +SKIP
    >>> raw = model.generate([{"role": "user", "content": "..."}])  # doctest: +SKIP
    >>> isinstance(raw, str)  # doctest: +SKIP
    True
    """

    def __init__(
        self,
        model_name_or_path: str | Path = DEFAULT_MODEL,
        device: str | None = None,
        load_in_4bit: bool = False,
        max_new_tokens: int = 2048,
        enable_thinking: bool = False,
        thinking_budget: int = 512,
    ) -> None:
        self.model_name_or_path = str(model_name_or_path)
        self.max_new_tokens = max_new_tokens
        self.enable_thinking = enable_thinking
        self.thinking_budget = thinking_budget
        # Prefer CUDA when available unless the caller pins a device.
        self.device = device or ("cuda" if torch.cuda.is_available() else "cpu")

        # --- Tokenizer ---
        self.tokenizer = AutoTokenizer.from_pretrained(self.model_name_or_path)
        # Some checkpoints ship without a pad token; reuse EOS so padded
        # batches can be built without resizing embeddings.
        if self.tokenizer.pad_token is None:
            self.tokenizer.pad_token = self.tokenizer.eos_token
            self.tokenizer.pad_token_id = self.tokenizer.eos_token_id
        # Left-pad so prompts end at the same position — the usual setup
        # for generation with decoder-only models.
        self.tokenizer.padding_side = "left"

        # --- Quantization config ---
        # 4-bit quantization is opt-in via --load-in-4bit flag only.
        # No automatic quantization is applied for any model.
        quantization_config = None
        if load_in_4bit:
            quantization_config = BitsAndBytesConfig(
                load_in_4bit=True,
                bnb_4bit_compute_dtype=torch.bfloat16,
            )

        # --- Model ---
        # device_map={'': device_index} ensures the full model lands on one
        # specific GPU, avoiding multi-GPU conflicts from device_map="auto".
        device_index = 0 if self.device == "cuda" else self.device
        self.model = AutoModelForCausalLM.from_pretrained(
            self.model_name_or_path,
            dtype=torch.bfloat16,
            device_map={"": device_index},
            quantization_config=quantization_config,
        )
        # Inference-only: disable dropout / training-mode behavior.
        self.model.eval()

    def generate(self, messages: list[dict]) -> str:
        """Run inference on a list of chat messages and return the raw output.

        Uses greedy decoding (do_sample=False) for deterministic, consistent
        JSON output across runs. Works with any HuggingFace causal LM.

        Parameters
        ----------
        messages : list[dict]
            List of {"role": ..., "content": ...} dicts.
            Should include system prompt and user message.

        Returns
        -------
        str
            Raw string output from the model with the input prompt stripped.
            Special tokens are removed for clean downstream parsing.

        Examples
        --------
        >>> model = UnBiasModel()  # doctest: +SKIP
        >>> msgs = [{"role": "user", "content": "..."}]  # doctest: +SKIP
        >>> output = model.generate(msgs)  # doctest: +SKIP
        >>> isinstance(output, str)  # doctest: +SKIP
        True
        """
        # Build template kwargs as a literal — only pass thinking args when
        # explicitly enabled so the code works with any HF model, not just Qwen3.
        # enable_thinking is always passed explicitly (even as False) so
        # Qwen3's jinja template doesn't fall back to its own default of True.
        template_kwargs: dict = {
            "tokenize": True,
            "add_generation_prompt": True,
            "return_tensors": "pt",
            "return_dict": True,
            "truncation": True,
            "max_length": MAX_SEQ_LENGTH,
            # Always set enable_thinking explicitly for Qwen3 models so the
            # jinja template respects our setting rather than its own default.
            # For non-Qwen3 models this key is simply ignored by the tokenizer.
            "enable_thinking": self.enable_thinking,
        }
        if self.enable_thinking:
            template_kwargs["thinking_budget"] = self.thinking_budget

        tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)

        # Move prompt tensors onto the same device as the model.
        input_ids = tokenized["input_ids"].to(self.device)
        attention_mask = tokenized["attention_mask"].to(self.device)

        # no_grad: inference only — skip autograd bookkeeping.
        with torch.no_grad():
            output_ids = self.model.generate(
                input_ids=input_ids,
                attention_mask=attention_mask,
                max_new_tokens=self.max_new_tokens,
                do_sample=False,  # greedy decoding — deterministic output
                temperature=None,  # must be None when do_sample=False
                top_p=None,  # must be None when do_sample=False
                pad_token_id=self.tokenizer.pad_token_id,
                eos_token_id=self.tokenizer.eos_token_id,
            )

        # Decode only the new tokens — strip the input prompt.
        # skip_special_tokens=True removes <|im_start|>, <|endoftext|> etc.
        # so the parser receives clean text without special token artifacts
        # that could corrupt JSON extraction.
        new_tokens = output_ids[0][input_ids.shape[-1] :]
        return str(self.tokenizer.decode(new_tokens, skip_special_tokens=True))

    def generate_stream(self, messages: list[dict]) -> Iterator[str]:
        """Stream raw token text for a list of chat messages.

        Runs model generation in a background thread and yields decoded
        token strings via a ``TextIteratorStreamer`` as they are produced.
        Uses the same greedy decoding settings as :meth:`generate`.

        Parameters
        ----------
        messages : list[dict]
            List of ``{"role": ..., "content": ...}`` dicts.

        Yields
        ------
        str
            Decoded token text, one chunk per model step.

        """
        # Same template handling as generate(): thinking args only when enabled.
        template_kwargs: dict = {
            "tokenize": True,
            "add_generation_prompt": True,
            "return_tensors": "pt",
            "return_dict": True,
            "truncation": True,
            "max_length": MAX_SEQ_LENGTH,
            "enable_thinking": self.enable_thinking,
        }
        if self.enable_thinking:
            template_kwargs["thinking_budget"] = self.thinking_budget

        tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)
        input_ids = tokenized["input_ids"].to(self.device)
        attention_mask = tokenized["attention_mask"].to(self.device)

        # skip_prompt=True: yield only newly generated text, not the echoed
        # input; skip_special_tokens matches generate()'s clean decoding.
        streamer = TextIteratorStreamer(
            self.tokenizer,
            skip_prompt=True,
            skip_special_tokens=True,
        )

        generate_kwargs = {
            "input_ids": input_ids,
            "attention_mask": attention_mask,
            "max_new_tokens": self.max_new_tokens,
            "do_sample": False,
            "temperature": None,
            "top_p": None,
            "pad_token_id": self.tokenizer.pad_token_id,
            "eos_token_id": self.tokenizer.eos_token_id,
            "streamer": streamer,
        }

        # Generation runs in a worker thread; the streamer hands decoded
        # chunks back to this generator. daemon=True keeps an abandoned
        # stream from blocking interpreter shutdown.
        thread = threading.Thread(
            target=self.model.generate,
            kwargs=generate_kwargs,
            daemon=True,
        )
        thread.start()

        try:
            for token_text in streamer:
                yield token_text
        finally:
            # Wait for the worker so its resources are released even if the
            # consumer stops iterating early.
            thread.join()
generate
generate(messages)

Run inference on a list of chat messages and return the raw output.

Uses greedy decoding (do_sample=False) for deterministic, consistent JSON output across runs. Works with any HuggingFace causal LM.

Parameters:

Name Type Description Default
messages list[dict]

List of {"role": ..., "content": ...} dicts. Should include system prompt and user message.

required

Returns:

Type Description
str

Raw string output from the model with the input prompt stripped. Special tokens are removed for clean downstream parsing.

Examples:

>>> model = UnBiasModel()
>>> msgs = [{"role": "user", "content": "..."}]
>>> output = model.generate(msgs)
>>> isinstance(output, str)
True
Source code in src/unbias_plus/model.py
def generate(self, messages: list[dict]) -> str:
    """Run inference on a list of chat messages and return the raw output.

    Uses greedy decoding (do_sample=False) for deterministic, consistent
    JSON output across runs. Works with any HuggingFace causal LM.

    Parameters
    ----------
    messages : list[dict]
        List of {"role": ..., "content": ...} dicts.
        Should include system prompt and user message.

    Returns
    -------
    str
        Raw string output from the model with the input prompt stripped.
        Special tokens are removed for clean downstream parsing.

    Examples
    --------
    >>> model = UnBiasModel()  # doctest: +SKIP
    >>> msgs = [{"role": "user", "content": "..."}]  # doctest: +SKIP
    >>> output = model.generate(msgs)  # doctest: +SKIP
    >>> isinstance(output, str)  # doctest: +SKIP
    True
    """
    # Build template kwargs as a literal — only pass thinking args when
    # explicitly enabled so the code works with any HF model, not just Qwen3.
    # enable_thinking is always passed explicitly (even as False) so
    # Qwen3's jinja template doesn't fall back to its own default of True.
    template_kwargs: dict = {
        "tokenize": True,
        "add_generation_prompt": True,
        "return_tensors": "pt",
        "return_dict": True,
        "truncation": True,
        "max_length": MAX_SEQ_LENGTH,
        # Always set enable_thinking explicitly for Qwen3 models so the
        # jinja template respects our setting rather than its own default.
        # For non-Qwen3 models this key is simply ignored by the tokenizer.
        "enable_thinking": self.enable_thinking,
    }
    if self.enable_thinking:
        template_kwargs["thinking_budget"] = self.thinking_budget

    tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)

    # Move prompt tensors onto the same device as the model.
    input_ids = tokenized["input_ids"].to(self.device)
    attention_mask = tokenized["attention_mask"].to(self.device)

    # no_grad: inference only — skip autograd bookkeeping.
    with torch.no_grad():
        output_ids = self.model.generate(
            input_ids=input_ids,
            attention_mask=attention_mask,
            max_new_tokens=self.max_new_tokens,
            do_sample=False,  # greedy decoding — deterministic output
            temperature=None,  # must be None when do_sample=False
            top_p=None,  # must be None when do_sample=False
            pad_token_id=self.tokenizer.pad_token_id,
            eos_token_id=self.tokenizer.eos_token_id,
        )

    # Decode only the new tokens — strip the input prompt.
    # skip_special_tokens=True removes <|im_start|>, <|endoftext|> etc.
    # so the parser receives clean text without special token artifacts
    # that could corrupt JSON extraction.
    new_tokens = output_ids[0][input_ids.shape[-1] :]
    return str(self.tokenizer.decode(new_tokens, skip_special_tokens=True))
generate_stream
generate_stream(messages)

Stream raw token text for a list of chat messages.

Runs model generation in a background thread and yields decoded token strings via a TextIteratorStreamer as they are produced. Uses the same greedy decoding settings as :meth:generate.

Parameters:

Name Type Description Default
messages list[dict]

List of {"role": ..., "content": ...} dicts.

required

Yields:

Type Description
str

Decoded token text, one chunk per model step.

Source code in src/unbias_plus/model.py
def generate_stream(self, messages: list[dict]) -> Iterator[str]:
    """Stream raw token text for a list of chat messages.

    Runs model generation in a background thread and yields decoded
    token strings via a ``TextIteratorStreamer`` as they are produced.
    Uses the same greedy decoding settings as :meth:`generate`.

    Parameters
    ----------
    messages : list[dict]
        List of ``{"role": ..., "content": ...}`` dicts.

    Yields
    ------
    str
        Decoded token text, one chunk per model step.

    """
    # Same template handling as generate(): thinking args only when enabled.
    template_kwargs: dict = {
        "tokenize": True,
        "add_generation_prompt": True,
        "return_tensors": "pt",
        "return_dict": True,
        "truncation": True,
        "max_length": MAX_SEQ_LENGTH,
        "enable_thinking": self.enable_thinking,
    }
    if self.enable_thinking:
        template_kwargs["thinking_budget"] = self.thinking_budget

    tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)
    input_ids = tokenized["input_ids"].to(self.device)
    attention_mask = tokenized["attention_mask"].to(self.device)

    # skip_prompt=True: yield only newly generated text, not the echoed input.
    streamer = TextIteratorStreamer(
        self.tokenizer,
        skip_prompt=True,
        skip_special_tokens=True,
    )

    generate_kwargs = {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "max_new_tokens": self.max_new_tokens,
        "do_sample": False,
        "temperature": None,
        "top_p": None,
        "pad_token_id": self.tokenizer.pad_token_id,
        "eos_token_id": self.tokenizer.eos_token_id,
        "streamer": streamer,
    }

    # Generation runs in a worker thread; the streamer hands decoded chunks
    # back to this generator as they are produced.
    thread = threading.Thread(
        target=self.model.generate,
        kwargs=generate_kwargs,
        daemon=True,
    )
    thread.start()

    try:
        for token_text in streamer:
            yield token_text
    finally:
        # Wait for the worker even if the consumer stops iterating early.
        thread.join()

parser

Parser for LLM JSON output into BiasResult objects.

parse_llm_output

parse_llm_output(raw_output)

Parse raw LLM output string into a BiasResult object.

Handles Qwen3 thinking blocks (`<think>...</think>`) as well as plain JSON output from any model. Attempts multiple strategies to extract and parse a JSON object from the raw LLM output, then validates it against the BiasResult schema.

Strategies (in order): 1. Strip thinking block if present (Qwen3 with enable_thinking=True) 2. Direct JSON parse of extracted block 3. Fix truncated strings (LLM cut off mid-output) 4. Fix missing commas between JSON items 5. Aggressive key-by-key extraction as last resort

Parameters:

Name Type Description Default
raw_output str

Raw string returned by the LLM, may include a thinking block, extra text, markdown code fences, or be truncated/malformed.

required

Returns:

Type Description
BiasResult

Validated and structured bias analysis result.

Raises:

Type Description
ValueError

If the output cannot be parsed as valid JSON or does not match the expected BiasResult schema after all repair attempts.

Examples:

>>> raw = '''
... {
...   "binary_label": "biased",
...   "severity": 3,
...   "bias_found": true,
...   "biased_segments": [],
...   "unbiased_text": "A neutral version."
... }
... '''
>>> result = parse_llm_output(raw)
>>> result.binary_label
'biased'
Source code in src/unbias_plus/parser.py
def parse_llm_output(raw_output: str) -> BiasResult:
    """Parse raw LLM output string into a BiasResult object.

    Handles Qwen3 thinking blocks (<think>...</think>) as well as
    plain JSON output from any model. Attempts multiple strategies
    to extract and parse a JSON object from the raw LLM output,
    then validates it against the BiasResult schema.

    Strategies (in order):
    1. Strip thinking block if present (Qwen3 with enable_thinking=True)
    2. Direct JSON parse of extracted block
    3. Fix truncated strings (LLM cut off mid-output)
    4. Fix missing commas between JSON items
    5. Aggressive key-by-key extraction as last resort

    Parameters
    ----------
    raw_output : str
        Raw string returned by the LLM, may include a thinking block,
        extra text, markdown code fences, or be truncated/malformed.

    Returns
    -------
    BiasResult
        Validated and structured bias analysis result.

    Raises
    ------
    ValueError
        If the output cannot be parsed as valid JSON or does
        not match the expected BiasResult schema after all
        repair attempts.

    Examples
    --------
    >>> raw = '''
    ... {
    ...   "binary_label": "biased",
    ...   "severity": 3,
    ...   "bias_found": true,
    ...   "biased_segments": [],
    ...   "unbiased_text": "A neutral version."
    ... }
    ... '''
    >>> result = parse_llm_output(raw)
    >>> result.binary_label
    'biased'

    """
    # Strip thinking block before any JSON extraction.
    # Works for all cases:
    #   - Qwen3 with thinking: removes <think>...</think>, leaves JSON
    #   - Qwen3 without thinking / any other model: no-op
    text = _strip_thinking_block(raw_output)

    cleaned = _extract_json(text)

    # Step 2: Strip thinking block from the extracted text.
    # Safe to call on any model — no-op if no thinking block present.
    # Runs after extraction so a <think> tag hallucinated after the JSON
    # never causes _strip_thinking_block to incorrectly empty the string.
    # NOTE(review): the pre-extraction strip above appears to undercut this
    # rationale — a <think> tag after the JSON would already have been seen
    # by the first pass. Presumably _strip_thinking_block handles unmatched
    # tags safely; confirm against its implementation.
    text = _strip_thinking_block(cleaned)

    # Strategy 1: Direct parse
    data = _try_parse(text)

    # Strategy 2: Fix truncated JSON (most common LLM failure)
    if data is None:
        data = _try_parse(_fix_truncated_json(text))

    # Strategy 3: Fix missing commas
    if data is None:
        data = _try_parse(_fix_missing_commas(text))

    # Strategy 4: Fix truncated + missing commas combined
    if data is None:
        data = _try_parse(_fix_missing_commas(_fix_truncated_json(text)))

    # Strategy 5: Regex-based field extraction (last resort)
    if data is None:
        data = _extract_fields_by_regex(text)

    if data is None:
        raise ValueError(
            f"LLM output could not be parsed as JSON after all repair attempts.\n"
            f"Raw output:\n{raw_output}"
        )

    # Deduplicate segments with the same original phrase before schema validation
    if "biased_segments" in data and isinstance(data["biased_segments"], list):
        data["biased_segments"] = _deduplicate_segments(data["biased_segments"])
        data["biased_segments"] = _remove_contained_segments(data["biased_segments"])

    try:
        return BiasResult(**data)
    except Exception as e:
        # Wrap pydantic's validation error in a ValueError so callers only
        # need to catch one exception type from this parser.
        raise ValueError(
            f"LLM JSON does not match expected schema.\nData: {data}\nError: {e}"
        ) from e

pipeline

Main pipeline for unbias-plus.

UnBiasPlus

Main pipeline for bias detection and debiasing.

Loads a fine-tuned LLM and exposes a simple interface for analyzing text for bias. Combines prompt building, inference, JSON parsing, offset computation, and formatting.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the fine-tuned model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.

DEFAULT_MODEL
device str | None

Device to run on ('cuda' or 'cpu'). Auto-detected if None.

None
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
max_new_tokens int

Maximum tokens to generate. Default is 4096.

4096

Examples:

>>> from unbias_plus import UnBiasPlus
>>> pipe = UnBiasPlus()
>>> result = pipe.analyze("Women are too emotional to lead.")
>>> print(result.binary_label)
biased
Source code in src/unbias_plus/pipeline.py
class UnBiasPlus:
    """Main pipeline for bias detection and debiasing.

    Loads a fine-tuned LLM and exposes a simple interface for
    analyzing text for bias. Orchestrates prompt building,
    inference, JSON parsing, offset computation, and formatting.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the fine-tuned
        model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.
    device : str | None, optional
        Device to run on ('cuda' or 'cpu'). Auto-detected if None.
    load_in_4bit : bool, optional
        Load model in 4-bit quantization. Default is False.
    max_new_tokens : int, optional
        Maximum tokens to generate. Default is 4096.

    Examples
    --------
    >>> from unbias_plus import UnBiasPlus  # doctest: +SKIP
    >>> pipe = UnBiasPlus()  # doctest: +SKIP
    >>> result = pipe.analyze("Women are too emotional to lead.")  # doctest: +SKIP
    >>> print(result.binary_label)  # doctest: +SKIP
    biased

    """

    def __init__(
        self,
        model_name_or_path: str | Path = DEFAULT_MODEL,
        device: str | None = None,
        load_in_4bit: bool = False,
        max_new_tokens: int = 4096,
    ) -> None:
        # One shared model instance backs every analyze_* call.
        self._model = UnBiasModel(
            model_name_or_path=model_name_or_path,
            device=device,
            load_in_4bit=load_in_4bit,
            max_new_tokens=max_new_tokens,
        )

    def analyze(self, text: str) -> BiasResult:
        """Analyze input text for bias.

        Full pipeline: build chat messages, run inference, parse the
        model's JSON output, compute character offsets for each
        segment, and attach the original text to the result.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        BiasResult
            Structured bias result with start/end offsets on each
            segment and original_text populated.

        Raises
        ------
        ValueError
            If the LLM output cannot be parsed into a valid BiasResult.

        Examples
        --------
        >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
        >>> result.bias_found  # doctest: +SKIP
        True

        """
        raw_output = self._model.generate(build_messages(text))
        parsed = parse_llm_output(raw_output)

        # Character-level offsets let a frontend highlight each segment
        # within the original text.
        located_segments = compute_offsets(text, parsed.biased_segments)

        return parsed.model_copy(
            update={
                "biased_segments": located_segments,
                "original_text": text,
            }
        )

    def analyze_to_cli(self, text: str) -> str:
        """Analyze text and return a formatted CLI string.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Human-readable colored string for terminal display.

        """
        result = self.analyze(text)
        return format_cli(result)

    def analyze_to_dict(self, text: str) -> dict:
        """Analyze text and return result as a plain dictionary.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        dict
            Plain dictionary representation of the result.

        """
        result = self.analyze(text)
        return format_dict(result)

    def analyze_to_json(self, text: str) -> str:
        """Analyze text and return result as a JSON string.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Pretty-printed JSON string of the result.

        """
        result = self.analyze(text)
        return format_json(result)
analyze
analyze(text)

Analyze input text for bias.

Runs the full pipeline: builds chat messages, runs inference, parses JSON output, computes character offsets for each segment, and attaches the original text to the result.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
BiasResult

Structured bias result with start/end offsets on each segment and original_text populated.

Raises:

Type Description
ValueError

If the LLM output cannot be parsed into a valid BiasResult.

Examples:

>>> result = pipe.analyze("All politicians are liars.")
>>> result.bias_found
True
Source code in src/unbias_plus/pipeline.py
def analyze(self, text: str) -> BiasResult:
    """Analyze input text for bias.

    Builds the chat prompt, runs inference, parses the JSON
    answer, computes character offsets for every segment, and
    attaches the original text to the result.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    BiasResult
        Structured bias result with start/end offsets on each
        segment and original_text populated.

    Raises
    ------
    ValueError
        If the LLM output cannot be parsed into a valid BiasResult.

    Examples
    --------
    >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
    >>> result.bias_found  # doctest: +SKIP
    True

    """
    chat = build_messages(text)
    parsed = parse_llm_output(self._model.generate(chat))

    # Character-level offsets let a frontend highlight each span.
    located = compute_offsets(text, parsed.biased_segments)

    return parsed.model_copy(
        update={"biased_segments": located, "original_text": text}
    )
analyze_to_cli
analyze_to_cli(text)

Analyze text and return a formatted CLI string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Human-readable colored string for terminal display.

Source code in src/unbias_plus/pipeline.py
def analyze_to_cli(self, text: str) -> str:
    """Analyze text and render the result for the terminal.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Human-readable colored string for terminal display.

    """
    result = self.analyze(text)
    return format_cli(result)
analyze_to_dict
analyze_to_dict(text)

Analyze text and return result as a plain dictionary.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
dict

Plain dictionary representation of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_dict(self, text: str) -> dict:
    """Analyze text and return the result as a plain dict.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    dict
        Plain dictionary representation of the result.

    """
    result = self.analyze(text)
    return format_dict(result)
analyze_to_json
analyze_to_json(text)

Analyze text and return result as a JSON string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Pretty-printed JSON string of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_json(self, text: str) -> str:
    """Analyze text and return the result serialized as JSON.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Pretty-printed JSON string of the result.

    """
    result = self.analyze(text)
    return format_json(result)

prompt

Prompt templates for the unbias-plus LLM.

build_messages

build_messages(text)

Build the chat messages list for the LLM given input text.

Formats the system prompt and user text into the messages format required by the model's chat template.

Parameters:

Name Type Description Default
text str

The input text to analyze for bias.

required

Returns:

Type Description
list[dict]

List of {"role": ..., "content": ...} dicts ready for tokenizer.apply_chat_template().

Examples:

>>> messages = build_messages("Women are too emotional to lead.")
>>> messages[0]["role"]
'system'
>>> messages[1]["role"]
'user'
>>> "Women are too emotional to lead." in messages[1]["content"]
True
Source code in src/unbias_plus/prompt.py
def build_messages(text: str) -> list[dict]:
    """Build the chat messages list for the LLM given input text.

    Wraps the system prompt and the user's article into the
    role/content message structure expected by the model's chat
    template.

    Parameters
    ----------
    text : str
        The input text to analyze for bias.

    Returns
    -------
    list[dict]
        List of {"role": ..., "content": ...} dicts ready for
        tokenizer.apply_chat_template().

    Examples
    --------
    >>> messages = build_messages("Women are too emotional to lead.")
    >>> messages[0]["role"]
    'system'
    >>> messages[1]["role"]
    'user'
    >>> "Women are too emotional to lead." in messages[1]["content"]
    True
    """
    user_content = (
        "Analyze the following article for bias and return the result "
        "in the required JSON format.\n\n"
        f"ARTICLE:\n{text}"
    )
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": user_content},
    ]

schema

Data schemas for unbias-plus output.

BiasedSegment

Bases: BaseModel

A single biased segment detected in the text.

Attributes:

Name Type Description
original str

The original biased phrase from the input text.

replacement str

The suggested neutral replacement. Defaults to empty string if the model omits it (e.g. under 4-bit quantization).

severity str

Severity level: 'low', 'medium', or 'high'. Defaults to 'medium' if omitted by the model.

bias_type str

Type of bias (e.g. 'loaded language', 'framing bias').

reasoning str

Explanation of why this segment is considered biased.

start int | None

Character offset start in the original text. Computed by the pipeline after parsing.

end int | None

Character offset end in the original text. Computed by the pipeline after parsing.

Examples:

>>> seg = BiasedSegment(
...     original="Sharia-obsessed fanatics",
...     replacement="extremist groups",
...     severity="high",
...     bias_type="dehumanizing framing",
...     reasoning="Uses inflammatory religious language.",
... )
>>> seg.severity
'high'
Source code in src/unbias_plus/schema.py
class BiasedSegment(BaseModel):
    """One biased span detected in the input text.

    Attributes
    ----------
    original : str
        The biased phrase exactly as it appears in the input.
    replacement : str
        Suggested neutral rewording. Empty string when the model
        omits it (seen e.g. under 4-bit quantization).
    severity : str
        'low', 'medium', or 'high'; anything else is normalised
        to 'medium' by the validator.
    bias_type : str
        Category of bias (e.g. 'loaded language', 'framing bias').
    reasoning : str
        Why the segment was flagged as biased.
    start : int | None
        Character offset of the span start in the original text,
        filled in by the pipeline after parsing.
    end : int | None
        Character offset of the span end in the original text,
        filled in by the pipeline after parsing.

    Examples
    --------
    >>> seg = BiasedSegment(
    ...     original="Sharia-obsessed fanatics",
    ...     replacement="extremist groups",
    ...     severity="high",
    ...     bias_type="dehumanizing framing",
    ...     reasoning="Uses inflammatory religious language.",
    ... )
    >>> seg.severity
    'high'

    """

    original: str
    # All of these may be omitted by the model; defaults keep parsing robust.
    replacement: str = ""
    severity: str = "medium"
    bias_type: str = ""
    reasoning: str = ""
    start: int | None = None
    end: int | None = None

    @field_validator("severity")
    @classmethod
    def validate_severity(cls, v: str) -> str:
        """Normalise segment severity to one of low/medium/high."""
        normalized = v.strip().lower()
        if normalized in {"low", "medium", "high"}:
            return normalized
        logger.warning(
            "Unexpected segment severity '%s', defaulting to 'medium'", v
        )
        return "medium"
validate_severity classmethod
validate_severity(v)

Validate and normalise segment severity to low/medium/high.

Source code in src/unbias_plus/schema.py
@field_validator("severity")
@classmethod
def validate_severity(cls, v: str) -> str:
    """Normalise segment severity to one of low/medium/high."""
    normalized = v.strip().lower()
    if normalized in {"low", "medium", "high"}:
        return normalized
    logger.warning(
        "Unexpected segment severity '%s', defaulting to 'medium'", v
    )
    return "medium"

BiasResult

Bases: BaseModel

Full bias analysis result for an input text.

Attributes:

Name Type Description
binary_label str

Overall label: 'biased' or 'unbiased'.

severity int

Overall severity score: 0 = neutral / no bias, 2 = recurring biased framing, 3 = strong persuasive tone, 4 = inflammatory rhetoric. Level 1 is not part of the scale; an integer 1 is clamped up to 2. If the model returns a string ('low', 'medium', 'high'), it is coerced to the nearest integer value.

bias_found bool

Whether any bias was detected in the text.

biased_segments list[BiasedSegment]

List of biased segments found in the text, each with character-level start/end offsets.

unbiased_text str

Full neutral rewrite of the input text.

original_text str | None

The original input text. Set by the pipeline.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="A neutral version of the text.",
... )
>>> result.binary_label
'biased'
Source code in src/unbias_plus/schema.py
class BiasResult(BaseModel):
    """Complete bias analysis for one input text.

    Attributes
    ----------
    binary_label : str
        'biased' or 'unbiased'.
    severity : int
        Overall severity: 0 (neutral / no bias),
        2 (recurring biased framing), 3 (strong persuasive tone),
        or 4 (inflammatory rhetoric). String values from the
        model ('low', 'medium', 'high') are coerced to the
        nearest integer.
    bias_found : bool
        True when any bias was detected.
    biased_segments : list[BiasedSegment]
        Detected segments, each carrying character-level
        start/end offsets.
    unbiased_text : str
        Neutral rewrite of the full input.
    original_text : str | None
        The input text itself, attached by the pipeline.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="A neutral version of the text.",
    ... )
    >>> result.binary_label
    'biased'

    """

    binary_label: str
    severity: int
    bias_found: bool
    biased_segments: list[BiasedSegment]
    unbiased_text: str
    original_text: str | None = None

    @field_validator("binary_label")
    @classmethod
    def validate_binary_label(cls, v: str) -> str:
        """Validate binary_label is 'biased' or 'unbiased'."""
        allowed = {"biased", "unbiased"}
        normalized = v.strip().lower()
        if normalized in allowed:
            return normalized
        raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")

    @field_validator("severity", mode="before")
    @classmethod
    def validate_severity(cls, v: int | str) -> int:
        """Coerce global severity onto the {0, 2, 3, 4} scale.

        Handles int input (clamped to the nearest valid value) as
        well as string input ('low'/'medium'/'high'/'none' or a
        numeric string) from a model that confused the global and
        per-segment severity scales.
        """
        if isinstance(v, str):
            key = v.lower().strip()
            coerced = _STR_TO_INT_SEVERITY.get(key)
            if coerced is not None:
                logger.warning(
                    "Global severity returned as string '%s', coerced to %d",
                    v,
                    coerced,
                )
                return coerced
            # Not a known label — maybe a numeric string like "3".
            try:
                v = int(v)
            except ValueError:
                logger.warning("Unrecognized severity '%s', defaulting to 2", v)
                return 2

        # Integers: clamp gracefully onto the valid scale.
        if v <= 0:
            return 0
        if v == 1:
            return 2
        return v if v in {2, 3, 4} else 4
validate_binary_label classmethod
validate_binary_label(v)

Validate binary_label is 'biased' or 'unbiased'.

Source code in src/unbias_plus/schema.py
@field_validator("binary_label")
@classmethod
def validate_binary_label(cls, v: str) -> str:
    """Validate binary_label is 'biased' or 'unbiased'."""
    allowed = {"biased", "unbiased"}
    normalized = v.strip().lower()
    if normalized in allowed:
        return normalized
    raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")
validate_severity classmethod
validate_severity(v)

Coerce and validate global severity.

Accepts: - int 0, 2, 3, 4 (correct model output) - str 'low', 'medium', 'high', 'none' (model confused scales) - any other int (clamped to nearest valid value)

Source code in src/unbias_plus/schema.py
@field_validator("severity", mode="before")
@classmethod
def validate_severity(cls, v: int | str) -> int:
    """Coerce global severity onto the {0, 2, 3, 4} scale.

    Handles int input (clamped to the nearest valid value) as
    well as string input ('low'/'medium'/'high'/'none' or a
    numeric string) from a model that confused the global and
    per-segment severity scales.
    """
    if isinstance(v, str):
        key = v.lower().strip()
        coerced = _STR_TO_INT_SEVERITY.get(key)
        if coerced is not None:
            logger.warning(
                "Global severity returned as string '%s', coerced to %d",
                v,
                coerced,
            )
            return coerced
        # Not a known label — maybe a numeric string like "3".
        try:
            v = int(v)
        except ValueError:
            logger.warning("Unrecognized severity '%s', defaulting to 2", v)
            return 2

    # Integers: clamp gracefully onto the valid scale.
    if v <= 0:
        return 0
    if v == 1:
        return 2
    return v if v in {2, 3, 4} else 4

compute_offsets

compute_offsets(original_text, segments)

Compute character start/end offsets for each biased segment.

Walks the original text with a cursor so that duplicate phrases are matched in order of appearance, not just the first occurrence.

Parameters:

Name Type Description Default
original_text str

The original input text.

required
segments list[BiasedSegment]

Parsed segments from the LLM (without offsets).

required

Returns:

Type Description
list[BiasedSegment]

Segments with start/end fields populated, sorted by start offset.

Source code in src/unbias_plus/schema.py
def compute_offsets(
    original_text: str, segments: list[BiasedSegment]
) -> list[BiasedSegment]:
    """Compute character start/end offsets for each biased segment.

    A cursor advances through the original text so that repeated
    phrases are matched in order of appearance rather than always
    at the first occurrence.

    Parameters
    ----------
    original_text : str
        The original input text.
    segments : list[BiasedSegment]
        Parsed segments from the LLM (without offsets).

    Returns
    -------
    list[BiasedSegment]
        Segments with start/end fields populated, sorted by start offset.

    """
    located = []
    search_from = 0

    for segment in segments:
        phrase = segment.original
        if not phrase:
            continue

        # Search after the cursor first; fall back to a full-text scan.
        start = _find_case_insensitive(original_text, phrase, search_from)
        if start == -1:
            start = _find_case_insensitive(original_text, phrase, 0)

        if start == -1:
            logger.warning("Could not find segment in text: '%s'", phrase)
            located.append(segment)
            continue

        end = start + len(phrase)
        located.append(segment.model_copy(update={"start": start, "end": end}))
        search_from = end

    return sorted(located, key=lambda s: 0 if s.start is None else s.start)

Pipeline

unbias_plus.pipeline

Main pipeline for unbias-plus.

UnBiasPlus

Main pipeline for bias detection and debiasing.

Loads a fine-tuned LLM and exposes a simple interface for analyzing text for bias. Combines prompt building, inference, JSON parsing, offset computation, and formatting.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the fine-tuned model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.

DEFAULT_MODEL
device str | None

Device to run on ('cuda' or 'cpu'). Auto-detected if None.

None
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
max_new_tokens int

Maximum tokens to generate. Default is 4096.

4096

Examples:

>>> from unbias_plus import UnBiasPlus
>>> pipe = UnBiasPlus()
>>> result = pipe.analyze("Women are too emotional to lead.")
>>> print(result.binary_label)
biased
Source code in src/unbias_plus/pipeline.py
class UnBiasPlus:
    """Bias detection and debiasing pipeline built on a single LLM.

    Wraps a fine-tuned causal LM behind a small API: prompt
    construction, inference, JSON parsing, character-offset
    computation, and output formatting are combined into one
    ``analyze`` call, with convenience formatters layered on top.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the fine-tuned
        model. Defaults to 'vector-institute/Qwen3-4B-UnBias-Plus-SFT'.
    device : str | None, optional
        Device to run on ('cuda' or 'cpu'). Auto-detected if None.
    load_in_4bit : bool, optional
        Load model in 4-bit quantization. Default is False.
    max_new_tokens : int, optional
        Maximum tokens to generate. Default is 4096.

    Examples
    --------
    >>> from unbias_plus import UnBiasPlus  # doctest: +SKIP
    >>> pipe = UnBiasPlus()  # doctest: +SKIP
    >>> result = pipe.analyze("Women are too emotional to lead.")  # doctest: +SKIP
    >>> print(result.binary_label)  # doctest: +SKIP
    biased

    """

    def __init__(
        self,
        model_name_or_path: str | Path = DEFAULT_MODEL,
        device: str | None = None,
        load_in_4bit: bool = False,
        max_new_tokens: int = 4096,
    ) -> None:
        # Weights, tokenizer, and generation live in the model wrapper;
        # the pipeline itself holds no other state.
        self._model = UnBiasModel(
            model_name_or_path=model_name_or_path,
            device=device,
            load_in_4bit=load_in_4bit,
            max_new_tokens=max_new_tokens,
        )

    def analyze(self, text: str) -> BiasResult:
        """Analyze input text for bias.

        Builds the chat prompt, runs inference, parses the JSON
        answer, computes character offsets for every segment, and
        attaches the original text to the result.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        BiasResult
            Structured bias result with start/end offsets on each
            segment and original_text populated.

        Raises
        ------
        ValueError
            If the LLM output cannot be parsed into a valid BiasResult.

        Examples
        --------
        >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
        >>> result.bias_found  # doctest: +SKIP
        True

        """
        chat = build_messages(text)
        parsed = parse_llm_output(self._model.generate(chat))

        # Character-level offsets let a frontend highlight each span.
        located = compute_offsets(text, parsed.biased_segments)

        return parsed.model_copy(
            update={"biased_segments": located, "original_text": text}
        )

    def analyze_to_cli(self, text: str) -> str:
        """Analyze text and render the result for the terminal.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Human-readable colored string for terminal display.

        """
        result = self.analyze(text)
        return format_cli(result)

    def analyze_to_dict(self, text: str) -> dict:
        """Analyze text and return the result as a plain dict.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        dict
            Plain dictionary representation of the result.

        """
        result = self.analyze(text)
        return format_dict(result)

    def analyze_to_json(self, text: str) -> str:
        """Analyze text and return the result serialized as JSON.

        Parameters
        ----------
        text : str
            The input text to analyze.

        Returns
        -------
        str
            Pretty-printed JSON string of the result.

        """
        result = self.analyze(text)
        return format_json(result)

analyze

analyze(text)

Analyze input text for bias.

Runs the full pipeline: builds chat messages, runs inference, parses JSON output, computes character offsets for each segment, and attaches the original text to the result.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
BiasResult

Structured bias result with start/end offsets on each segment and original_text populated.

Raises:

Type Description
ValueError

If the LLM output cannot be parsed into a valid BiasResult.

Examples:

>>> result = pipe.analyze("All politicians are liars.")
>>> result.bias_found
True
Source code in src/unbias_plus/pipeline.py
def analyze(self, text: str) -> BiasResult:
    """Analyze input text for bias.

    Builds the chat prompt, runs inference, parses the JSON
    answer, computes character offsets for every segment, and
    attaches the original text to the result.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    BiasResult
        Structured bias result with start/end offsets on each
        segment and original_text populated.

    Raises
    ------
    ValueError
        If the LLM output cannot be parsed into a valid BiasResult.

    Examples
    --------
    >>> result = pipe.analyze("All politicians are liars.")  # doctest: +SKIP
    >>> result.bias_found  # doctest: +SKIP
    True

    """
    chat = build_messages(text)
    parsed = parse_llm_output(self._model.generate(chat))

    # Character-level offsets let a frontend highlight each span.
    located = compute_offsets(text, parsed.biased_segments)

    return parsed.model_copy(
        update={"biased_segments": located, "original_text": text}
    )

analyze_to_cli

analyze_to_cli(text)

Analyze text and return a formatted CLI string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Human-readable colored string for terminal display.

Source code in src/unbias_plus/pipeline.py
def analyze_to_cli(self, text: str) -> str:
    """Analyze text and render the result for the terminal.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Human-readable colored string for terminal display.

    """
    result = self.analyze(text)
    return format_cli(result)

analyze_to_dict

analyze_to_dict(text)

Analyze text and return result as a plain dictionary.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
dict

Plain dictionary representation of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_dict(self, text: str) -> dict:
    """Analyze text and return the result as a plain dict.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    dict
        Plain dictionary representation of the result.

    """
    result = self.analyze(text)
    return format_dict(result)

analyze_to_json

analyze_to_json(text)

Analyze text and return result as a JSON string.

Parameters:

Name Type Description Default
text str

The input text to analyze.

required

Returns:

Type Description
str

Pretty-printed JSON string of the result.

Source code in src/unbias_plus/pipeline.py
def analyze_to_json(self, text: str) -> str:
    """Analyze text and return the result serialized as JSON.

    Parameters
    ----------
    text : str
        The input text to analyze.

    Returns
    -------
    str
        Pretty-printed JSON string of the result.

    """
    result = self.analyze(text)
    return format_json(result)

Model

unbias_plus.model

LLM model loader and inference for unbias-plus.

UnBiasModel

Loads and runs the fine-tuned bias detection LLM.

Wraps a HuggingFace causal LM with a simple generate() interface. Compatible with any HuggingFace causal LM — thinking mode is opt-in for Qwen3 models only.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the model. Defaults to 'vector-institute/Qwen3-8B-UnBias-Plus-SFT'.

DEFAULT_MODEL
device str | None

Device to run on ('cuda' or 'cpu'). Auto-detects if not provided.

None
load_in_4bit bool

Load model in 4-bit quantization via bitsandbytes. Reduces VRAM to ~3GB (4B) or ~5GB (8B). Default is False.

False
max_new_tokens int

Maximum number of new tokens to generate. Default 2048.

2048
enable_thinking bool

Enable Qwen3 chain-of-thought thinking mode. Only supported by Qwen3 models — do not set for other models. Default is False.

False
thinking_budget int

Maximum tokens allocated to the thinking block when enable_thinking=True. Default is 512.

512

Examples:

>>> model = UnBiasModel()
>>> raw = model.generate([{"role": "user", "content": "..."}])
>>> isinstance(raw, str)
True
Source code in src/unbias_plus/model.py
class UnBiasModel:
    """Loads and runs the fine-tuned bias detection LLM.

    Wraps a HuggingFace causal LM with a simple generate()
    interface. Compatible with any HuggingFace causal LM —
    thinking mode is opt-in for Qwen3 models only.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the model.
        Defaults to 'vector-institute/Qwen3-8B-UnBias-Plus-SFT'.
    device : str | None, optional
        Device to run on ('cuda' or 'cpu').
        Auto-detects if not provided.
    load_in_4bit : bool, optional
        Load model in 4-bit quantization via bitsandbytes.
        Reduces VRAM to ~3GB (4B) or ~5GB (8B). Default is False.
    max_new_tokens : int, optional
        Maximum number of new tokens to generate. Default 2048.
    enable_thinking : bool, optional
        Enable Qwen3 chain-of-thought thinking mode. Only supported
        by Qwen3 models — do not set for other models. Default is False.
    thinking_budget : int, optional
        Maximum tokens allocated to the thinking block when
        enable_thinking=True. Default is 512.

    Examples
    --------
    >>> model = UnBiasModel()  # doctest: +SKIP
    >>> raw = model.generate([{"role": "user", "content": "..."}])  # doctest: +SKIP
    >>> isinstance(raw, str)  # doctest: +SKIP
    True
    """

    def __init__(
        self,
        model_name_or_path: str | Path = DEFAULT_MODEL,
        device: str | None = None,
        load_in_4bit: bool = False,
        max_new_tokens: int = 2048,
        enable_thinking: bool = False,
        thinking_budget: int = 512,
    ) -> None:
        self.model_name_or_path = str(model_name_or_path)
        self.max_new_tokens = max_new_tokens
        self.enable_thinking = enable_thinking
        self.thinking_budget = thinking_budget
        self.device = device or ("cuda" if torch.cuda.is_available() else "cpu")

        # --- Tokenizer ---
        self.tokenizer = AutoTokenizer.from_pretrained(self.model_name_or_path)
        # Some checkpoints ship without a pad token; fall back to EOS so
        # padded batches can still be built.
        if self.tokenizer.pad_token is None:
            self.tokenizer.pad_token = self.tokenizer.eos_token
            self.tokenizer.pad_token_id = self.tokenizer.eos_token_id
        # Left padding keeps the prompt at the sequence end, which is the
        # standard setup for decoder-only generation.
        self.tokenizer.padding_side = "left"

        # --- Quantization config ---
        # 4-bit quantization is opt-in via --load-in-4bit flag only.
        # No automatic quantization is applied for any model.
        quantization_config = None
        if load_in_4bit:
            quantization_config = BitsAndBytesConfig(
                load_in_4bit=True,
                bnb_4bit_compute_dtype=torch.bfloat16,
            )

        # --- Model ---
        # device_map={'': device_index} ensures the full model lands on one
        # specific GPU, avoiding multi-GPU conflicts from device_map="auto".
        device_index = 0 if self.device == "cuda" else self.device
        self.model = AutoModelForCausalLM.from_pretrained(
            self.model_name_or_path,
            dtype=torch.bfloat16,
            device_map={"": device_index},
            quantization_config=quantization_config,
        )
        # Inference mode: disables dropout and other train-time behavior.
        self.model.eval()

    def generate(self, messages: list[dict]) -> str:
        """Run inference on a list of chat messages and return the raw output.

        Uses greedy decoding (do_sample=False) for deterministic, consistent
        JSON output across runs. Works with any HuggingFace causal LM.

        Parameters
        ----------
        messages : list[dict]
            List of {"role": ..., "content": ...} dicts.
            Should include system prompt and user message.

        Returns
        -------
        str
            Raw string output from the model with the input prompt stripped.
            Special tokens are removed for clean downstream parsing.

        Examples
        --------
        >>> model = UnBiasModel()  # doctest: +SKIP
        >>> msgs = [{"role": "user", "content": "..."}]  # doctest: +SKIP
        >>> output = model.generate(msgs)  # doctest: +SKIP
        >>> isinstance(output, str)  # doctest: +SKIP
        True
        """
        # Build template kwargs as a literal — only pass thinking args when
        # explicitly enabled so the code works with any HF model, not just Qwen3.
        # enable_thinking is always passed explicitly (even as False) so
        # Qwen3's jinja template doesn't fall back to its own default of True.
        template_kwargs: dict = {
            "tokenize": True,
            "add_generation_prompt": True,
            "return_tensors": "pt",
            "return_dict": True,
            "truncation": True,
            "max_length": MAX_SEQ_LENGTH,
            # Always set enable_thinking explicitly for Qwen3 models so the
            # jinja template respects our setting rather than its own default.
            # For non-Qwen3 models this key is simply ignored by the tokenizer.
            "enable_thinking": self.enable_thinking,
        }
        if self.enable_thinking:
            template_kwargs["thinking_budget"] = self.thinking_budget

        tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)

        input_ids = tokenized["input_ids"].to(self.device)
        attention_mask = tokenized["attention_mask"].to(self.device)

        with torch.no_grad():
            output_ids = self.model.generate(
                input_ids=input_ids,
                attention_mask=attention_mask,
                max_new_tokens=self.max_new_tokens,
                do_sample=False,  # greedy decoding — deterministic output
                temperature=None,  # must be None when do_sample=False
                top_p=None,  # must be None when do_sample=False
                pad_token_id=self.tokenizer.pad_token_id,
                eos_token_id=self.tokenizer.eos_token_id,
            )

        # Decode only the new tokens — strip the input prompt.
        # skip_special_tokens=True removes <|im_start|>, <|endoftext|> etc.
        # so the parser receives clean text without special token artifacts
        # that could corrupt JSON extraction.
        new_tokens = output_ids[0][input_ids.shape[-1] :]
        return str(self.tokenizer.decode(new_tokens, skip_special_tokens=True))

    def generate_stream(self, messages: list[dict]) -> Iterator[str]:
        """Stream raw token text for a list of chat messages.

        Runs model generation in a background thread and yields decoded
        token strings via a ``TextIteratorStreamer`` as they are produced.
        Uses the same greedy decoding settings as :meth:`generate`.

        Parameters
        ----------
        messages : list[dict]
            List of ``{"role": ..., "content": ...}`` dicts.

        Yields
        ------
        str
            Decoded token text, one chunk per model step.

        """
        # Same template kwargs as generate() — see the comments there for
        # why enable_thinking is always passed explicitly.
        template_kwargs: dict = {
            "tokenize": True,
            "add_generation_prompt": True,
            "return_tensors": "pt",
            "return_dict": True,
            "truncation": True,
            "max_length": MAX_SEQ_LENGTH,
            "enable_thinking": self.enable_thinking,
        }
        if self.enable_thinking:
            template_kwargs["thinking_budget"] = self.thinking_budget

        tokenized = self.tokenizer.apply_chat_template(messages, **template_kwargs)
        input_ids = tokenized["input_ids"].to(self.device)
        attention_mask = tokenized["attention_mask"].to(self.device)

        # The streamer is the channel between the generation thread and this
        # generator: it yields decoded text chunks as tokens are produced.
        streamer = TextIteratorStreamer(
            self.tokenizer,
            skip_prompt=True,
            skip_special_tokens=True,
        )

        generate_kwargs = {
            "input_ids": input_ids,
            "attention_mask": attention_mask,
            "max_new_tokens": self.max_new_tokens,
            "do_sample": False,
            "temperature": None,
            "top_p": None,
            "pad_token_id": self.tokenizer.pad_token_id,
            "eos_token_id": self.tokenizer.eos_token_id,
            "streamer": streamer,
        }

        # Generation runs in a daemon thread so this generator can yield
        # chunks while the model is still decoding.
        thread = threading.Thread(
            target=self.model.generate,
            kwargs=generate_kwargs,
            daemon=True,
        )
        thread.start()

        try:
            for token_text in streamer:
                yield token_text
        finally:
            # NOTE(review): this join waits for the full generation to finish
            # even if the consumer abandons the iterator early — there is no
            # cancellation signal to the generation thread.
            thread.join()

generate

generate(messages)

Run inference on a list of chat messages and return the raw output.

Uses greedy decoding (do_sample=False) for deterministic, consistent JSON output across runs. Works with any HuggingFace causal LM.

Parameters:

Name Type Description Default
messages list[dict]

List of {"role": ..., "content": ...} dicts. Should include system prompt and user message.

required

Returns:

Type Description
str

Raw string output from the model with the input prompt stripped. Special tokens are removed for clean downstream parsing.

Examples:

>>> model = UnBiasModel()
>>> msgs = [{"role": "user", "content": "..."}]
>>> output = model.generate(msgs)
>>> isinstance(output, str)
True
Source code in src/unbias_plus/model.py
def generate(self, messages: list[dict]) -> str:
    """Run inference on chat messages and return the model's raw text.

    Decoding is greedy (``do_sample=False``) so repeated runs over the
    same input produce identical output — important for consistent JSON.
    Works with any HuggingFace causal LM.

    Parameters
    ----------
    messages : list[dict]
        Chat turns as {"role": ..., "content": ...} dicts.
        Should include system prompt and user message.

    Returns
    -------
    str
        Newly generated text only — the prompt tokens are dropped and
        special tokens are stripped for clean downstream parsing.

    Examples
    --------
    >>> model = UnBiasModel()  # doctest: +SKIP
    >>> msgs = [{"role": "user", "content": "..."}]  # doctest: +SKIP
    >>> output = model.generate(msgs)  # doctest: +SKIP
    >>> isinstance(output, str)  # doctest: +SKIP
    True
    """
    # enable_thinking is passed explicitly (even when False) so Qwen3's
    # jinja template honours our setting instead of falling back to its
    # own default of True. Non-Qwen3 tokenizers simply ignore the key.
    chat_args: dict = dict(
        tokenize=True,
        add_generation_prompt=True,
        return_tensors="pt",
        return_dict=True,
        truncation=True,
        max_length=MAX_SEQ_LENGTH,
        enable_thinking=self.enable_thinking,
    )
    # thinking_budget is only meaningful when thinking is enabled;
    # passing it unconditionally could confuse other chat templates.
    if self.enable_thinking:
        chat_args["thinking_budget"] = self.thinking_budget

    encoded = self.tokenizer.apply_chat_template(messages, **chat_args)
    ids = encoded["input_ids"].to(self.device)
    mask = encoded["attention_mask"].to(self.device)

    with torch.no_grad():
        out = self.model.generate(
            input_ids=ids,
            attention_mask=mask,
            max_new_tokens=self.max_new_tokens,
            do_sample=False,  # greedy decoding — deterministic output
            temperature=None,  # must be None when do_sample=False
            top_p=None,  # must be None when do_sample=False
            pad_token_id=self.tokenizer.pad_token_id,
            eos_token_id=self.tokenizer.eos_token_id,
        )

    # Keep only the continuation. skip_special_tokens removes markers
    # like <|im_start|> / <|endoftext|> that would otherwise corrupt
    # JSON extraction downstream.
    continuation = out[0][ids.shape[-1] :]
    return str(self.tokenizer.decode(continuation, skip_special_tokens=True))

generate_stream

generate_stream(messages)

Stream raw token text for a list of chat messages.

Runs model generation in a background thread and yields decoded token strings via a TextIteratorStreamer as they are produced. Uses the same greedy decoding settings as `generate()`.

Parameters:

Name Type Description Default
messages list[dict]

List of {"role": ..., "content": ...} dicts.

required

Yields:

Type Description
str

Decoded token text, one chunk per model step.

Source code in src/unbias_plus/model.py
def generate_stream(self, messages: list[dict]) -> Iterator[str]:
    """Yield decoded token text for chat messages as it is generated.

    Generation runs on a daemon thread while a ``TextIteratorStreamer``
    hands decoded chunks back to this generator. Decoding settings
    mirror :meth:`generate` (greedy, deterministic).

    Parameters
    ----------
    messages : list[dict]
        Chat turns as ``{"role": ..., "content": ...}`` dicts.

    Yields
    ------
    str
        Decoded token text, one chunk per model step.

    """
    # Same template handling as generate(): enable_thinking is always
    # explicit so Qwen3's template respects it; thinking_budget only
    # when thinking is actually on.
    chat_args: dict = dict(
        tokenize=True,
        add_generation_prompt=True,
        return_tensors="pt",
        return_dict=True,
        truncation=True,
        max_length=MAX_SEQ_LENGTH,
        enable_thinking=self.enable_thinking,
    )
    if self.enable_thinking:
        chat_args["thinking_budget"] = self.thinking_budget

    encoded = self.tokenizer.apply_chat_template(messages, **chat_args)

    streamer = TextIteratorStreamer(
        self.tokenizer,
        skip_prompt=True,  # do not echo the prompt back to the caller
        skip_special_tokens=True,
    )

    worker = threading.Thread(
        target=self.model.generate,
        kwargs=dict(
            input_ids=encoded["input_ids"].to(self.device),
            attention_mask=encoded["attention_mask"].to(self.device),
            max_new_tokens=self.max_new_tokens,
            do_sample=False,
            temperature=None,
            top_p=None,
            pad_token_id=self.tokenizer.pad_token_id,
            eos_token_id=self.tokenizer.eos_token_id,
            streamer=streamer,
        ),
        daemon=True,
    )
    worker.start()

    try:
        yield from streamer
    finally:
        # Make sure the generation thread has finished, even when the
        # consumer abandons the iterator early.
        worker.join()

Schema

unbias_plus.schema

Data schemas for unbias-plus output.

BiasedSegment

Bases: BaseModel

A single biased segment detected in the text.

Attributes:

Name Type Description
original str

The original biased phrase from the input text.

replacement str

The suggested neutral replacement. Defaults to empty string if the model omits it (e.g. under 4-bit quantization).

severity str

Severity level: 'low', 'medium', or 'high'. Defaults to 'medium' if omitted by the model.

bias_type str

Type of bias (e.g. 'loaded language', 'framing bias').

reasoning str

Explanation of why this segment is considered biased.

start int | None

Character offset start in the original text. Computed by the pipeline after parsing.

end int | None

Character offset end in the original text. Computed by the pipeline after parsing.

Examples:

>>> seg = BiasedSegment(
...     original="Sharia-obsessed fanatics",
...     replacement="extremist groups",
...     severity="high",
...     bias_type="dehumanizing framing",
...     reasoning="Uses inflammatory religious language.",
... )
>>> seg.severity
'high'
Source code in src/unbias_plus/schema.py
class BiasedSegment(BaseModel):
    """One biased span detected in the input text.

    Attributes
    ----------
    original : str
        The original biased phrase from the input text.
    replacement : str
        Suggested neutral replacement. Empty string when the model
        omits it (e.g. under 4-bit quantization).
    severity : str
        Severity level: 'low', 'medium', or 'high'. Falls back to
        'medium' when the model omits or mangles it.
    bias_type : str
        Type of bias (e.g. 'loaded language', 'framing bias').
    reasoning : str
        Explanation of why this segment is considered biased.
    start : int | None
        Character offset start in the original text, filled in by the
        pipeline after parsing.
    end : int | None
        Character offset end in the original text, filled in by the
        pipeline after parsing.

    Examples
    --------
    >>> seg = BiasedSegment(
    ...     original="Sharia-obsessed fanatics",
    ...     replacement="extremist groups",
    ...     severity="high",
    ...     bias_type="dehumanizing framing",
    ...     reasoning="Uses inflammatory religious language.",
    ... )
    >>> seg.severity
    'high'

    """

    original: str
    replacement: str = ""  # optional — model may omit under 4-bit quantization
    severity: str = "medium"  # optional — defaults to medium if omitted
    bias_type: str = ""
    reasoning: str = ""
    start: int | None = None
    end: int | None = None

    @field_validator("severity")
    @classmethod
    def validate_severity(cls, v: str) -> str:
        """Validate and normalise segment severity to low/medium/high."""
        value = v.lower().strip()
        if value in ("low", "medium", "high"):
            return value
        # Unknown label — warn and degrade gracefully rather than fail
        # validation of the whole result.
        logger.warning(
            "Unexpected segment severity '%s', defaulting to 'medium'", v
        )
        return "medium"

validate_severity classmethod

validate_severity(v)

Validate and normalise segment severity to low/medium/high.

Source code in src/unbias_plus/schema.py
@field_validator("severity")
@classmethod
def validate_severity(cls, v: str) -> str:
    """Validate and normalise segment severity to low/medium/high."""
    # Model output may vary in case/whitespace; normalise before checking.
    allowed = {"low", "medium", "high"}
    normalized = v.lower().strip()
    if normalized not in allowed:
        # Degrade to 'medium' instead of raising — keeps a single bad
        # segment from invalidating the whole parsed result.
        logger.warning(
            "Unexpected segment severity '%s', defaulting to 'medium'", v
        )
        return "medium"
    return normalized

BiasResult

Bases: BaseModel

Full bias analysis result for an input text.

Attributes:

Name Type Description
binary_label str

Overall label: 'biased' or 'unbiased'.

severity int

Overall severity score: 0 = neutral / no bias; 2 = recurring biased framing; 3 = strong persuasive tone; 4 = inflammatory rhetoric. If the model returns a string ('low', 'medium', 'high'), it is coerced to the nearest integer value.

bias_found bool

Whether any bias was detected in the text.

biased_segments list[BiasedSegment]

List of biased segments found in the text, each with character-level start/end offsets.

unbiased_text str

Full neutral rewrite of the input text.

original_text str | None

The original input text. Set by the pipeline.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="A neutral version of the text.",
... )
>>> result.binary_label
'biased'
Source code in src/unbias_plus/schema.py
class BiasResult(BaseModel):
    """Complete bias analysis result for one input text.

    Attributes
    ----------
    binary_label : str
        Overall label: 'biased' or 'unbiased'.
    severity : int
        Overall severity score:
          0 = neutral / no bias
          2 = recurring biased framing
          3 = strong persuasive tone
          4 = inflammatory rhetoric
        String values ('low', 'medium', 'high') from the model are
        coerced to the nearest integer value.
    bias_found : bool
        Whether any bias was detected in the text.
    biased_segments : list[BiasedSegment]
        Biased segments found in the text, each with character-level
        start/end offsets.
    unbiased_text : str
        Full neutral rewrite of the input text.
    original_text : str | None
        The original input text. Set by the pipeline.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="A neutral version of the text.",
    ... )
    >>> result.binary_label
    'biased'

    """

    binary_label: str
    severity: int
    bias_found: bool
    biased_segments: list[BiasedSegment]
    unbiased_text: str
    original_text: str | None = None

    @field_validator("binary_label")
    @classmethod
    def validate_binary_label(cls, v: str) -> str:
        """Validate binary_label is 'biased' or 'unbiased'."""
        allowed = {"biased", "unbiased"}
        label = v.lower().strip()
        if label in allowed:
            return label
        raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")

    @field_validator("severity", mode="before")
    @classmethod
    def validate_severity(cls, v: int | str) -> int:
        """Coerce and validate global severity.

        Accepts:
          - int 0, 2, 3, 4  (correct model output)
          - str 'low', 'medium', 'high', 'none'  (model confused scales)
          - any other int   (clamped to nearest valid value)
        """
        if isinstance(v, str):
            # Model confused the global integer scale with the per-segment
            # string scale — map via the shared lookup table.
            coerced = _STR_TO_INT_SEVERITY.get(v.lower().strip())
            if coerced is not None:
                logger.warning(
                    "Global severity returned as string '%s', coerced to %d",
                    v,
                    coerced,
                )
                return coerced
            # Not a known label — maybe an integer in string form, e.g. "3".
            try:
                v = int(v)
            except ValueError:
                logger.warning("Unrecognized severity '%s', defaulting to 2", v)
                return 2

        # Clamp stray integers onto the valid {0, 2, 3, 4} scale.
        if v <= 0:
            return 0
        if v == 1:
            return 2
        return min(v, 4)

validate_binary_label classmethod

validate_binary_label(v)

Validate binary_label is 'biased' or 'unbiased'.

Source code in src/unbias_plus/schema.py
@field_validator("binary_label")
@classmethod
def validate_binary_label(cls, v: str) -> str:
    """Validate binary_label is 'biased' or 'unbiased'."""
    allowed = {"biased", "unbiased"}
    # Normalise case/whitespace before membership test.
    normalized = v.lower().strip()
    if normalized not in allowed:
        raise ValueError(f"binary_label must be one of {allowed}, got '{v}'")
    return normalized

validate_severity classmethod

validate_severity(v)

Coerce and validate global severity.

Accepts: - int 0, 2, 3, 4 (correct model output) - str 'low', 'medium', 'high', 'none' (model confused scales) - any other int (clamped to nearest valid value)

Source code in src/unbias_plus/schema.py
@field_validator("severity", mode="before")
@classmethod
def validate_severity(cls, v: int | str) -> int:
    """Coerce and validate global severity.

    Accepts:
      - int 0, 2, 3, 4  (correct model output)
      - str 'low', 'medium', 'high', 'none'  (model confused scales)
      - any other int   (clamped to nearest valid value)
    """
    # String coercion — model confused global vs segment severity scale
    if isinstance(v, str):
        normalized = v.lower().strip()
        if normalized in _STR_TO_INT_SEVERITY:
            coerced = _STR_TO_INT_SEVERITY[normalized]
            logger.warning(
                "Global severity returned as string '%s', coerced to %d",
                v,
                coerced,
            )
            return coerced
        # Try parsing as int string e.g. "3"
        try:
            v = int(v)
        except ValueError:
            # Unparseable — default to 2 rather than failing validation.
            logger.warning("Unrecognized severity '%s', defaulting to 2", v)
            return 2

    # Clamp out-of-range integer values gracefully
    if v <= 0:
        return 0
    if v in {2, 3, 4}:
        return v
    if v == 1:
        # 1 is not on the scale — promote to the nearest valid value.
        return 2
    return 4  # anything > 4

compute_offsets

compute_offsets(original_text, segments)

Compute character start/end offsets for each biased segment.

Walks the original text with a cursor so that duplicate phrases are matched in order of appearance, not just the first occurrence.

Parameters:

Name Type Description Default
original_text str

The original input text.

required
segments list[BiasedSegment]

Parsed segments from the LLM (without offsets).

required

Returns:

Type Description
list[BiasedSegment]

Segments with start/end fields populated, sorted by start offset.

Source code in src/unbias_plus/schema.py
def compute_offsets(
    original_text: str, segments: list[BiasedSegment]
) -> list[BiasedSegment]:
    """Compute character start/end offsets for each biased segment.

    A cursor advances through the original text so that duplicate
    phrases are matched in order of appearance rather than all
    resolving to the first occurrence.

    Parameters
    ----------
    original_text : str
        The original input text.
    segments : list[BiasedSegment]
        Parsed segments from the LLM (without offsets).

    Returns
    -------
    list[BiasedSegment]
        Segments with start/end fields populated, sorted by start offset.

    """
    located: list[BiasedSegment] = []
    search_from = 0

    for segment in segments:
        phrase = segment.original
        if not phrase:
            # Nothing to locate — skip segments with an empty phrase.
            continue

        # Prefer the next occurrence after the previous match; fall back
        # to a full-text search if nothing is found past the cursor.
        pos = _find_case_insensitive(original_text, phrase, search_from)
        if pos == -1:
            pos = _find_case_insensitive(original_text, phrase, 0)

        if pos == -1:
            logger.warning("Could not find segment in text: '%s'", phrase)
            located.append(segment)  # keep it, just without offsets
            continue

        stop = pos + len(phrase)
        located.append(segment.model_copy(update={"start": pos, "end": stop}))
        search_from = stop

    # Segments that could not be located (start is None) sort as if at 0.
    return sorted(located, key=lambda s: s.start if s.start is not None else 0)

API (FastAPI server)

unbias_plus.api

FastAPI server for unbias-plus.

AnalyzeRequest

Bases: BaseModel

Request body for the analyze endpoint.

Attributes:

Name Type Description
text str

The input text to analyze for bias.

Source code in src/unbias_plus/api.py
class AnalyzeRequest(BaseModel):
    """Payload accepted by the analyze endpoints.

    Attributes
    ----------
    text : str
        The input text to analyze for bias.
    """

    text: str

HealthResponse

Bases: BaseModel

Response body for the health endpoint.

Attributes:

Name Type Description
status str

Server status string.

model str

Currently loaded model name or path.

Source code in src/unbias_plus/api.py
class HealthResponse(BaseModel):
    """Payload returned by the health endpoint.

    Attributes
    ----------
    status : str
        Server status string.
    model : str
        Currently loaded model name or path.
    """

    status: str
    model: str

lifespan async

lifespan(app)

Load the model on startup and release on shutdown.

Parameters:

Name Type Description Default
app FastAPI

The FastAPI application instance.

required

Yields:

Type Description
None
Source code in src/unbias_plus/api.py
@asynccontextmanager
async def lifespan(app: FastAPI) -> AsyncGenerator[None, None]:
    """Initialise the pipeline at startup and drop it at shutdown.

    Parameters
    ----------
    app : FastAPI
        The FastAPI application instance.

    Yields
    ------
    None
    """
    # Configuration may have been stashed on app.state before startup;
    # fall back to package defaults otherwise.
    app.state.pipe = UnBiasPlus(
        model_name_or_path=getattr(app.state, "model_name_or_path", DEFAULT_MODEL),
        load_in_4bit=getattr(app.state, "load_in_4bit", False),
    )
    yield
    # Drop the reference so the model can be released after shutdown.
    app.state.pipe = None

index

index()

Serve the demo UI.

Returns:

Type Description
str

HTML content of the demo page.

Raises:

Type Description
HTTPException

404 if the demo directory is not found.

Source code in src/unbias_plus/api.py
@app.get("/", response_class=HTMLResponse)
def index() -> str:
    """Serve the demo UI page.

    Returns
    -------
    str
        HTML content of the demo page.

    Raises
    ------
    HTTPException
        404 if the demo directory is not found.
    """
    page = DEMO_DIR / "templates" / "index.html"
    if page.exists():
        return page.read_text()
    raise HTTPException(status_code=404, detail="Demo UI not found.")

health

health(request)

Check if the server and model are ready.

Returns:

Type Description
HealthResponse

Server status and loaded model name.

Source code in src/unbias_plus/api.py
@app.get("/health", response_model=HealthResponse)
def health(request: Request) -> HealthResponse:
    """Report server readiness and the loaded model name.

    Returns
    -------
    HealthResponse
        Server status and loaded model name.
    """
    pipe = getattr(request.app.state, "pipe", None)
    # NOTE(review): reaches into pipe._model (private attribute) for the
    # model name — consider exposing it on the pipeline's public API.
    model_name = str(pipe._model.model_name_or_path) if pipe else "not loaded"
    return HealthResponse(status="ok", model=model_name)

analyze

analyze(request, body)

Analyze input text for bias.

Parameters:

Name Type Description Default
request Request

FastAPI request (for app state).

required
body AnalyzeRequest

Request body containing the text to analyze.

required

Returns:

Type Description
BiasResult

Structured bias analysis result with character offsets.

Raises:

Type Description
HTTPException

500 if the model is not loaded or inference fails.

HTTPException

422 if the model output cannot be parsed.

Source code in src/unbias_plus/api.py
@app.post("/analyze", response_model=BiasResult)
def analyze(request: Request, body: AnalyzeRequest) -> BiasResult:
    """Analyze input text for bias.

    Parameters
    ----------
    request : Request
        FastAPI request (for app state).
    body : AnalyzeRequest
        Request body containing the text to analyze.

    Returns
    -------
    BiasResult
        Structured bias analysis result with character offsets.

    Raises
    ------
    HTTPException
        500 if the model is not loaded or inference fails.
    HTTPException
        422 if the model output cannot be parsed.
    """
    pipe = getattr(request.app.state, "pipe", None)
    if pipe is None:
        raise HTTPException(status_code=500, detail="Model not loaded.")
    try:
        result = pipe.analyze(body.text)
    except ValueError as e:
        # Parse/validation failures surface as 422 with the message.
        raise HTTPException(status_code=422, detail=str(e)) from e
    return cast(BiasResult, result)

analyze_stream

analyze_stream(request, body)

Stream bias analysis tokens via SSE, then emit the final parsed result.

Runs model generation in a background thread via TextIteratorStreamer. Each SSE event is a JSON object:

  • {"t": "<token>"} — one chunk per model generation step.
  • {"result": {...}} — final event with the full BiasResult.
  • {"error": "<msg>"} — emitted if inference or parsing fails.

Parameters:

Name Type Description Default
request Request

FastAPI request (for app state).

required
body AnalyzeRequest

Request body containing the text to analyze.

required

Returns:

Type Description
StreamingResponse

Server-sent events stream with Content-Type text/event-stream.

Raises:

Type Description
HTTPException

500 if the model is not loaded.

Source code in src/unbias_plus/api.py
@app.post("/analyze/stream")
def analyze_stream(request: Request, body: AnalyzeRequest) -> StreamingResponse:
    """Stream bias analysis tokens via SSE, then emit the final parsed result.

    Runs model generation in a background thread via TextIteratorStreamer.
    Each SSE event is a JSON object:

    - ``{"t": "<token>"}``     — one chunk per model generation step.
    - ``{"result": {...}}``    — final event with the full BiasResult.
    - ``{"error": "<msg>"}``   — emitted if inference or parsing fails.

    Parameters
    ----------
    request : Request
        FastAPI request (for app state).
    body : AnalyzeRequest
        Request body containing the text to analyze.

    Returns
    -------
    StreamingResponse
        Server-sent events stream with Content-Type text/event-stream.

    Raises
    ------
    HTTPException
        500 if the model is not loaded.
    """
    pipe = getattr(request.app.state, "pipe", None)
    if pipe is None:
        raise HTTPException(status_code=500, detail="Model not loaded.")

    text = body.text

    def event_stream() -> Generator[str, None, None]:
        try:
            messages = build_messages(text)
            raw_output = ""

            # Stream tokens from the background generation thread
            for token in pipe._model.generate_stream(messages):
                raw_output += token
                yield "data: " + json.dumps({"t": token}) + "\n\n"

            # Full output accumulated — parse and compute offsets
            result = parse_llm_output(raw_output)
            segments = compute_offsets(text, result.biased_segments)
            final = result.model_copy(
                update={
                    "biased_segments": segments,
                    "original_text": text,
                }
            )
            yield (
                "data: "
                + json.dumps({"result": final.model_dump(mode="json")})
                + "\n\n"
            )

        except Exception as e:
            # Single handler covers parse errors (ValueError) and any
            # unexpected inference failure — the previous separate
            # except clauses had identical bodies. The error is reported
            # in-band because SSE headers have already been sent, so an
            # HTTP status can no longer be changed.
            yield "data: " + json.dumps({"error": str(e)}) + "\n\n"

    return StreamingResponse(
        event_stream(),
        media_type="text/event-stream",
        headers={
            "Cache-Control": "no-cache",
            "X-Accel-Buffering": "no",  # disable nginx buffering for SSE
        },
    )

serve

serve(
    model_name_or_path=DEFAULT_MODEL,
    host="0.0.0.0",
    port=8000,
    load_in_4bit=False,
    reload=False,
)

Start the unbias-plus API server with the demo UI.

Loads the model and starts a uvicorn server. The demo UI is served at http://localhost:{port}/ and the API is at http://localhost:{port}/analyze.

Parameters:

Name Type Description Default
model_name_or_path str | Path

HuggingFace model ID or local path to the model.

DEFAULT_MODEL
host str

Host address to bind to. Default is '0.0.0.0'.

'0.0.0.0'
port int

Port to listen on. Default is 8000.

8000
load_in_4bit bool

Load model in 4-bit quantization. Default is False.

False
reload bool

Enable auto-reload on code changes. Default is False.

False

Examples:

>>> from unbias_plus.api import serve
>>> serve("Qwen/Qwen3-4B", port=8000)
Source code in src/unbias_plus/api.py
def serve(
    model_name_or_path: str | Path = DEFAULT_MODEL,
    host: str = "0.0.0.0",
    port: int = 8000,
    load_in_4bit: bool = False,
    reload: bool = False,
) -> None:
    """Start the unbias-plus API server with the demo UI.

    Loads the model and starts a uvicorn server. The demo UI
    is served at http://localhost:{port}/ and the API is at
    http://localhost:{port}/analyze.

    Parameters
    ----------
    model_name_or_path : str | Path
        HuggingFace model ID or local path to the model.
    host : str
        Host address to bind to. Default is '0.0.0.0'.
    port : int
        Port to listen on. Default is 8000.
    load_in_4bit : bool
        Load model in 4-bit quantization. Default is False.
    reload : bool
        Enable auto-reload on code changes. Default is False.

    Examples
    --------
    >>> from unbias_plus.api import serve
    >>> serve("Qwen/Qwen3-4B", port=8000)  # doctest: +SKIP
    """
    # Stash configuration on app.state so the lifespan handler can read
    # it when the server starts up and loads the model.
    app.state.model_name_or_path = str(model_name_or_path)
    app.state.load_in_4bit = load_in_4bit
    print(f"Starting unbias-plus server at http://localhost:{port}")
    # NOTE(review): uvicorn's reload option only takes effect when given
    # an import string rather than an app instance — verify whether
    # reload=True is actually supported here.
    uvicorn.run(app, host=host, port=port, reload=reload)

CLI

unbias_plus.cli

CLI entry point for unbias-plus.

parse_args

parse_args()

Parse CLI arguments.

Returns:

Type Description
Namespace

Parsed arguments.

Source code in src/unbias_plus/cli.py
def parse_args() -> argparse.Namespace:
    """Parse CLI arguments.

    Returns
    -------
    argparse.Namespace
        Parsed arguments.

    """
    parser = argparse.ArgumentParser(
        prog="unbias-plus",
        description="Detect and debias text using a single LLM.",
    )

    # Exactly one input mode at a time: text, file, or server.
    input_group = parser.add_mutually_exclusive_group()
    input_group.add_argument(
        "--text",
        type=str,
        help="Text string to analyze.",
    )
    input_group.add_argument(
        "--file",
        type=str,
        help="Path to a .txt file to analyze.",
    )
    input_group.add_argument(
        "--serve",
        action="store_true",
        default=False,
        help="Start the FastAPI server.",
    )

    parser.add_argument(
        "--model",
        type=str,
        default=DEFAULT_MODEL,
        help=f"HuggingFace model ID or local path. Default: {DEFAULT_MODEL}",
    )
    parser.add_argument(
        "--load-in-4bit",
        action="store_true",
        default=False,
        help="Load model in 4-bit quantization to reduce VRAM usage.",
    )
    parser.add_argument(
        "--json",
        action="store_true",
        default=False,
        help="Output result as raw JSON instead of formatted CLI display.",
    )
    parser.add_argument(
        "--max-new-tokens",
        type=int,
        default=2048,
        # %(default)s keeps the help text in sync with the actual default
        # (the previous hard-coded "Default: 1024" contradicted default=2048).
        help="Maximum number of tokens to generate. Default: %(default)s",
    )
    parser.add_argument(
        "--host",
        type=str,
        default="0.0.0.0",
        help="Host for the API server. Default: 0.0.0.0",
    )
    parser.add_argument(
        "--port",
        type=int,
        default=8000,
        help="Port for the API server. Default: 8000",
    )

    return parser.parse_args()

main

main()

Run the unbias-plus CLI.

Examples:

$ unbias-plus --text "Women are too emotional to lead."
$ unbias-plus --file article.txt --json
$ unbias-plus --serve --model path/to/model --port 8000
$ unbias-plus --serve --load-in-4bit

Source code in src/unbias_plus/cli.py
def main() -> None:
    """Run the unbias-plus CLI.

    Examples
    --------
    $ unbias-plus --text "Women are too emotional to lead."
    $ unbias-plus --file article.txt --json
    $ unbias-plus --serve --model path/to/model --port 8000
    $ unbias-plus --serve --load-in-4bit

    """
    args = parse_args()

    # Server mode short-circuits everything else.
    if args.serve:
        serve(
            model_name_or_path=args.model,
            host=args.host,
            port=args.port,
            load_in_4bit=args.load_in_4bit,
        )
        return

    if not (args.text or args.file):
        print(
            "Error: one of --text, --file, or --serve is required.",
            file=sys.stderr,
        )
        sys.exit(1)

    # Resolve the input text: either from a file or directly from --text.
    if args.file:
        try:
            with open(args.file) as f:
                text = f.read()
        except FileNotFoundError:
            print(f"Error: file '{args.file}' not found.", file=sys.stderr)
            sys.exit(1)
    else:
        text = args.text

    pipe = UnBiasPlus(
        model_name_or_path=args.model,
        load_in_4bit=args.load_in_4bit,
        max_new_tokens=args.max_new_tokens,
    )

    output = pipe.analyze_to_json(text) if args.json else pipe.analyze_to_cli(text)
    print(output)

Prompt

unbias_plus.prompt

Prompt templates for the unbias-plus LLM.

build_messages

build_messages(text)

Build the chat messages list for the LLM given input text.

Formats the system prompt and user text into the messages format required by the model's chat template.

Parameters:

Name Type Description Default
text str

The input text to analyze for bias.

required

Returns:

Type Description
list[dict]

List of {"role": ..., "content": ...} dicts ready for tokenizer.apply_chat_template().

Examples:

>>> messages = build_messages("Women are too emotional to lead.")
>>> messages[0]["role"]
'system'
>>> messages[1]["role"]
'user'
>>> "Women are too emotional to lead." in messages[1]["content"]
True
Source code in src/unbias_plus/prompt.py
def build_messages(text: str) -> list[dict]:
    """Assemble the chat messages list for the LLM from input text.

    Combines the system prompt with the user's article in the
    messages format expected by ``tokenizer.apply_chat_template()``.

    Parameters
    ----------
    text : str
        The input text to analyze for bias.

    Returns
    -------
    list[dict]
        List of {"role": ..., "content": ...} dicts ready for
        tokenizer.apply_chat_template().

    Examples
    --------
    >>> messages = build_messages("Women are too emotional to lead.")
    >>> messages[0]["role"]
    'system'
    >>> messages[1]["role"]
    'user'
    >>> "Women are too emotional to lead." in messages[1]["content"]
    True
    """
    user_content = (
        "Analyze the following article for bias and return the result "
        "in the required JSON format.\n\n"
        f"ARTICLE:\n{text}"
    )
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": user_content},
    ]

Parser

unbias_plus.parser

Parser for LLM JSON output into BiasResult objects.

parse_llm_output

parse_llm_output(raw_output)

Parse raw LLM output string into a BiasResult object.

Handles Qwen3 thinking blocks (`<think>...</think>`) as well as plain JSON output from any model. Attempts multiple strategies to extract and parse a JSON object from the raw LLM output, then validates it against the BiasResult schema.

Strategies (in order): 1. Strip thinking block if present (Qwen3 with enable_thinking=True) 2. Direct JSON parse of extracted block 3. Fix truncated strings (LLM cut off mid-output) 4. Fix missing commas between JSON items 5. Aggressive key-by-key extraction as last resort

Parameters:

Name Type Description Default
raw_output str

Raw string returned by the LLM, may include a thinking block, extra text, markdown code fences, or be truncated/malformed.

required

Returns:

Type Description
BiasResult

Validated and structured bias analysis result.

Raises:

Type Description
ValueError

If the output cannot be parsed as valid JSON or does not match the expected BiasResult schema after all repair attempts.

Examples:

>>> raw = '''
... {
...   "binary_label": "biased",
...   "severity": 3,
...   "bias_found": true,
...   "biased_segments": [],
...   "unbiased_text": "A neutral version."
... }
... '''
>>> result = parse_llm_output(raw)
>>> result.binary_label
'biased'
Source code in src/unbias_plus/parser.py
def parse_llm_output(raw_output: str) -> BiasResult:
    """Turn a raw LLM response into a validated BiasResult.

    Works with Qwen3-style thinking output (<think>...</think>) and
    with plain JSON from any other model. Several parsing strategies
    are tried in sequence; the first one that yields a JSON object
    wins, and the object is then validated against the BiasResult
    schema.

    Strategies (in order):
    1. Strip thinking block if present (Qwen3 with enable_thinking=True)
    2. Direct JSON parse of extracted block
    3. Fix truncated strings (LLM cut off mid-output)
    4. Fix missing commas between JSON items
    5. Aggressive key-by-key extraction as last resort

    Parameters
    ----------
    raw_output : str
        The raw model response. May contain a thinking block,
        surrounding prose, markdown code fences, or a truncated /
        malformed JSON payload.

    Returns
    -------
    BiasResult
        The parsed and schema-validated bias analysis result.

    Raises
    ------
    ValueError
        If no repair strategy produces parseable JSON, or the parsed
        object fails BiasResult schema validation.

    Examples
    --------
    >>> raw = '''
    ... {
    ...   "binary_label": "biased",
    ...   "severity": 3,
    ...   "bias_found": true,
    ...   "biased_segments": [],
    ...   "unbiased_text": "A neutral version."
    ... }
    ... '''
    >>> result = parse_llm_output(raw)
    >>> result.binary_label
    'biased'

    """
    # Remove any thinking block up front. Harmless for models that
    # never emit one (no-op), required for Qwen3 with thinking enabled.
    stripped = _strip_thinking_block(raw_output)

    # Extract the JSON region, then strip again: doing the second pass
    # after extraction means a <think> tag hallucinated after the JSON
    # can never make _strip_thinking_block wipe out the whole string.
    text = _strip_thinking_block(_extract_json(stripped))

    # Try progressively more aggressive repairs until one parses.
    # Truncated output is the most common LLM failure, so it's tried
    # right after the direct parse.
    repairs = (
        lambda s: s,                                          # Strategy 1: as-is
        _fix_truncated_json,                                  # Strategy 2: close truncation
        _fix_missing_commas,                                  # Strategy 3: insert commas
        lambda s: _fix_missing_commas(_fix_truncated_json(s)),  # Strategy 4: both
    )
    data = None
    for repair in repairs:
        data = _try_parse(repair(text))
        if data is not None:
            break
    else:
        # Strategy 5: regex field-by-field extraction (last resort).
        data = _extract_fields_by_regex(text)

    if data is None:
        raise ValueError(
            f"LLM output could not be parsed as JSON after all repair attempts.\n"
            f"Raw output:\n{raw_output}"
        )

    # Collapse duplicate / nested segments before schema validation so
    # the same original phrase is never reported twice.
    segments = data.get("biased_segments") if isinstance(data, dict) else None
    if isinstance(segments, list):
        data["biased_segments"] = _remove_contained_segments(
            _deduplicate_segments(segments)
        )

    try:
        return BiasResult(**data)
    except Exception as e:
        raise ValueError(
            f"LLM JSON does not match expected schema.\nData: {data}\nError: {e}"
        ) from e

Formatter

unbias_plus.formatter

Formatters for displaying BiasResult output.

format_cli

format_cli(result)

Format a BiasResult for CLI terminal display.

Produces a human-readable, colored terminal output showing the bias label, severity, each biased segment with its replacement and reasoning, and the full unbiased rewrite.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to format.

required

Returns:

Type Description
str

A human-readable colored string for terminal output.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> output = format_cli(result)
>>> isinstance(output, str)
True
Source code in src/unbias_plus/formatter.py
def format_cli(result: BiasResult) -> str:
    """Format a BiasResult for CLI terminal display.

    Produces a human-readable, colored terminal output showing
    the bias label, severity, each biased segment with its
    replacement and reasoning, and the full unbiased rewrite.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to format.

    Returns
    -------
    str
        A human-readable colored string for terminal output.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> output = format_cli(result)
    >>> isinstance(output, str)
    True

    """
    lines = []
    lines.append("=" * 60)
    if result.bias_found:
        lines.append(f"Segments found: {len(result.biased_segments)}")
    if not result.biased_segments:
        lines.append("\nNo biased segments detected.")
    lines.append("=" * 60)

    if result.biased_segments:
        lines.append("\nBIASED SEGMENTS:")
        for i, seg in enumerate(result.biased_segments, 1):
            color = _SEVERITY_COLORS.get(seg.severity, "")
            reset = _SEVERITY_COLORS["reset"]
            lines.append(f"\n  [{i}] {color}{seg.severity.upper()}{reset}")
            lines.append(f'  Original  : "{seg.original}"')
            lines.append(f'  Replace   : "{seg.replacement}"')
            lines.append(f"  Bias type : {seg.bias_type}")
            lines.append(f"  Reasoning : {seg.reasoning}")

    lines.append("\n" + "-" * 60)
    lines.append("NEUTRAL REWRITE:")
    lines.append(result.unbiased_text)
    lines.append("=" * 60)

    return "\n".join(lines)

format_dict

format_dict(result)

Convert a BiasResult to a plain Python dictionary.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to convert.

required

Returns:

Type Description
dict

Plain dictionary representation of the result.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> d = format_dict(result)
>>> isinstance(d, dict)
True
Source code in src/unbias_plus/formatter.py
def format_dict(result: BiasResult) -> dict:
    """Convert a BiasResult to a plain Python dictionary.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to convert.

    Returns
    -------
    dict
        Plain dictionary representation of the result.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> d = format_dict(result)
    >>> isinstance(d, dict)
    True

    """
    return result.model_dump()

format_json

format_json(result)

Convert a BiasResult to a formatted JSON string.

Parameters:

Name Type Description Default
result BiasResult

The bias analysis result to convert.

required

Returns:

Type Description
str

Pretty-printed JSON string representation of the result.

Examples:

>>> result = BiasResult(
...     binary_label="biased",
...     severity=3,
...     bias_found=True,
...     biased_segments=[],
...     unbiased_text="Neutral.",
... )
>>> json_str = format_json(result)
>>> isinstance(json_str, str)
True
Source code in src/unbias_plus/formatter.py
def format_json(result: BiasResult) -> str:
    """Convert a BiasResult to a formatted JSON string.

    Parameters
    ----------
    result : BiasResult
        The bias analysis result to convert.

    Returns
    -------
    str
        Pretty-printed JSON string representation of the result.

    Examples
    --------
    >>> result = BiasResult(
    ...     binary_label="biased",
    ...     severity=3,
    ...     bias_found=True,
    ...     biased_segments=[],
    ...     unbiased_text="Neutral.",
    ... )
    >>> json_str = format_json(result)
    >>> isinstance(json_str, str)
    True

    """
    return json.dumps(result.model_dump(), indent=2)