Researchers propose HierBias, a hierarchical model for context-aware media bias detection
The method jointly classifies binary bias and fine-grained bias types, reporting statistically significant gains over prior work on BABE and BASIL benchmarks.
1 source · cross-referenced
- HierBias introduces a hierarchical, context-conditioned architecture for media bias detection that models inter-sentence dependencies.
- The model pairs a sentence-level RoBERTa encoder with a cross-sentence Transformer aggregator and dual output heads for binary detection and four-class type classification.
- On BABE and BASIL, HierBias achieves 0.853 F1 and 0.723 MCC, outperforming the prior state of the art by +2.6% F1 and +4.3% MCC (McNemar’s test, p < 0.05).
- The authors provide theoretical results showing that leveraging document context reduces Bayes error when inter-sentence mutual information is non-zero.
- A multi-task learning bound demonstrates that joint training improves sample efficiency on small annotated corpora.
Researchers at an unspecified institution introduce HierBias, a hierarchical context-conditioned media bias detector that explicitly models inter-sentence dependencies ignored by prior sentence-level approaches.
The paper formalizes a context-conditioned bias probability and proves that incorporating document context strictly reduces Bayes error for sentence-level classification when inter-sentence mutual information is non-zero.
HierBias combines a RoBERTa encoder at the sentence level with a cross-sentence Transformer aggregator and dual output heads for binary bias detection and four-class bias type classification.
On the BABE and BASIL benchmarks, HierBias reports 0.853 F1 and 0.723 MCC, surpassing the prior state-of-the-art bias detector by +2.6% F1 and +4.3% MCC, with statistical significance per McNemar’s test (p < 0.05).
Ablation experiments indicate that each theoretical component contributes independently and consistently to performance.
The authors also derive a multi-task generalization bound showing that jointly training binary bias detection and fine-grained type classification improves sample efficiency on small annotated corpora.
- Jun 26, 2026 · arXiv cs.AI
Paper proposes activation-steering method to detect and reduce sycophancy in language models
Trust79 - Jun 26, 2026 · arXiv cs.AI
Compliant persona in chat models suppresses refusal, study finds
Trust79 - Jun 25, 2026 · arXiv cs.AI
Researchers propose automated benchmark generation for neural relational reasoning using LLMs
Trust79