Verbosity
Detects unnecessary wordiness and padding phrases that inflate text length without adding substance, a common trait of AI-generated academic writing.
Technical description
Identifies filler phrases, redundant modifiers, and padding expressions using curated dictionaries. Measures word-to-content ratio by counting substantive content words versus discourse markers, intensifiers, and filler. Computes padding density (filler_count / sentence_count) and flags sentences exceeding a word count threshold with low information density.
How it works
Layer 1 (deterministic): Matches text against a dictionary of ~150 padding phrases ('it is important to note that', 'in the context of', 'a wide range of'). Counts redundant modifiers and pleonasms. Calculates padding density per paragraph. Flags sentences with more filler words than content words.
Why this matters
Language models tend to produce verbose text because they are trained to generate plausible continuations, leading to repetitive elaboration and padding. Human academic writers, constrained by journal word limits, typically write more concisely. Excessive verbosity is a reliable surface-level signal of machine generation.
Score thresholds
- 0-1
- Concise writing with minimal padding
- 2-3
- Moderate use of filler phrases
- 4-5
- Heavily padded with redundant expressions throughout
Limitations
Some academic writing traditions (especially in social sciences) use more elaborate phrasing naturally. Non-native English speakers may use padding phrases as discourse connectors. Grant proposals often contain formulaic language that may trigger false positives.
References
- McCarthy PM, Jarvis S. (2010). MTLD, vocd-D, and HD-D: a validation study of sophisticated approaches to lexical diversity assessment. Behavior Research Methods
- Holtzman A, Buys J, Du L, Forbes M, Choi Y. (2020). The curious case of neural text degeneration. ICLR
- Welleck S, Kulikov I, Roller S, Dinan E, Cho K, Weston J. (2020). Neural text generation with unlikelihood training. ICLR
- Liang W, Zhang Y, Wu Z, et al.. (2025). Quantifying large language model usage in scientific papers. Nature Human Behaviour
- Markowitz DM, Hancock JT, Bailenson JN. (2024). Linguistic markers of inherently false AI communication and intentionally false human communication: evidence from hotel reviews. Journal of Language and Social Psychology
- Mosteller F, Wallace DL. (1964). Inference and Disputed Authorship: The Federalist. Addison-Wesley
- Finlayson M, Hewitt J, Koller A, Swayamdipta S, Sabharwal A. (2024). Closing the curious case of neural text degeneration. NeurIPS
- Krishna K, Song Y, Karpinska M, Wieting J, Iyyer M. (2023). Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense. NeurIPS
- Anonymous. (2025). Diversity boosts AI-generated text detection. arXiv:2509.18880