Oven logo

Oven

proselint0.16.0

Published

A linter for prose.

pip install proselint

Package Downloads

Weekly DownloadsMonthly Downloads

Requires Python

>=3.10

proselint logo

Workflow status codecov License

Writing is notoriously hard, even for the best writers, and it's not for lack of good advice — a tremendous amount of knowledge about the craft is strewn across usage guides, dictionaries, technical manuals, essays, pamphlets, websites, and the hearts and minds of great authors and editors. But poring over Strunk & White hardly makes one a better writer — it turns you into neither Strunk nor White. And nobody has the capacity to apply all the advice from Garner’s Modern English Usage, an 1100-page usage guide, to everything they write. In fact, the whole notion that one becomes a better writer by reading advice on writing rests on untenable assumptions about learning and memory. The traditional formats of knowledge about writing are thus essentially inert, waiting to be transformed.

We devised a simple solution: proselint, a linter for English prose. A linter is a computer program that, akin to a spell checker, scans through a file and detects issues — like how a real lint roller helps you get unwanted lint off of your shirt.

proselint places the world's greatest writers and editors by your side, where they whisper suggestions on how to improve your prose. You’ll be guided by advice inspired by Bryan Garner, David Foster Wallace, Chuck Palahniuk, Steve Pinker, Mary Norris, Mark Twain, Elmore Leonard, George Orwell, Matthew Butterick, William Strunk, Elwyn White, Philip Corbett, Ernest Gowers, and the editorial staff of the world’s finest literary magazines and newspapers, among others. Our goal is to aggregate knowledge about best practices in writing and to make that knowledge immediately accessible to all authors in the form of a linter for prose; all in a neat command-line utility that you can integrate into other tools, scripts, and workflows.

Installation

To get this up and running, install it using pip.

pip install proselint

Fedora

sudo dnf install proselint

Debian

sudo apt install python3-proselint

Ubuntu

sudo add-apt-repository universe
sudo apt install python3-proselint

Nix

proselint is packaged by nixpkgs.

Declarative

environment.systemPackages = [pkgs.proselint];

Imperative

nix profile install nixpkgs#proselint

Plugins for other software

proselint is available on:

Usage

Suppose you have a document text.md with the following text:

John is very unique.

You can run proselint over the document using the command line:

proselint check text.md

This prints a list of suggestions to stdout, one per line. Each suggestion is of the form:

file:<line>:<column>: <check_name>: <message>

For example,

text.md:1:9: uncomparables: Comparison of an uncomparable: 'very unique' is not comparable.

The command-line utility can also print suggestions in JSON using the --output-format json option. In this case, the output is considerably richer, following our stable wire schema.

{
  "result": {
    "file:///path/to/text.md": {
      "diagnostics": [
        {
          // Name of the check that output this suggestion.
          "check_path": "uncomparables",
          // Message to describe the suggestion
          "message": "Comparison of an uncomparable: 'very unique' is not comparable.",
          // Line and column where the error begins in the source
          "pos": [1, 9],
          // Absolute start and end of the error in the source
          "span": [9, 20],
          // Suggested replacements for the content, if applicable
          "replacements": null,
        }
      ]
    }
  }
}

To run the linter as part of another Python program, you can use the LintFile class in proselint.tools. This requires CheckRegistry to be populated.

from proselint.checks import __register__
from proselint.registry import CheckRegistry
from proselint.tools import LintFile

CheckRegistry().register_many(__register__)
suggestions = LintFile("source-name", "This sentence is very unique").lint()

This will return a list of suggestions:

[LintResult(
    check_result=CheckResult(
        check_path='uncomparables',
        message="Comparison of an uncomparable: 'very unique' is not comparable.",
        span=(18, 29),
        replacements=None,
    ),
    pos=(1, 18),
)]

Checks

You can disable any of the checks by modifying $XDG_CONFIG_HOME/proselint/config.json. If $XDG_CONFIG_HOME is not set or empty, ~/.config/proselint/config.json will be used. Additionally, for compatibility reasons, the legacy configurations ~/.proselintrc and $XDG_CONFIG_HOME/proselint/config will be checked if $XDG_CONFIG_HOME/proselint/config.json does not exist. Check selection is granular at any level, illustrated in the following example:

{
  "checks": {
    "typography": true,
    "typography.symbols": false,
    "typography.symbols.curly_quotes": true,
    "typography.punctuation.hyperbole": false,
  }
}

This configuration would enable all checks in the typography module, excluding typography.punctuation.hyperbole and those in typography.symbols, but preserving typography.symbols.curly_quotes. Using this system allows you to concisely and precisely select checks at an individual level.

IDDescription
annotationsCatch annotations left in the text
archaismAvoid archaic forms
cliches.hellAvoid a common cliché regarding hell
cliches.miscAvoid clichés
dates_times.am_pmFormat the time of day correctly
dates_times.datesFormat dates appropriately
hedgingAvoid undermining yourself with uncertainty
industrial_language.airlineseAvoid jargon of the airline industry
industrial_language.bureaucrateseAvoid bureaucratese
industrial_language.chatspeakAvoid lolling and other chatspeak
industrial_language.commercialeseAvoid jargon of the commercial world
industrial_language.corporate_speakAvoid corporate buzzwords
industrial_language.jargonAvoid miscellaneous jargon
lexical_illusionsAvoid repeating words or phrases
malapropismsAvoid common malapropisms
misc.apologizingBe confident and avoid excessive apologizing
misc.back_formationsAvoid redundant backformations
misc.butDo not start a paragraph with "But..."
misc.capitalizationCapitalize only what ought to be capitalized
misc.compositionAdhere to principles of composition
misc.currencyAvoid redundant currency symbols
misc.debasedAvoid debased language
misc.false_pluralsAvoid false plurals
misc.greylistAvoid greylisted terms
misc.illogicAvoid illogical forms
misc.inferior_superiorSuperior to, not than
misc.institution_nameUse the correct names of institutions
misc.latinAvoid overuse of Latin phrases
misc.many_aUse singular forms with "many a"
misc.metadiscourseAvoid discussing the discussion
misc.narcissismTalk about the subject, not its study
misc.not_guiltyAvoid "not guilty beyond a reasonable doubt"
misc.phrasal_adjectivesHyphenate phrasal adjectives correctly
misc.preferred_formsUse the preferred forms of terms
misc.pretensionDo not be pretentious
misc.professionsUse the right names for jobs
misc.scare_quotesDo not misuse scare quotes
misc.suddenlyRetain suddenness by not using "suddenly"
misc.tense_presentFollow advice from Tense Present
misc.waxedUse adjectives for waxed, as in "wax poetic"
misc.whenceAvoid redundancy with "whence"
mixed_metaphorsDo not mix metaphors
mondegreensAvoid mondegreens
needless_variantsUse preferred forms over uncommon variants
nonwordsDo not use nonwords
oxymoronsAvoid oxymorons
psychologyAvoid misusing psychological terms
redundancy.miscAvoid redundancy in phrases
redundancy.ras_syndromeAvoid redundancy in acronyms
restricted.elementaryRestrict writing to terms from elementary school
restricted.top1000Restrict writing to the top 1000 words by usage
skunked_termsAvoid using skunked terms
social_awareness.lgbtqBe aware of LGBTQ+ terminology
social_awareness.nwordTake responsibility for use of "the n-word"
social_awareness.sexismBe aware of sexist language
spelling.able_atableUse the correct form of -able and -atable
spelling.able_ibleUse the correct form of -able and -ible
spelling.ally_lyUse the correct form of -ally and -ly
spelling.ance_enceUse the correect form of -ance and -ence
spelling.athletesSpell the names of athletes correctly
spelling.consistencyBe consistent in spelling
spelling.ely_lyUse the correct form of -ely and -ly
spelling.em_im_en_inUse the correct form of -em, -im, -en, and -in
spelling.er_orUse the correct form of -er and -or
spelling.in_unUse the correct form of -in and -un
spelling.miscSpell miscellaneous terms correctly
spelling.ve_ofUse the correct form of -ve and -of
terms.animal_adjectivesUse the right adjectives for likening animals
terms.denizen_labelsUse the right names for denizens
terms.eponymous_adjectivesUse the right names for likening people
terms.veneryUse the right names for groups of animals
typography.diacritical_marksUse diacritical marks
typography.punctuationUse punctuation correctly
typography.symbolsUse symbols correctly
uncomparablesDo not compare uncomparables
weasel_wordsAvoid weasel words

Contributing

Interested in contributing to proselint? Great — there are plenty of ways you can help. Check out our contributing guidelines, where we describe how you can help us build proselint into the greatest writing tool in the world.

Support

If you run into a problem, please open an issue.

Running Tests

Automated tests are included in the tests directory. To run these tests locally, you can use pytest via poe test.

License

The project is licensed under the BSD license.