textwrap – Formatting text paragraphs¶
Purpose: | Formatting text by adjusting where line breaks occur in a paragraph. |
---|---|
Available In: | 2.5 |
The textwrap module can be used to format text for output in situations where pretty-printing is desired. It offers programmatic functionality similar to the paragraph wrapping or filling features found in many text editors.
Example Data¶
The examples below use textwrap_example.py, which contains a string sample_text:
sample_text = ''' The textwrap module can be used to format text for output in situations where pretty-printing is desired. It offers programmatic functionality similar to the paragraph wrapping or filling features found in many text editors. '''
Filling Paragraphs¶
The fill() convenience function takes text as input and produces formatted text as output. Let’s see what it does with the sample_text provided.
import textwrap
from textwrap_example import sample_text
print 'No dedent:\n'
print textwrap.fill(sample_text)
The results are something less than what we want:
$ python textwrap_fill.py
No dedent:
The textwrap module can be used to format text for output in
situations where pretty-printing is desired. It offers
programmatic functionality similar to the paragraph wrapping
or filling features found in many text editors.
Removing Existing Indentation¶
Notice the embedded tabs and extra spaces mixed into the middle of the output. It looks pretty rough. We can do better if we start by removing any common whitespace prefix from all of the lines in the sample text. This allows us to use docstrings or embedded multi-line strings straight from our Python code while removing the formatting of the code itself. The sample string has an artificial indent level introduced for illustrating this feature.
import textwrap
from textwrap_example import sample_text
dedented_text = textwrap.dedent(sample_text).strip()
print 'Dedented:\n'
print dedented_text
The results are starting to look better:
$ python textwrap_dedent.py
Dedented:
The textwrap module can be used to format text for output in situations
where pretty-printing is desired. It offers programmatic functionality similar
to the paragraph wrapping or filling features found in many text editors.
Since “dedent” is the opposite of “indent”, the result is a block of text with the common initial whitespace from each line removed. If one line is already indented more than another, some of the whitespace will not be removed.
One tab.
Two tabs.
One tab.
becomes
One tab.
Two tabs.
One tab.
Combining Dedent and Fill¶
Next, let’s see what happens if we take the dedented text and pass it through fill() with a few different width values.
import textwrap from textwrap_example import sample_text dedented_text = textwrap.dedent(sample_text).strip() for width in [ 20, 60, 80 ]: print print '%d Columns:\n' % width print textwrap.fill(dedented_text, width=width)
This gives several sets of output in the specified widths:
$ python textwrap_fill_width.py
20 Columns:
The textwrap module
can be used to
format text for
output in situations
where pretty-
printing is desired.
It offers
programmatic
functionality
similar to the
paragraph wrapping
or filling features
found in many text
editors.
60 Columns:
The textwrap module can be used to format text for output in
situations where pretty-printing is desired. It offers
programmatic functionality similar to the paragraph wrapping
or filling features found in many text editors.
80 Columns:
The textwrap module can be used to format text for output in situations where
pretty-printing is desired. It offers programmatic functionality similar to the
paragraph wrapping or filling features found in many text editors.
Hanging Indents¶
Besides the width of the output, you can control the indent of the first line independently of subsequent lines.
import textwrap
from textwrap_example import sample_text
dedented_text = textwrap.dedent(sample_text).strip()
print textwrap.fill(dedented_text, initial_indent='', subsequent_indent=' ')
This makes it relatively easy to produce a hanging indent, where the first line is indented less than the other lines.
$ python textwrap_hanging_indent.py
The textwrap module can be used to format text for output in
situations where pretty-printing is desired. It offers
programmatic functionality similar to the paragraph wrapping or
filling features found in many text editors.
The indent values can include non-whitespace characters, too, so the hanging indent can be prefixed with * to produce bullet points, etc. That came in handy when I converted my old zwiki content so I could import it into trac. I used the StructuredText package from Zope to parse the zwiki data, then created a formatter to produce trac’s wiki markup as output. Using textwrap, I was able to format the output pages so almost no manual tweaking was needed after the conversion.
See also
- textwrap
- Standard library documentation for this module.
- Text Processing Tools
- More tools for working with text.