mlpproc


Namemlpproc JSON
Version 1.0.3 PyPI version JSON
download
home_pagehttps://github.com/dlesbre/mlpproc
SummaryMLPP: Markup Language Pre-Processor for text files (code/html/tex/...) inspired by the C preprocessor
upload_time2024-05-26 11:43:20
maintainerNone
docs_urlNone
authorDorian Lesbre
requires_python>=3.6
licenseMIT
keywords mlpproc preprocessor preprocess markup language python terminal
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # MLPProc : Markup Language Pre-Preprocessor

[![Maintenance](https://img.shields.io/badge/Maintained%3F-yes-brightgreen.svg)](https://github.com/dlesbre/mlpproc/graphs/commit-activity)
[![PyPI version](https://img.shields.io/pypi/v/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)
[![PyPI pyversions](https://img.shields.io/pypi/pyversions/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)
[![License](https://img.shields.io/pypi/l/mlpproc.svg)](https://github.com/dlesbre/mlpproc/blob/master/LICENSE)
[![PyPI status](https://img.shields.io/pypi/status/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)
[![Downloads](https://pepy.tech/badge/mlpproc)](https://pepy.tech/project/mlpproc)

Simple program to preprocess text files. It is inspired by the C preprocessor and should work with any language.

## Contents

- [Contents](#contents)
- [Installation](#installation)
- [Manual installation](#manual-installation)
- [Preprocessor syntax](#preprocessor-syntax)
  - [Basic syntax](#basic-syntax)
  - [Nesting and resolution order](#nesting-and-resolution-order)
- [Command line usage](#command-line-usage)
- [Python usage](#python-usage)
- [Command and block reference](#command-and-block-reference)
  - [Commands](#commands)
  - [Blocks](#blocks)
- [Defining commands, blocks and final actions](#defining-commands-blocks-and-final-actions)
  - [Useful functions](#useful-functions)

## Installation

This package has no dependencies. To install simply run
```console
pip install mlpproc
```

Once installed you can call the preprocessor with
```console
mlpp --version
python -m mlpproc --version
```

## Manual installation

For a manual installation run

```console
git clone https://github.com/dlesbre/mlpproc.git &&
cd mlpproc &&
python3 -m venv venv &&
source venv/bin/activate &&
make setup
```

For development install, run `make setup-dev` to install extra dependencies and setup pre-commit.

## Preprocessor syntax

### Basic syntax

The preprocessor instructions are wrapped between "{%" and "%}". These tokens can be changed if they conflict with the syntax of the file's langage. Instructions are case-sensitive.

Preprocessor instructions are split in three categories :

- **Commands**: `{% command [args] %}`

	Commands print text where they are placed. For instance `{% date %}` prints the current date.
	Some special commands print no text but perform actions. `{% def name my_name %}` prints nothing but defines a new command `{% name %}` which prints `my_name`.

- **Blocks**: `{% block_name [args] %} ... some text ... {% endblock_name %}`

	Blocks work very similarly to commands: they wrap around some text and alter it in some way. For instance the `{% verbatim %}` block prints all text in itself verbatim, without rendering any of the commands.

- **Final actions**: some actions can be queued by special commands. They occur once every command and block in the file has been rendered and affect the whole current file. For instance `{% replace foo bar %}` will replace all instances of "foo" with "bar" in the whole rendered file (including occurrences before the command is called). Final actions can be restricted to a smaller part of the document with `{% block -a %}...{% endblock %}` :

		some text... foo here is not replaced
		{% begin block -a %}
			foo here is replaced
			{% replace foo bar %}
			foo here is replaced
			{% begin block %}
				foo here is also replaced
			{% endblock %}
			{% command foo as argument isn't replaced %}
			{% command that prints foo %} will be replaced
		{% endblock %}

For a list of command run `mlpp -h commands` or see the [command and block reference](#command-and-block-reference).

### Nesting and resolution order

Commands can be nested within one another : `{% foo {% bar %} %}`. The most innermost command is called first. So here `{% bar %}` is called, then `{% foo bar_output %}` is called. You can even do `{% {% foo %} %}` which will call `{% foo %}` first, then call `{% foo_output %}`. Note that this will fail if `foo_output` isn't a valid command, just like the previous command will fail is `bar_output` isn't a valid argument for `foo`.

Nesting can also be used for block arguments, but *it can NOT be used for block names and endblock*. This will likely fail block resolution and result in matching the wrong endblock or no endblock.

---

## Command line usage

The preprocessor can be called from the command line with:

```console
mlpp [--flags] [input_file]
python3 -m mlpproc [--flags] [input_file]
```

The default input file is `stdin`. Command line options are:

- `-o --output <file>` specifies a file to write output to. Default is stdout
- `-b --begin <string>` change the begin token (default is `"{% "`)
- `-e --end <string>` change the end token (default is `" %}"`)
- `-r --recursion_depth <number>` set the max recursion depth (default {rec}). Use -1 for no maximum recursion (dangerous)
- `-d -D --define <name>[=<value>]` defines a simple command with name `<name>` which prints `<value>` (nothing if no value). Can be used multiple times on command line
- `-i -I --include <path>` Adds paths to the INCLUDE_PATH. default INCLUDE_PATH is `[".", dir(input_file), dir(output_file)]`. Can be used multiple times on command line
- `w --warnings <hide|error>` choose whether to hide warnings or have them raise an error. default is display.
- `s --silent <warning_name>` silence a specific warning (ex: `"extra-arguments"`)
- `v --version` show version and exit
- `h --help` show this help and exit
- `h --help commands` show a list of commands and blocks and exit
- `h --help <cmd_name>` show help for a specific command of block

## Python usage

The package can be imported in python 3 with `import mlpproc`. This imports a `Preprocessor` class with all default commands included (see [list](#command-and-block-reference)). The simplest way to use the preprocessor is then:

```Python
import mlpproc

preprocessor = mlpproc.Preprocessor()

parsed_contents = preprocessor.process(file_contents, filename)
```

The filename is only needed for pretty error reports, and can be

You can configure the preprocessor directly via it's public attributes:

- `max_recursion_depth: int` (default 20) - raises an error past this depth
- `token_begin: str` and `token_end: str` (default "{% " and " %}") - tokens wrapping preprocessor calls in the document. They should not be equal or be a single or double quote (`'` and `"`) or parentheses `(` or `)`.
- `token_endblock: str` (default "end") - specifies what form the endblock command takes with the regex `<token_begin>\s*<token_endblock><block_name>\s*<token_end>`
- `safe_calls: bool` (default True) - if True, catches exceptions raised by command or blocks
- `error_mode: mlpproc.ErrorMode` (default RAISE), how errors are handled:
	- PRINT_AND_EXIT -> print to stderr and exit
	- PRINT_AND_RAISE -> print to stderr and raise exception
	- RAISE -> raise exception
- `warning_mode: mlpproc.WarningMode` (default RAISE)
	- HIDE -> do nothing
	- PRINT -> print to stderr
	- RAISE -> raise python warning
	- AS_ERROR -> passes to self.send_error()
- `use_color: bool` (default False) if True, uses ansi color when printing errors



---

## Command and block reference

Here follows a list of predefined commands and blocks. An up-to-date list can be found by running `mlpp -h commands` and detailed descriptions obtained by running `mlpp -h <command_name>`.

### Commands

#### begin

```
  Prints the current begin token (default "{%")

  Usage: begin [<number>]
  The optional number is used for recursion calls
    begin     -> "{%"
    begin 0   -> "{%"
    begin <n> -> "{% begin <n-1> %}"
```

#### call

```
  Prints a call to its arguments.

  Ex: "{% call my_command my_args %}" -> "{% my_command my_args %}"
  Useful in defs to use recursive calls.
  For recursion you can stack calls:
  "{% call call ... %}" -> "{% call ... %}"
```

#### capitalize

```
  Converts text to Capitalized case

  usage: capitalize [text]

  If text is present, converts text
  else converts everything in the document (can be restricted with block).
```

#### date

```
  Prints the current date.

  Usage: date [format=YYYY-MM-DD]
    format specifies year with YYYY or YY, month with MM or M,
    day with DD or D, hour with hh or h, minutes with mm or m
    seconds with ss or s
```

#### def

```
  Defines a new command or macro.

  Usage:
   def foo               -> defines empty foo command (prints nothing)
   def foo   some text   -> {% foo %} prints "some text"
                            (strips trailing/leading space)
   def foo " some text " -> {% foo %} prints " some text "
   def foo(arg1, arg2) text with arg1 and arg2
      -> {% foo bar "hi there" %} prints "text with bar and hi there"

  def overwrites old commands and blocks irreversibly.
  All defs are global, including those coming from sub-blocks and included files.

  The text in a def is effectively rendered twice:
  - the text is rendered once at definition (due to nesting)
  - when the new command is called argument substitution takes place first
    and the text is re-rendered before printing
  defs can use nesting and recursive calls using command like call, begin and end.

    {% def name john %}

    // name is evaluated before def
    {% def rec1 {% name %} %}

    // call evaluated before def, prints {% name %}
    // which will be evaluated when define is called
    {% def rec2 {% call name %} %}

    // 1rst call evaluated in define, prints {% call name %}
    // which will be evaluated when define is called
    {% def rec3 {% call call name %} %}

    {% def name alice %}
    {% rec1 %} -> prints john
    {% rec2 %} -> prints alice
    {% rec3 %} -> prints {% name %}

  defs can be overloaded on the number of arguments:

    {% def sum(a,b) a+b %}
    {% def sum(a)   {% sum a 0 %} %}
    {% sum 5 10 %} -> prints 5+10
    {% sum 5 %}    -> prints 5+0
```

#### deflist

```
  Defines a new command for a list of objects.

  Usage: deflist list_name space separated list " element with spaces "

  Defines list_name such that:
          list_name          prints the lists
          list_name <number> prints the n-th element
                             (number must be a between -length+1 and length+1)

  Can be used in combination with the for block to iterate multiple lists in a loop.
```

#### end

```
  Prints the current end token (default "%}")

  Usage: end [<number>]
  The optional number is used for recursion calls
    end     -> "%}"
    end 0   -> "%}"
    end <n> -> "{% end <n-1> %}"
```

#### error

```
  Raises an error.
  Use with if block to raise errors if conditions are not met.

  Usage: error [message]
```

#### filename

```
  Prints the name of the current file being parsed.
```

#### fix_first_line

```
  Ensures the document starts with a non-empty
  line (unless it is empty)
```

#### fix_last_line

```
  Ensures the file ends with a single empty
  line (unless it is empty)
```

#### include

```
  Includes the content of another file.

  Usage: include [--options] path
    path can be absolute or relative to
    any path in include_path: [current_working_dir, input_file_dir, output_file_dir]
    paths can be added to include_path with the --include/-i/-I preprocessor option

  Options:
    -b --begin <string> specify the begin token ("{%")
                        defaults to the same as current file
    -e --end   <string> specify the end token ("%}")
                        defaults to the same as current file
    -v --verbatim       when present, includes files as is, without parsing.
```

#### input_name

```
  Prints name of input file
  (only defined when called via mlpp or mlpproc.__main__.preprocessor_main())
```

#### label

```
  Adds a label at the current position

  Usage: label <label_name>
  Where label_name must be a valid identifier.

  Can be used in combination with the atlabel block
  to place text at all occurrences of a label.
```

#### line

```
  Prints the current line number.
  This is the line number of the command in the input file, the line
  in the output file may differ due to insertions/deletions.
```

#### lower

```
  Converts text to lower case

  usage: lower [text]

  If text is present, converts text
  else converts everything in the document (can be restricted with block).
```

#### output_name

```
  Prints name of output file
  (only defined when called via mlpp or mlpproc.__main__.preprocessor_main())
```

#### paste

```
  Pastes the contents of a clipboard (defined in a cut block)

  Usage: paste [-v|--verbatim] [clipboard]
    if --verbatim is set, paste the text as is, without rendering it
    clipboard is a string identifying the clipboard (default "").
    it must match a previous cut block's clipboard argument
```

#### replace

```
  Used to find and replace text

  Usage: replace [--options] pattern replacement [text]

  If text is present, replacement takes place in text.
  else it takes place in the whole document (can be restricted with block)

  Options:
    -c --count <number> number of occurrences to replace (default all)
    -i --ignore-case    pattern search ignores case (foo will match foo,FoO,FOO...)
    -w --whole-word     pattern only matches full words, i.e. occurrences not directly
                        preceded/followed by a letter/number/underscore.
    -r --regex          pattern is a regular expression, capture groups can be placed
                        in replacement with \1, \2,...
                        incompatible with --whole-word
```

#### strip

```
  Removes empty lines as well as trailing/leading whitespace.
  Ensures file ends on a single empty line
```

#### strip_empty_lines

```
  Removes empty lines (lines containing only spaces)
```

#### strip_leading_whitespace

```
  Removes leading whitespace (indent)
```

#### strip_trailing_whitespace

```
  Removes trailing whitespace
```

#### undef

```
  Undefines a command or block.
  This is irreversible and can undefine built-in commands and blocks.

  Usage: undef name
```

#### version

```
  Prints the mlpp version.
```

#### warning

```
  Raises a warning.
  Use with if block to raise warnings if conditions are not met.

  Usage: warning [message]
```

### Blocks

#### atlabel

```
  Renders a chunk of text and places it at all labels matching
  its label when processing is done.

  Usage: atlabel <label>

  It differs from the cut block in that:
  - it will also print its content to calls of {% label XXX %} preceding it
  - it can't be overwritten (at most one atlabel block per label)
  - the text is rendered just once, at the atlabel block definition
    (and not in where the text will be placed)

  ex:
    "{% def foo bar %}
    first label: {% label my_label %}
    {% atlabel my_label %}foo is {% foo %}{% endatlabel %}
          {% def foo notbar %}
    second label: {% label my_label %}"
  prints:
    "
    first label: foo is bar

    second label: foo is bar"

  Can be used in combination with include to create files inheriting
  from a common base.
```

#### block

```
  Block used to restrict action/defs/labels... to a local part of the files
  Can be very useful to wrap an include

  usage: block [--options]

  options:
    -b --begin <string>  change the begin token (default is same as current)
    -e --end <string>    change the end token (default is same as current)
    -d --local-defs      commands defined and undefined in the block are local
    -a --local-actions   final actions called in the block will only affect the block
                         use this to restrict replace, upper, ... to a section
    -c --local-clipboard the clipboard defined by cut in the block are local
    -l --local-labels    labels defined in the block are local, so they can
                         only be written to by local atlabel blocks

  Just like the verbatim block, changing begin and end means that the block will
  end at the first {% endblock %} not matching a {% block %}:

    {% block -b < -e > %}
      {% block blabla %}
      ...
      {% endblock %} // this endblock is ignored
    {% endblock %} // block ends here
```

#### comment

```
  This block is a comment.
  All text until the next 'endcomment' is ignored
```

#### cut

```
  Used to cut a section of text to paste elsewhere.
  The text is processed when pasted, not when cut

  Usage: cut [--pre-render|-p] [<clipboard_name>]
    if --pre-render - renders the block here
      (it will be re-rendered at time of pasting, unless using paste -v|--verbatim)
    clipboard is a string identifying the clipboard, default is ""

  ex:
    {% cut %}foo is {% foo %}{% endcut %}
    {% def foo bar %}
    first paste: {% paste %}
    {% def foo notbar %}
    second paste: {% paste %}"
  prints:
    "

    first paste: foo is bar

    second paste: foo is notbar"
```

#### for

```
  Simple for loop used to render a chunk of text multiple times.
  ex: "{% for x in range(2) %}{% x %},{% endfor %}" -> "1,2,"

  Usage: for <ident> in range(stop)
                        range(start, stop)
                        range(start, stop, step)
         for <ident> in space separated list " argument with spaces"


  range can be combined with the deflist command to iterate multiple lists:

    "{% deflist names alice john frank %} {% deflist ages 23 31 19 %}
    {% for i in range(3) %}{% names {% i %} %} (age {% ages {% i %} %})
    {% endfor %}"

  prints:

    "
    alice (age 23)
    john (age 31)
    frank (age 19)
    "
```

#### if

```
  Used to select wether or not to render a chunk of text
  based on simple conditions
  ex :
    {% if def identifier %}, {% if ndef identifier %}...
    {% if {% var %}==str_value %}, {% if {% var %}!=str_value %}...

  Usage: {% if <condition> %} ...
         [{% elif <condition> %} ...]
         [{% else %}...]
         {% endif %}

  Condition syntax is as follows
    simple_condition =
      | true | false | 1 | 0 | <string>
      | def <identifier> | ndef <identifier>
      | <str> == <str> | <str> != <str>

    condition =
      | <simple_condition> | not <simple_condition>
      | <condition> and <condition>
      | <condition> or <condition>
      | (<condition>)
```

#### repeat

```
  Used to repeat a block of text a number of times

  Usage: repeat <number>

  Ex: "{% repeat 4 %}a{% endrepeat %}" prints "aaaa".

  Unlike {% for x in range(3) %}, {% repeat 3 %} only
    renders the block once and prints three copies.
```

#### verbatim

```
  Used to paste contents without parsing them
  Stops at first {% endverbatim %} not matching a {% verbatim %}.

  Ex:
    "{% verbatim %}some text with symbols {% and %}{% endverbatim %}"
  Prints:
    "some text with symbols {% and %}"

  Ex:
    "{% verbatim %}some text with {% verbatim %}nested verbatim{% endverbatim %}{% endverbatim %}"
  Prints:
    "some text with {% verbatim %}nested verbatim{% endverbatim %}"
```

#### void

```
  This block is parsed but not printed.
  Use it to place comments or a bunch of def
  without adding whitespace
```


---

## Defining commands, blocks and final actions

This package is designed to simply add new commands and blocks:

- **commands**: they are classes inherinting from command, they should define `__call__` and doc signature:
	```Python
	class MyCommand(Command):
	    def __call__(self, p: Preprocessor, args: str) -> str:
 	        ...

 	    doc = "documentation for my command"
	```

	The first argument is the preprocessor object, the second is the args string entered after the command. For example when calling `{% command_name some args %}` args will contain `" some args "` including leading/trailing spaces.

	The return value is the string to be inserted instead of the command call.

	Command are stored in the preprocessor's `command` dict. They can be added with:

	```Python
	# adds the command to all new Preprocessor objects
	Preprocessor.commands["command_name"] = MyCommand()
	# adds the command to a specific Preprocessor object
	my_preproc_obj.commands["command_name"] = MyCommand()
	```

- **blocks**: they are classes with signature:
 	```Python
 	class MyBlock(Block):

 	    def __call__(self, p: Preprocessor, args: str, block_contents: str) -> str:
 	        ...

 	    doc = "my block documentation"
 	```

	`args` is the blocks argument, just like in commands, and `block_contents` is everything between `{% block args %}` and `{% endblock %}`.

	They return a string that replaces the whole block `{% block ... %}...{% endblock %}`

	Blocks are stored in the preprocessor's `blocks` dict. They can be added with:

	```Python
	Preprocessor.blocks["block_name"] = MyBlock()
	```

- **final actions**: they have the same signature as commands:
	```Python
	def final_action_function(p: Preprocessor, text: str) -> str
	```

	Here the `text` arg is the whole text (with all commands rendered).

	The action returns the transformed text.

	Actions are stored in the preprocessor's `final_actions` list, in the order in which they are to be executed. They can be added with:

	```Python
	# adds a post action to the whole class
	Preprocessor.final_actions.append(post_action_function)
	# adds a post action to a specific object
	preprocessor_obj.final_actions.append(post_action_function)
	```

### Useful functions

Some useful functions and attribute that are useful when defining commands or blocks

- `Preprocessor.split_args(self, args: str) -> List[str]` - use split a command or block arguments like the command line would.
	You can then parse them with `argparse`. However, `argparse` exits on parsing errors, so the module provide
	```Python
	class ArgumentParserNoExit(argparse.ArgumentParser):
	```
	which raises `argparse.ArgumentError` instead of exiting, allowing errors to be caught and passed to the preprocessor error handling system.
- `Preprocessor.send_error(self, name: str, msg: str)` - sends an error (and exits). Errors should be only fatal problems. Non-fatal problems should be warnings.
- `Preprocessor.send_warning(self, name: str, msg: str)` - sends a warning.
- `Preprocessor.current_position: Position` - variable containing all position info.
- `Preprocessor.parse(self, string: str) -> str` - processed the string commands and blocks and returns the parsed version
	It can be used for block contents, recursive defines, or any text which has preprocessor syntax.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/dlesbre/mlpproc",
    "name": "mlpproc",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": null,
    "keywords": "mlpproc preprocessor preprocess markup language python terminal",
    "author": "Dorian Lesbre",
    "author_email": "dorian.lesbre@gmail.com",
    "download_url": null,
    "platform": "any",
    "description": "# MLPProc : Markup Language Pre-Preprocessor\n\n[![Maintenance](https://img.shields.io/badge/Maintained%3F-yes-brightgreen.svg)](https://github.com/dlesbre/mlpproc/graphs/commit-activity)\n[![PyPI version](https://img.shields.io/pypi/v/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)\n[![PyPI pyversions](https://img.shields.io/pypi/pyversions/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)\n[![License](https://img.shields.io/pypi/l/mlpproc.svg)](https://github.com/dlesbre/mlpproc/blob/master/LICENSE)\n[![PyPI status](https://img.shields.io/pypi/status/mlpproc.svg)](https://pypi.python.org/pypi/mlpproc/)\n[![Downloads](https://pepy.tech/badge/mlpproc)](https://pepy.tech/project/mlpproc)\n\nSimple program to preprocess text files. It is inspired by the C preprocessor and should work with any language.\n\n## Contents\n\n- [Contents](#contents)\n- [Installation](#installation)\n- [Manual installation](#manual-installation)\n- [Preprocessor syntax](#preprocessor-syntax)\n  - [Basic syntax](#basic-syntax)\n  - [Nesting and resolution order](#nesting-and-resolution-order)\n- [Command line usage](#command-line-usage)\n- [Python usage](#python-usage)\n- [Command and block reference](#command-and-block-reference)\n  - [Commands](#commands)\n  - [Blocks](#blocks)\n- [Defining commands, blocks and final actions](#defining-commands-blocks-and-final-actions)\n  - [Useful functions](#useful-functions)\n\n## Installation\n\nThis package has no dependencies. To install simply run\n```console\npip install mlpproc\n```\n\nOnce installed you can call the preprocessor with\n```console\nmlpp --version\npython -m mlpproc --version\n```\n\n## Manual installation\n\nFor a manual installation run\n\n```console\ngit clone https://github.com/dlesbre/mlpproc.git &&\ncd mlpproc &&\npython3 -m venv venv &&\nsource venv/bin/activate &&\nmake setup\n```\n\nFor development install, run `make setup-dev` to install extra dependencies and setup pre-commit.\n\n## Preprocessor syntax\n\n### Basic syntax\n\nThe preprocessor instructions are wrapped between \"{%\" and \"%}\". These tokens can be changed if they conflict with the syntax of the file's langage. Instructions are case-sensitive.\n\nPreprocessor instructions are split in three categories :\n\n- **Commands**: `{% command [args] %}`\n\n\tCommands print text where they are placed. For instance `{% date %}` prints the current date.\n\tSome special commands print no text but perform actions. `{% def name my_name %}` prints nothing but defines a new command `{% name %}` which prints `my_name`.\n\n- **Blocks**: `{% block_name [args] %} ... some text ... {% endblock_name %}`\n\n\tBlocks work very similarly to commands: they wrap around some text and alter it in some way. For instance the `{% verbatim %}` block prints all text in itself verbatim, without rendering any of the commands.\n\n- **Final actions**: some actions can be queued by special commands. They occur once every command and block in the file has been rendered and affect the whole current file. For instance `{% replace foo bar %}` will replace all instances of \"foo\" with \"bar\" in the whole rendered file (including occurrences before the command is called). Final actions can be restricted to a smaller part of the document with `{% block -a %}...{% endblock %}` :\n\n\t\tsome text... foo here is not replaced\n\t\t{% begin block -a %}\n\t\t\tfoo here is replaced\n\t\t\t{% replace foo bar %}\n\t\t\tfoo here is replaced\n\t\t\t{% begin block %}\n\t\t\t\tfoo here is also replaced\n\t\t\t{% endblock %}\n\t\t\t{% command foo as argument isn't replaced %}\n\t\t\t{% command that prints foo %} will be replaced\n\t\t{% endblock %}\n\nFor a list of command run `mlpp -h commands` or see the [command and block reference](#command-and-block-reference).\n\n### Nesting and resolution order\n\nCommands can be nested within one another : `{% foo {% bar %} %}`. The most innermost command is called first. So here `{% bar %}` is called, then `{% foo bar_output %}` is called. You can even do `{% {% foo %} %}` which will call `{% foo %}` first, then call `{% foo_output %}`. Note that this will fail if `foo_output` isn't a valid command, just like the previous command will fail is `bar_output` isn't a valid argument for `foo`.\n\nNesting can also be used for block arguments, but *it can NOT be used for block names and endblock*. This will likely fail block resolution and result in matching the wrong endblock or no endblock.\n\n---\n\n## Command line usage\n\nThe preprocessor can be called from the command line with:\n\n```console\nmlpp [--flags] [input_file]\npython3 -m mlpproc [--flags] [input_file]\n```\n\nThe default input file is `stdin`. Command line options are:\n\n- `-o --output <file>` specifies a file to write output to. Default is stdout\n- `-b --begin <string>` change the begin token (default is `\"{% \"`)\n- `-e --end <string>` change the end token (default is `\" %}\"`)\n- `-r --recursion_depth <number>` set the max recursion depth (default {rec}). Use -1 for no maximum recursion (dangerous)\n- `-d -D --define <name>[=<value>]` defines a simple command with name `<name>` which prints `<value>` (nothing if no value). Can be used multiple times on command line\n- `-i -I --include <path>` Adds paths to the INCLUDE_PATH. default INCLUDE_PATH is `[\".\", dir(input_file), dir(output_file)]`. Can be used multiple times on command line\n- `w --warnings <hide|error>` choose whether to hide warnings or have them raise an error. default is display.\n- `s --silent <warning_name>` silence a specific warning (ex: `\"extra-arguments\"`)\n- `v --version` show version and exit\n- `h --help` show this help and exit\n- `h --help commands` show a list of commands and blocks and exit\n- `h --help <cmd_name>` show help for a specific command of block\n\n## Python usage\n\nThe package can be imported in python 3 with `import mlpproc`. This imports a `Preprocessor` class with all default commands included (see [list](#command-and-block-reference)). The simplest way to use the preprocessor is then:\n\n```Python\nimport mlpproc\n\npreprocessor = mlpproc.Preprocessor()\n\nparsed_contents = preprocessor.process(file_contents, filename)\n```\n\nThe filename is only needed for pretty error reports, and can be\n\nYou can configure the preprocessor directly via it's public attributes:\n\n- `max_recursion_depth: int` (default 20) - raises an error past this depth\n- `token_begin: str` and `token_end: str` (default \"{% \" and \" %}\") - tokens wrapping preprocessor calls in the document. They should not be equal or be a single or double quote (`'` and `\"`) or parentheses `(` or `)`.\n- `token_endblock: str` (default \"end\") - specifies what form the endblock command takes with the regex `<token_begin>\\s*<token_endblock><block_name>\\s*<token_end>`\n- `safe_calls: bool` (default True) - if True, catches exceptions raised by command or blocks\n- `error_mode: mlpproc.ErrorMode` (default RAISE), how errors are handled:\n\t- PRINT_AND_EXIT -> print to stderr and exit\n\t- PRINT_AND_RAISE -> print to stderr and raise exception\n\t- RAISE -> raise exception\n- `warning_mode: mlpproc.WarningMode` (default RAISE)\n\t- HIDE -> do nothing\n\t- PRINT -> print to stderr\n\t- RAISE -> raise python warning\n\t- AS_ERROR -> passes to self.send_error()\n- `use_color: bool` (default False) if True, uses ansi color when printing errors\n\n\n\n---\n\n## Command and block reference\n\nHere follows a list of predefined commands and blocks. An up-to-date list can be found by running `mlpp -h commands` and detailed descriptions obtained by running `mlpp -h <command_name>`.\n\n### Commands\n\n#### begin\n\n```\n  Prints the current begin token (default \"{%\")\n\n  Usage: begin [<number>]\n  The optional number is used for recursion calls\n    begin     -> \"{%\"\n    begin 0   -> \"{%\"\n    begin <n> -> \"{% begin <n-1> %}\"\n```\n\n#### call\n\n```\n  Prints a call to its arguments.\n\n  Ex: \"{% call my_command my_args %}\" -> \"{% my_command my_args %}\"\n  Useful in defs to use recursive calls.\n  For recursion you can stack calls:\n  \"{% call call ... %}\" -> \"{% call ... %}\"\n```\n\n#### capitalize\n\n```\n  Converts text to Capitalized case\n\n  usage: capitalize [text]\n\n  If text is present, converts text\n  else converts everything in the document (can be restricted with block).\n```\n\n#### date\n\n```\n  Prints the current date.\n\n  Usage: date [format=YYYY-MM-DD]\n    format specifies year with YYYY or YY, month with MM or M,\n    day with DD or D, hour with hh or h, minutes with mm or m\n    seconds with ss or s\n```\n\n#### def\n\n```\n  Defines a new command or macro.\n\n  Usage:\n   def foo               -> defines empty foo command (prints nothing)\n   def foo   some text   -> {% foo %} prints \"some text\"\n                            (strips trailing/leading space)\n   def foo \" some text \" -> {% foo %} prints \" some text \"\n   def foo(arg1, arg2) text with arg1 and arg2\n      -> {% foo bar \"hi there\" %} prints \"text with bar and hi there\"\n\n  def overwrites old commands and blocks irreversibly.\n  All defs are global, including those coming from sub-blocks and included files.\n\n  The text in a def is effectively rendered twice:\n  - the text is rendered once at definition (due to nesting)\n  - when the new command is called argument substitution takes place first\n    and the text is re-rendered before printing\n  defs can use nesting and recursive calls using command like call, begin and end.\n\n    {% def name john %}\n\n    // name is evaluated before def\n    {% def rec1 {% name %} %}\n\n    // call evaluated before def, prints {% name %}\n    // which will be evaluated when define is called\n    {% def rec2 {% call name %} %}\n\n    // 1rst call evaluated in define, prints {% call name %}\n    // which will be evaluated when define is called\n    {% def rec3 {% call call name %} %}\n\n    {% def name alice %}\n    {% rec1 %} -> prints john\n    {% rec2 %} -> prints alice\n    {% rec3 %} -> prints {% name %}\n\n  defs can be overloaded on the number of arguments:\n\n    {% def sum(a,b) a+b %}\n    {% def sum(a)   {% sum a 0 %} %}\n    {% sum 5 10 %} -> prints 5+10\n    {% sum 5 %}    -> prints 5+0\n```\n\n#### deflist\n\n```\n  Defines a new command for a list of objects.\n\n  Usage: deflist list_name space separated list \" element with spaces \"\n\n  Defines list_name such that:\n          list_name          prints the lists\n          list_name <number> prints the n-th element\n                             (number must be a between -length+1 and length+1)\n\n  Can be used in combination with the for block to iterate multiple lists in a loop.\n```\n\n#### end\n\n```\n  Prints the current end token (default \"%}\")\n\n  Usage: end [<number>]\n  The optional number is used for recursion calls\n    end     -> \"%}\"\n    end 0   -> \"%}\"\n    end <n> -> \"{% end <n-1> %}\"\n```\n\n#### error\n\n```\n  Raises an error.\n  Use with if block to raise errors if conditions are not met.\n\n  Usage: error [message]\n```\n\n#### filename\n\n```\n  Prints the name of the current file being parsed.\n```\n\n#### fix_first_line\n\n```\n  Ensures the document starts with a non-empty\n  line (unless it is empty)\n```\n\n#### fix_last_line\n\n```\n  Ensures the file ends with a single empty\n  line (unless it is empty)\n```\n\n#### include\n\n```\n  Includes the content of another file.\n\n  Usage: include [--options] path\n    path can be absolute or relative to\n    any path in include_path: [current_working_dir, input_file_dir, output_file_dir]\n    paths can be added to include_path with the --include/-i/-I preprocessor option\n\n  Options:\n    -b --begin <string> specify the begin token (\"{%\")\n                        defaults to the same as current file\n    -e --end   <string> specify the end token (\"%}\")\n                        defaults to the same as current file\n    -v --verbatim       when present, includes files as is, without parsing.\n```\n\n#### input_name\n\n```\n  Prints name of input file\n  (only defined when called via mlpp or mlpproc.__main__.preprocessor_main())\n```\n\n#### label\n\n```\n  Adds a label at the current position\n\n  Usage: label <label_name>\n  Where label_name must be a valid identifier.\n\n  Can be used in combination with the atlabel block\n  to place text at all occurrences of a label.\n```\n\n#### line\n\n```\n  Prints the current line number.\n  This is the line number of the command in the input file, the line\n  in the output file may differ due to insertions/deletions.\n```\n\n#### lower\n\n```\n  Converts text to lower case\n\n  usage: lower [text]\n\n  If text is present, converts text\n  else converts everything in the document (can be restricted with block).\n```\n\n#### output_name\n\n```\n  Prints name of output file\n  (only defined when called via mlpp or mlpproc.__main__.preprocessor_main())\n```\n\n#### paste\n\n```\n  Pastes the contents of a clipboard (defined in a cut block)\n\n  Usage: paste [-v|--verbatim] [clipboard]\n    if --verbatim is set, paste the text as is, without rendering it\n    clipboard is a string identifying the clipboard (default \"\").\n    it must match a previous cut block's clipboard argument\n```\n\n#### replace\n\n```\n  Used to find and replace text\n\n  Usage: replace [--options] pattern replacement [text]\n\n  If text is present, replacement takes place in text.\n  else it takes place in the whole document (can be restricted with block)\n\n  Options:\n    -c --count <number> number of occurrences to replace (default all)\n    -i --ignore-case    pattern search ignores case (foo will match foo,FoO,FOO...)\n    -w --whole-word     pattern only matches full words, i.e. occurrences not directly\n                        preceded/followed by a letter/number/underscore.\n    -r --regex          pattern is a regular expression, capture groups can be placed\n                        in replacement with \\1, \\2,...\n                        incompatible with --whole-word\n```\n\n#### strip\n\n```\n  Removes empty lines as well as trailing/leading whitespace.\n  Ensures file ends on a single empty line\n```\n\n#### strip_empty_lines\n\n```\n  Removes empty lines (lines containing only spaces)\n```\n\n#### strip_leading_whitespace\n\n```\n  Removes leading whitespace (indent)\n```\n\n#### strip_trailing_whitespace\n\n```\n  Removes trailing whitespace\n```\n\n#### undef\n\n```\n  Undefines a command or block.\n  This is irreversible and can undefine built-in commands and blocks.\n\n  Usage: undef name\n```\n\n#### version\n\n```\n  Prints the mlpp version.\n```\n\n#### warning\n\n```\n  Raises a warning.\n  Use with if block to raise warnings if conditions are not met.\n\n  Usage: warning [message]\n```\n\n### Blocks\n\n#### atlabel\n\n```\n  Renders a chunk of text and places it at all labels matching\n  its label when processing is done.\n\n  Usage: atlabel <label>\n\n  It differs from the cut block in that:\n  - it will also print its content to calls of {% label XXX %} preceding it\n  - it can't be overwritten (at most one atlabel block per label)\n  - the text is rendered just once, at the atlabel block definition\n    (and not in where the text will be placed)\n\n  ex:\n    \"{% def foo bar %}\n    first label: {% label my_label %}\n    {% atlabel my_label %}foo is {% foo %}{% endatlabel %}\n          {% def foo notbar %}\n    second label: {% label my_label %}\"\n  prints:\n    \"\n    first label: foo is bar\n\n    second label: foo is bar\"\n\n  Can be used in combination with include to create files inheriting\n  from a common base.\n```\n\n#### block\n\n```\n  Block used to restrict action/defs/labels... to a local part of the files\n  Can be very useful to wrap an include\n\n  usage: block [--options]\n\n  options:\n    -b --begin <string>  change the begin token (default is same as current)\n    -e --end <string>    change the end token (default is same as current)\n    -d --local-defs      commands defined and undefined in the block are local\n    -a --local-actions   final actions called in the block will only affect the block\n                         use this to restrict replace, upper, ... to a section\n    -c --local-clipboard the clipboard defined by cut in the block are local\n    -l --local-labels    labels defined in the block are local, so they can\n                         only be written to by local atlabel blocks\n\n  Just like the verbatim block, changing begin and end means that the block will\n  end at the first {% endblock %} not matching a {% block %}:\n\n    {% block -b < -e > %}\n      {% block blabla %}\n      ...\n      {% endblock %} // this endblock is ignored\n    {% endblock %} // block ends here\n```\n\n#### comment\n\n```\n  This block is a comment.\n  All text until the next 'endcomment' is ignored\n```\n\n#### cut\n\n```\n  Used to cut a section of text to paste elsewhere.\n  The text is processed when pasted, not when cut\n\n  Usage: cut [--pre-render|-p] [<clipboard_name>]\n    if --pre-render - renders the block here\n      (it will be re-rendered at time of pasting, unless using paste -v|--verbatim)\n    clipboard is a string identifying the clipboard, default is \"\"\n\n  ex:\n    {% cut %}foo is {% foo %}{% endcut %}\n    {% def foo bar %}\n    first paste: {% paste %}\n    {% def foo notbar %}\n    second paste: {% paste %}\"\n  prints:\n    \"\n\n    first paste: foo is bar\n\n    second paste: foo is notbar\"\n```\n\n#### for\n\n```\n  Simple for loop used to render a chunk of text multiple times.\n  ex: \"{% for x in range(2) %}{% x %},{% endfor %}\" -> \"1,2,\"\n\n  Usage: for <ident> in range(stop)\n                        range(start, stop)\n                        range(start, stop, step)\n         for <ident> in space separated list \" argument with spaces\"\n\n\n  range can be combined with the deflist command to iterate multiple lists:\n\n    \"{% deflist names alice john frank %} {% deflist ages 23 31 19 %}\n    {% for i in range(3) %}{% names {% i %} %} (age {% ages {% i %} %})\n    {% endfor %}\"\n\n  prints:\n\n    \"\n    alice (age 23)\n    john (age 31)\n    frank (age 19)\n    \"\n```\n\n#### if\n\n```\n  Used to select wether or not to render a chunk of text\n  based on simple conditions\n  ex :\n    {% if def identifier %}, {% if ndef identifier %}...\n    {% if {% var %}==str_value %}, {% if {% var %}!=str_value %}...\n\n  Usage: {% if <condition> %} ...\n         [{% elif <condition> %} ...]\n         [{% else %}...]\n         {% endif %}\n\n  Condition syntax is as follows\n    simple_condition =\n      | true | false | 1 | 0 | <string>\n      | def <identifier> | ndef <identifier>\n      | <str> == <str> | <str> != <str>\n\n    condition =\n      | <simple_condition> | not <simple_condition>\n      | <condition> and <condition>\n      | <condition> or <condition>\n      | (<condition>)\n```\n\n#### repeat\n\n```\n  Used to repeat a block of text a number of times\n\n  Usage: repeat <number>\n\n  Ex: \"{% repeat 4 %}a{% endrepeat %}\" prints \"aaaa\".\n\n  Unlike {% for x in range(3) %}, {% repeat 3 %} only\n    renders the block once and prints three copies.\n```\n\n#### verbatim\n\n```\n  Used to paste contents without parsing them\n  Stops at first {% endverbatim %} not matching a {% verbatim %}.\n\n  Ex:\n    \"{% verbatim %}some text with symbols {% and %}{% endverbatim %}\"\n  Prints:\n    \"some text with symbols {% and %}\"\n\n  Ex:\n    \"{% verbatim %}some text with {% verbatim %}nested verbatim{% endverbatim %}{% endverbatim %}\"\n  Prints:\n    \"some text with {% verbatim %}nested verbatim{% endverbatim %}\"\n```\n\n#### void\n\n```\n  This block is parsed but not printed.\n  Use it to place comments or a bunch of def\n  without adding whitespace\n```\n\n\n---\n\n## Defining commands, blocks and final actions\n\nThis package is designed to simply add new commands and blocks:\n\n- **commands**: they are classes inherinting from command, they should define `__call__` and doc signature:\n\t```Python\n\tclass MyCommand(Command):\n\t    def __call__(self, p: Preprocessor, args: str) -> str:\n \t        ...\n\n \t    doc = \"documentation for my command\"\n\t```\n\n\tThe first argument is the preprocessor object, the second is the args string entered after the command. For example when calling `{% command_name some args %}` args will contain `\" some args \"` including leading/trailing spaces.\n\n\tThe return value is the string to be inserted instead of the command call.\n\n\tCommand are stored in the preprocessor's `command` dict. They can be added with:\n\n\t```Python\n\t# adds the command to all new Preprocessor objects\n\tPreprocessor.commands[\"command_name\"] = MyCommand()\n\t# adds the command to a specific Preprocessor object\n\tmy_preproc_obj.commands[\"command_name\"] = MyCommand()\n\t```\n\n- **blocks**: they are classes with signature:\n \t```Python\n \tclass MyBlock(Block):\n\n \t    def __call__(self, p: Preprocessor, args: str, block_contents: str) -> str:\n \t        ...\n\n \t    doc = \"my block documentation\"\n \t```\n\n\t`args` is the blocks argument, just like in commands, and `block_contents` is everything between `{% block args %}` and `{% endblock %}`.\n\n\tThey return a string that replaces the whole block `{% block ... %}...{% endblock %}`\n\n\tBlocks are stored in the preprocessor's `blocks` dict. They can be added with:\n\n\t```Python\n\tPreprocessor.blocks[\"block_name\"] = MyBlock()\n\t```\n\n- **final actions**: they have the same signature as commands:\n\t```Python\n\tdef final_action_function(p: Preprocessor, text: str) -> str\n\t```\n\n\tHere the `text` arg is the whole text (with all commands rendered).\n\n\tThe action returns the transformed text.\n\n\tActions are stored in the preprocessor's `final_actions` list, in the order in which they are to be executed. They can be added with:\n\n\t```Python\n\t# adds a post action to the whole class\n\tPreprocessor.final_actions.append(post_action_function)\n\t# adds a post action to a specific object\n\tpreprocessor_obj.final_actions.append(post_action_function)\n\t```\n\n### Useful functions\n\nSome useful functions and attribute that are useful when defining commands or blocks\n\n- `Preprocessor.split_args(self, args: str) -> List[str]` - use split a command or block arguments like the command line would.\n\tYou can then parse them with `argparse`. However, `argparse` exits on parsing errors, so the module provide\n\t```Python\n\tclass ArgumentParserNoExit(argparse.ArgumentParser):\n\t```\n\twhich raises `argparse.ArgumentError` instead of exiting, allowing errors to be caught and passed to the preprocessor error handling system.\n- `Preprocessor.send_error(self, name: str, msg: str)` - sends an error (and exits). Errors should be only fatal problems. Non-fatal problems should be warnings.\n- `Preprocessor.send_warning(self, name: str, msg: str)` - sends a warning.\n- `Preprocessor.current_position: Position` - variable containing all position info.\n- `Preprocessor.parse(self, string: str) -> str` - processed the string commands and blocks and returns the parsed version\n\tIt can be used for block contents, recursive defines, or any text which has preprocessor syntax.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "MLPP: Markup Language Pre-Processor for text files (code/html/tex/...) inspired by the C preprocessor",
    "version": "1.0.3",
    "project_urls": {
        "Homepage": "https://github.com/dlesbre/mlpproc"
    },
    "split_keywords": [
        "mlpproc",
        "preprocessor",
        "preprocess",
        "markup",
        "language",
        "python",
        "terminal"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "65259c4d42dfe35c2060a41fa46772632df69e7b6d26881c0dd56084211d6b65",
                "md5": "96b426d27eafb41f261cc47dea33e48d",
                "sha256": "258a5054032e63be914a9e704517d4c46d930a016d4cc52f4da5f0ade174ee48"
            },
            "downloads": -1,
            "filename": "mlpproc-1.0.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "96b426d27eafb41f261cc47dea33e48d",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 44289,
            "upload_time": "2024-05-26T11:43:20",
            "upload_time_iso_8601": "2024-05-26T11:43:20.927830Z",
            "url": "https://files.pythonhosted.org/packages/65/25/9c4d42dfe35c2060a41fa46772632df69e7b6d26881c0dd56084211d6b65/mlpproc-1.0.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-05-26 11:43:20",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dlesbre",
    "github_project": "mlpproc",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "mlpproc"
}
        
Elapsed time: 0.55869s