> **Jesth has triggered the creation of two spin-offs to push its ideas further:**
> - [Braq](https://github.com/pyrustic/braq): Customizable data format for config files, AI prompts, and more.
> - [Paradict](https://github.com/pyrustic/paradict): Streamable multi-format serialization with schema.
>
> These projects therefore make Jesth obsolete.
<!-- Cover -->
<div align="center">
<img src="https://raw.githubusercontent.com/pyrustic/misc/master/assets/jesth/cover.png" alt="Demo" width="650">
<p align="center">
JesthFile with 1 anonymous top section, and 2 'user' sections
</p>
</div>
<!-- Intro Text -->
# Jesth
<b> Human-readable versatile data format </b>
This project is part of the [Pyrustic Open Ecosystem](https://pyrustic.github.io).
> [Installation](#installation) [Demo](#demo) [Latest](https://github.com/pyrustic/jesth/tags) [Modules](https://github.com/pyrustic/jesth/tree/master/docs/modules#readme)
## Table of contents
- [Overview](#overview)
- [Why not TOML, YAML, or JSON ?](#why-not-toml-yaml-or-json-)
- [Why use Jesth ?](#why-use-jesth-)
- [Demo](#demo)
- [Code snippets for everyday scenarios](#code-snippets-for-everyday-scenarios)
- [Anatomy of a Section](#anatomy-of-a-section)
- [Base section](#base-section)
- [Dict section](#dict-section)
- [Section family](#section-family)
- [Specs of the Dict Section](#specs-of-the-dict-section)
- [Scalars](#scalars)
- [Null value](#null-value)
- [Single-line strings](#single-line-strings)
- [Multi-line strings](#multi-line-strings)
- [Date and time](#date-and-time)
- [Embedded binary data](#embedded-binary-data)
- [Comments and whitespaces](#comments-and-whitespaces)
- [Collections (nested dict and list)](#collections-nested-dict-and-list)
- [Application programming interface](#application-programming-interface)
- [Jesth Schema for data validation](#jesth-schema-for-data-validation)
- [Available implementations](#available-implementations)
- [Miscellaneous](#miscellaneous)
- [Contributing](#contributing)
- [Installation](#installation)
# Overview
**Jesth** (pronounced [/dÊ’est/](https://dictionary.cambridge.org/pronunciation/english/jest)), which stands for **Just Extract Sections Then Hack'em**, is a human-readable data format whose reference parser is an eponymous lightweight [Python](https://www.python.org/) library available on [PyPI](#installation).
> Read the **backstory** in this [HN discussion](https://news.ycombinator.com/item?id=35991018) !
## Versatile format
A Jesth **document** is made up of **sections**, each defined by a **header** (between two square brackets) and a **body** which is just lines of text.
Instead of interpreting these lines upstream after extracting the sections, the Jesth parser lays low and lets the programmer decide which hack to apply to them.
Because of this parsing policy, a single Jesth document can contain an **eclectic set of sections** representing a poem, server configs, help text, [prompts](https://github.com/f/awesome-chatgpt-prompts) for a chatbot, [directed graph](https://en.wikipedia.org/wiki/Directed_graph), [ascii artwork](https://en.wikipedia.org/wiki/ASCII_art), and more.
> **Jesth** is used by **Shared** for data persistence and exchange. Visit [Shared](https://github.com/pyrustic/shared#readme) !
## Shipped with a proper and extensively tested hack
For convenience, the Jesth library includes a proper and extensively tested [hack]((https://en.wikipedia.org/wiki/Hacker_culture)) to convert a compatible section to a [dictionary](https://en.wikipedia.org/wiki/Associative_array) data structure. In such a section that allows comments, one can encode **strings** (single or multi-line), **scalars** (integer, float, decimal float, complex, booleans), **date** and **time**, **null** value, **binary** data and nested **collections** (list and dictionary).
Therefore, Jesth can be used for configuration files, to create a [scripting language](https://github.com/pyrustic/backstage), to act as a markup language to write [docstrings](https://en.wikipedia.org/wiki/Docstring#Python) so that a tool can generate reference documentation after parsing them, etc.
In the next few sections, we'll dive deep into Jesth, but first, why not [TOML](https://en.wikipedia.org/wiki/TOML), [YAML](https://fr.wikipedia.org/wiki/YAML), or [JSON](https://en.wikipedia.org/wiki/JSON) ?
# Why not TOML, YAML, or JSON ?
Jesth offers a high level of **versatility** that allows the programmer to freely decide what a section can contain and how to interpret its contents.
Moreover, whenever the programmer decides to adopt a dictionary as a data structure to formalize some or all sections of a Jesth document, he/she is offered a [very readable and strict syntax](#specs-of-the-dict-section) to encode the data.
The library exposes an intuitive API for loading and dumping Dict sections with an option to preserve comments and whitespaces.
The following subsections will briefly compare **TOML**, **YAML**, and **JSON** to **Jesth's dictionary data structure support**.
## Why not TOML ?
Jesth shares with **TOML** the use of square brackets to define sections (or tables in TOML terminology), therefore, among other formats, the TOML syntax is closest to Jesth. However, when dealing with nested structures, TOML becomes very confusing, revealing the limitations of its syntax.
Besides intuitiveness and readability, the design decisions behind Jesth [accidentally](https://en.wikipedia.org/wiki/Serendipity) created an **unlimited reserved word pool**. For example, the `[[END]]` header (note the double brackets on either side) is reserved as a document separator, allowing you to mark the end of a Jesth stream or store multiple Jesth documents in a single file.
## Why not YAML ?
**YAML** tightly embraces complexity and ambiguity, which are probably not useful for most programmers' needs. Jesth keeps it minimalist, unambiguous and very readable.
## Why not JSON ?
**JSON** is great because it mimics the way we define data structures in source code. Therefore, it can be minified and is also a reasonable choice for machine-to-machine communication, but it cannot beat the readability of formats that natively support line breaks and/or indentation. In comparison to Jesth, JSON has fewer native data types and no native support for comments.
# Why use Jesth ?
Jesth is ideal for configuration files, whether intended for use by a standard application or by a chatbot/AI. Personal notes can also benefit from being entered into a Jesth file, so they can be parsed later. As mentioned in the [Overview](#overview) section, Jesth was used to design a [scripting language](https://github.com/pyrustic/backstage) and currently serves as the markup language for [docstrings](https://en.wikipedia.org/wiki/Docstring#Python) in all projects that are part of the [Pyrustic ecosystem](https://pyrustic.github.io).
Loading a Jesth document is stress-free because exceptions are only thrown when you try to convert a non-compatible section to a dictionary. But even that is the strict default behavior of the dictionary converter since you can customize the converter to string invalid values. This specific customization is as simple as this: `value_converter.fallback_decoder = str`
There's more to say, for example the Jesth Dict Section syntax is text editor/IDE-friendly, so it may benefit from supporting some cool features like the folding mechanism.
# Demo
The demo is a clonable repository containing a JesthFile and a Python script to load its contents.
> [Open the Demo](https://github.com/pyrustic/jesth-demo#readme) !
# Code snippets for everyday scenarios
**Load a JesthFile:**
```python
from jesth import read
path = "/home/alex/jesthfile"
document = read(path)
```
**Get sections:**
```python
...
# get the "config" section
config_section = document.get("config")
# get the list of all "user" sections
user_sections = document.get_all("user")
# show the body of a section
print(config_section.body) # list of lines
# show the body as a text (concatenated lines)
print(config_section.to_text())
# update the body of a section
new_body = [r"# comment",
r"server1 = '1.1.1.1'",
r"port = 80"]
config_section.update(new_body)
```
**Convert a compatible section into a Python Dict:**
```python
...
# won't raise an exception if the section can't be converted to dict
dict_object = section.to_dict()
# will raise an exception if the section can't be converted to dict
dict_object = section.create_dict()
# Note that `.create_dict` accepts `default` and `strict` arguments
# If the body can be converted to a dict but is empty, you get `default`
# Set True to `strict` to preserve comments and whitespaces
# Note that `.to_dict` accepts `default`, `strict`, and `fallback`
# If the body can't be converted to a dict, you get `fallback`
```
**Edit a document:**
```python
...
# append a new "user" section at the end of document
body = {"name": "alex", "id": 420}
document.append("user", body) # body as a regular Python dict
# insert a new "user" section at the beginning of document
body = [r"# comment",
r"name = 'alex'",
r"id = 420"]
document.insert(0, "user", body) # body as a list of lines
```
**Save a document:**
```python
from jesth import write
...
# if the document is linked to a JesthFile, call its `save` method
document.save()
# or save to a new path
document.save_to(path)
# Under the hood, the Document class actually uses the `write` function
write(document, path)
```
**Render a document:**
```python
from jesth import render
...
# return the string representation of the data
# this jesth text can be stored as it in a JesthFile
jesth_text = document.render()
# Under the hood, the Document class actually uses the `render` function
jesth_text = render(document)
```
# Anatomy of a Section
A **section** is made up of two parts: a **header** and a **body**. The header is defined on its own line, surrounded by two single brackets (opening and closing brackets respectively).
The body is what is between two consecutive headers or between a header and the end of the document.
**Example of headers:**
```
[this is a header]
[this_is_another_header]
\[this is not a header at all]
[[this is very illegal]]
\[[this is not illegal at all but still not a header]]
```
> **Note:** A header may be empty and referenced from the source code with an empty string. Sections with empty headers are called `anonymous sections`.
**Example of two anonymous sections:**
```
This text actually belongs to an anonymous section,
which happens to also be the first section of this document
[]
This text actually belongs to the second anonymous section
of this document.
If the very first section of a document is anonymous, it can
ignore to define its header.
```
> **Note:** As you noticed, two sections may have similar headers. Read more about this in the 'Section family' section.
## Base section
Is called Base section a section as defined in the document, without any transformation, i.e., just text lines. From the source code perspective, the body of such section is represented by default as a list of lines (string).
The alternative representation of such body is just a concatenated version of this list.
## Dict section
Is called Dict section, a section that is intended to be converted into a dictionary object by the parser.
By default, the library uses the regular `dict` type to represent the body of such section. This default behavior can be customized through the `ValueConverter` object.
## Section family
Different sections can have the same header. In this case, they belong to a **section family**. By default, the `.get(header)` method from the `Document` class returns the first defined section with the given header argument. This method actually accepts a `sub_index` argument which is set to `0` by default to access the first item of the section family.
**Get the 2nd section in the 'user' section family:**
```python
...
section = document.get("user", sub_index=1)
```
**Get the list of all sections in the 'user' section family:**
```python
...
sections = document.get_all("user")
```
# Specs of the Dict Section
The specs for a Dictionary Section are mostly aligned with Python specs. For example, a Jesth Integer follows exactly the definition of Python Integer.
> **Note:** The Jesth library allows the customization of each type of data as well in the context of parsing or in the context of rendering. This customization is made via the [ValueConverter](#the-valueconverter-object) object
## Scalars
Jesth supports integers (base 10, hexadecimal, octal, binary), float (approximate binary floating-point, fixed-precision decimal floating-point), complex numbers, and booleans.
> **Note:** Jesth Scalars specs are aligned with their respective Python equivalent specs.
### Integer
Jesth Dict section supports Integer whose specs is aligned with Python `int` type specs.
**Example:**
```
int = -4_200_000
hex_int = 0xAE1F
oct_int = 0o3427
bin_int = 0b101_0011_1010_1010
```
> **Note:** By default, Jesth tidy up numbers by using the underscore convention.
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|integer|`int`|
|hexadecimal int|`jesth.box.HexInt` subclassing `int`|
|octal int|`jesth.box.OctInt` subclassing `int`|
|binary int|`jesth.box.BinInt` subclassing `int`|
### Float
Jesth Dict section supports approximate binary floating-point and fixed-precision decimal floating-point which should be represented in the scientific notation.
**Approximate binary floating-point:**
```
float = 3.14
```
**fixed-precision decimal floating-point:**
```
decimal_float = 3.14E0
```
> **Note:** By default, Jesth tidy up float numbers by using the underscore convention.
**Decimal float with underscores:**
```
decimal_float = 3.141_592_653_589_793_238_46E0
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|binary floating-point|`float`|
|decimal floating-point|`decimal.Decimal`|
### Complex
Jesth Dict section supports complex numbers.
**Example:**
```
my_complex = -4.2+6.9i
```
The number in the example above is equivalent to this in Python:
```python
my_complex = -4.2+6.9j
# or
my_complex = complex(-4.2, 6.9)
```
> **Note:** The only difference between the Jesth complex and Python complex is the usage of `i` as suffix by Jesth instead of `j` like Python.
### Bool
Jesth Dict section supports booleans.
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|true|`True`|
|false|`False`|
## Null value
Jesth Dict section supports the `null` value that translates to Python `None` singleton.
## Single-line strings
Jesth Dict Section supports single-line strings. Such strings may embed unicode escape sequence and should be enclosed with double-quotes. Raw strings (WYSIWYG) are also supported and in these ones should be enclosed with single-quotes.
**String embedding unicode escape sequences:**
```
my_string = "this is a \n single-line string \u000A with line breaks"
another_string = "string containing "double quotes" and a \\ backslash"
```
> **Note:** The Backslash character is a special one here. It is used to escape a backslash or to make a unicode escape sequence. It shouldn't be used to escape a quote.
**Raw string (WYSIWYG):**
```
my_string = 'this is a \n single-line string \u000A with no line break'
another_string = 'string containing 'single quotes' and a \ backslash'
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|Double quotes String|`str`|
|Single quotes String|`jesth.box.RawString`|
> The Jesth single-line String type is aligned with Python `str` type.
## Multi-line strings
Multi-line strings work like single-line strings with one major difference: they can actally span on multilines !
Creating multi-line strings implies using tags and indentation. The Jesth name for a multi-line string that supports unicode escape sequences is `Text` and its tag is `(text)`. For raw multi-line string, the Jesth name is `Raw` and its tag is `(raw)`.
A multi-line string is written with a 4-spaces indent under its tag. This indented block under the tag is then closed with a `---` separator symbol placed on its own line.
**Multi-line string that supports unicode escape sequences:**
```
multiline = (text)
This is a multi-line
string that supports \\
unicode escape sequences \u000A
---
empty_text = (text)
---
```
**Raw multi-line string:**
```
multiline = (raw)
This is a raw multi-line \
string \u000A
---
empty_raw = (raw)
---
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|Text|`jesth.box.Text` subclassing `str`|
|Raw|`jesth.box.Raw` subclassing `str`|
## Date and time
Jesth Dict Section supports date and time (ISO 8601).
**Date and time:**
```
datetime = 2020-10-20T15:35:57Z+01:00
date = 2020-10-20
time = 15:35:57
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|datetime|`datetime.datetime`|
|date|`datetime.date`|
|time|`datetime.time`|
## Embedded binary data
Jesth Dict Section supports binary data. Under the hood, standard base64 (RFC 4648) is used. For convenience, the Jesth parser allows for line breaks and the Jesth renderer outputs base64 lines of 76 characters.
Embedding binary data encoded with base64 implies the use of a specific tag and indentation. The Jesth name for binary data is `Bin` and its tag is `(bin)`.
Binary data encoded with base64 is written with a 4-spaces indent under the `(bin)` tag. This indented block under the tag is then closed with a `---` separator symbol placed on its own line.
```
bin = (bin)
TG9yZW0gaXBzdW0gZG9sb3Igc2l0IGFtZXQsIGNvbnNlY3RldHVyIGFkaXBpc2NpbmcgZWxpdCwg
c2VkIGRvIGVpdXNtb2QgdGVtcG9yIGluY2lkaWR1bnQgdXQgbGFib3JlIGV0IGRvbG9yZSBtYWduY
SBhbGlxdWEu
---
empty_bin = (bin)
---
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|bin (base64)|`bytes`|
## Comments and whitespaces
Jesth Dict Section supports comments which are strings starting with `#`. Comments and Whitespaces can be preserved while converting a Jesth Dict Section into a Python Dictionary.
**Example with comments and whitespaces:**
```
# this is a comment
key = "value"
# another comment
number = 42
```
**Mapping with actual Python objects:**
|Jesth|Python|
|---|---|
|comment|`jesth.box.Comment` subclassing ``str|
|whitespace|`jesth.box.Whitespace` subclassing ``str|
> **Note:** Along with `jesth.box.Comment` and `jesth.box.Whitespace`, the classes `jesth.box.CommentID` and `jesth.box.WhitespaceID` are also used to help embed comments and whitespace in a Python dictionary by playing the role of keys. They don't accept any argument, subclass `int`. Each instanciation of one of these classes create a new unique ID. The creation of this unique ID is thread-safe and handled by `jesth.box.N`.
## Collections (nested dict and list)
Jesth Dict Section supports nested dictionaries and lists.
Nesting a collection implies the use of a specific tag and indentation. The Jesth name for a nested dictionary is `Dict` and its tag is `(dict)`. In the case of a nested list, its Jesth name is `List` and its tag is `(list)`.
**Nested collections:**
```
my_dict = (dict)
key_1 = "value"
# Nested list
key_2 = (list)
"item i"
"item ii"
# nested dict
(dict)
project = "jesth"
number = 42
```
**Mapping with actual Python objects:**
|Jesth| Python |
|---|--------|
|dict| `dict` |
|list| `list` |
# Application programming interface
The library exposes functions and classes to load and dump Jesth document with optional customization at each stage. Loaded documents and their sections are encapsulated into classes exposing methods and properties with a high emphasis on ergonomy.
## Loading Jesth
To load Jesth data, one may use the `jesth.parser.read` function when the source is a file (file-like object, path string, or pathlib.Path object), or `jesth.parser.parse` function when the source is a string.
These functions returns a `Document` object that is expected to contain `Section` objects. Under the hood, these functions use the `jesth.parser.Parser` class that consume line by line the contents and stops when `[[END]]` is encountered.
Both functions accept the `ValueConverter` object for customization.
**Examples:**
```python
from decimal import Decimal
from jesth import read, parse
# Using 'read'
path = "/path/to/jesthfile"
document = read(path) # path may also be a pathlib.Path object
# Using 'parse'
with open(path, "r") as file:
text = file.read()
document = parse(text)
# Force floats to use decimal.Decimal
vc = ValueConverter(float_decoder=Decimal)
document = parse(text, value_converter=vc)
```
As showed in the example above, references to these functions are available from the root module.
## Dumping Jesth
To save the contents of a `Document` in a file, use the `jesth.renderer.write` function. This function uses another function `jesth.renderer.render` which takes a document as argument then return its textual representation (with the square brackets and all).
> **Note:** The `Document` class itself already exposes the `render` and `save` methods which use under the hood the functions `render` and `write` that has been described above.
**Example:**
```python
from jesth import read, write, render
path = "/path/to/jesthfile"
document = read(path)
# Save to same file
document.save()
# Save to another file
new_path = "/path/to/copy.txt"
document.save_to(new_path)
# render (convert the document into text)
text = document.render()
# note that the 'render' method accepts *headers
# and spacing arguments.
# By default the spacing is 1 (space) between sections
# Save to a file with 'write' function
write(document, path) # path may be a pathlib.Path instance
# render with the 'render' function
text = render(document)
```
As showed in the example above, references to the `write` and `render` functions are available from the root module.
## The Document object
The `read` and `parse` functions returns a document object which contains the list of sections objects and exposes methods to edit the document, save it in a file, render its contents etc.
> **Properties:** `path`, `sections`, `value_converter`, and `headers`.
### Edit a document
The document object exposes `append`, `insert`, `remove`, and `remove_all` methods to add and remove sections.
**Example:**
```python
from jesth import read
path = "/path/to/jestfile"
document = read(path)
# append a new section to the document
body = ["line 1", "line 2", "line 3"]
document.append("header", body=body) # body may be a dict or string
# insert an empty section at a specific index
document.insert(0, "header") # therefore, first section
# override the first section with a new empty one
document.set(0, "header") # 'body' is still set to None here
# remove the last section with a specific header
document.remove("header") # sub_index defaults to -1
# remove the first section with a specific header
document.remove("header", sub_index=0)
# note that the index argument is meant to be relative
# to this 'section family', i.e., sections with same header.
# remove all sections by header
document.remove_all("header")
```
### Query a document
The document object exposes `get`, `get_all`, and `count` methods to retrieve sections or their count.
**Example:**
```python
from jesth import read
path = "/path/to/jestfile"
document = read(path)
# get the first section with a specific header
section = document.get("header") # the index arg defaults to 0
# get the last section with a specific header
section = document.get("header", sub_index=-1)
# note that the index argument is meant to be relative
# to this 'section family', i.e., sections with same header.
# get all sections with a specific header
sections = document.get_all("header")
# count the number of sections sharing a specific header
n = document.count("header")
```
## The Section object
The Section object contains the header and the body of a section. The body is stored as a list of string that may be converted on demand, if compatible, into a Python dictionary.
> **Properties:** `header`, `body`, and `value_converter`.
**Example:**
```python
from jesth import read
path = "/path/to/jestfile"
document = read(path)
section = document.get("header")
# Convert the body of this section into a dict then return it.
# Beware, this method won't raise an exception at all
# instead, it may return a fallback value
data = section.to_dict() # accepts "default", "fallback", "strict"
# Concatenate the lines of the body to form a text string
data = section.to_text()
# Convert the body of this section into a dict then return it.
# Beware, this method will raise an exception if the conversion fails
data = section.create_dict() # accepts "default" and "strict"
# the default parameter is for default data if the new created dict
# is empty. The strict parameter is to tell whether comment/whitespace
# should be preserved and inserted in the new dict !
# update the body of this section
body = ["line 1", "line 2", "line 3"]
section.update(body) # here, body may be a string or a dict
```
## The ValueConverter object
This object has methods to encode and decode values, and also properties to customize conversion types.
Most time, you won't need to set or edit this object. The encode and decode methods of this object are only useful for test purpose.
**Parameters:**
- **dict_type**: the Python type in which a Jesth Dict should be converted into. Defaults to Python dict type.
- **list_type**: the Python type in which a Jesth List should be converted into. Defaults to Python list type.
- **XXX_types**: this represents a group of parameters. Here, XXX is a placeholder for a Jesth data type. Valid types are: dict, list, bin, bool, complex, date, datetime, float, integer, raw, string, text, time. Examples: dict_types, float_types, and time_types. Use this parameter to set a list of Python types that may be encoded in the Jesth type (the same used as prefix). Example: dict_types defaults to [dict, OrderedDict], i.e., while encoding some Python data, an OrderedDict instance or a regular dict instance will be encoded as a Jesth dict.
- **XXX_encoder**: this represents a group of parameters. Here, XXX is a placeholder for one of: bin, bool, complex, date, datetime, float, integer, null, raw, string, text, time, fallback. Use these parameters to set a callable to encode Python values into Jesth values. Example: float_encoder = decimal.Decimal
- **XXX_decoder**: this represents a group of parameters. Here, XXX is a placeholder for one of: bin, bool, complex, date, datetime, float, integer, null, raw, string, text, time, fallback. Use these parameters to set a callable to decode Jesth values into Python values. Example: integer_decoder = int
> **Note:** the parameters of ValueConverter are mirrored as read and write properties.
# Jesth Schema for data validation
Coming soon... ;)
# Available implementations
For the moment, the only available implementation of Jesth is this reference library written in Python.
If you are interested into writing a Jesth library for another language (Rust, C#, Java, JavaScript, PHP, etc), feel free to get in touch with [me](https://pyrustic.github.io/#contact) for any question !
# Miscellaneous
Collection of miscellaneous notes.
## Acronymy
Jesth stands for **Just Extract Sections Then Hack'em**. Here, the word `hack` is a direct reference to the [Hacker culture](https://en.wikipedia.org/wiki/Hacker_culture).
## End of document
To tell the parser to stop parsing a document, input in the document the `[[END]]` (case-insensitive) tag on its own line.
## Opening square bracket at the beginning of a line
You must escape the opening square brackets at the start of a line of a body with a backslash `\` character. And whenever your line needs to start with `n` backslash(es) followed by an opening square bracket, you need to prepend an extra backslash to the line, resulting in `n+1` backslash(es).
> **Note:** This does not apply when programmatically submitting the body of a section as an argument to a function or class.
**Example:**
```
[header]
\[hello world !
\\\[Here there is one extra line to escape the bracket. Backslashes themselves shouldn't be escaped
```
## Indentation
Indentation in a Jesth file is aligned with the de-facto indentation convention in Python. Four spaces represent one indent.
## Beautiful cover image
The beautiful cover image is made with the amazing [carbon app](https://carbon.now.sh).
# Contributing
Feel free to open an issue to report a bug, suggest some changes, show some useful code snippets, or discuss anything related to this project. You can also directly email [me](https://pyrustic.github.io/#contact).
> **Note:** I would like to automate a decent Git workflow (branching, merging patches, all the basic cool things cool kids do) with [Backstage](https://github.com/pyrustic/backstage). In the meantime, I will manually incorporate the fixes into the only existing branch (`master`). Thank you for your understanding !
## Setup your development environment
Following are instructions to setup your development environment
```bash
# create and activate a virtual environment
python -m venv venv
source venv/bin/activate
# clone the project then change into its directory
git clone https://github.com/pyrustic/jesth.git
cd jesth
# install the package locally (editable mode)
pip install -e .
# run tests
python -m unittest discover -f -s tests -t .
# deactivate the virtual environment
deactivate
```
# Installation
**Jesth** is **cross platform**. It is built on [Ubuntu](https://ubuntu.com/download/desktop) with [Python 3.8](https://www.python.org/downloads/) and should work on **Python 3.5** or **newer**.
## Create and activate a virtual environment
```bash
python -m venv venv
source venv/bin/activate
```
## Install for the first time
```bash
pip install jesth
```
## Upgrade the package
```bash
pip install jesth --upgrade --upgrade-strategy eager
```
## Deactivate the virtual environment
```bash
deactivate
```
# About the author
Hi, I'm Alex, a tech enthusiast ! Get in touch with [me](https://pyrustic.github.io/#contact) !
<br>
<br>
<br>
[Back to top](#readme)
Raw data
{
"_id": null,
"home_page": "https://github.com/pyrustic/jesth",
"name": "jesth",
"maintainer": "Pyrustic Evangelist",
"docs_url": null,
"requires_python": ">=3.5",
"maintainer_email": "rusticalex@yahoo.com",
"keywords": "serialization,library,format,collections,data,markup,parser,config,configuration,renderer,language,typed,data-structure,pyrustic",
"author": "Pyrustic Evangelist",
"author_email": "rusticalex@yahoo.com",
"download_url": "https://files.pythonhosted.org/packages/d6/04/23b68f98d4cbbf90f66af50d066030429b29926db0dee59ba59fd4d38761/jesth-0.0.11.tar.gz",
"platform": null,
"description": "> **Jesth has triggered the creation of two spin-offs to push its ideas further:**\n> - [Braq](https://github.com/pyrustic/braq): Customizable data format for config files, AI prompts, and more.\n> - [Paradict](https://github.com/pyrustic/paradict): Streamable multi-format serialization with schema.\n>\n> These projects therefore make Jesth obsolete.\n\n<!-- Cover -->\n<div align=\"center\">\n <img src=\"https://raw.githubusercontent.com/pyrustic/misc/master/assets/jesth/cover.png\" alt=\"Demo\" width=\"650\">\n <p align=\"center\">\n JesthFile with 1 anonymous top section, and 2 'user' sections\n </p>\n</div>\n\n\n<!-- Intro Text -->\n# Jesth\n<b> Human-readable versatile data format </b>\n \nThis project is part of the [Pyrustic Open Ecosystem](https://pyrustic.github.io).\n> [Installation](#installation) [Demo](#demo) [Latest](https://github.com/pyrustic/jesth/tags) [Modules](https://github.com/pyrustic/jesth/tree/master/docs/modules#readme)\n\n\n## Table of contents\n\n- [Overview](#overview)\n- [Why not TOML, YAML, or JSON ?](#why-not-toml-yaml-or-json-)\n- [Why use Jesth ?](#why-use-jesth-)\n- [Demo](#demo)\n- [Code snippets for everyday scenarios](#code-snippets-for-everyday-scenarios)\n- [Anatomy of a Section](#anatomy-of-a-section)\n - [Base section](#base-section)\n - [Dict section](#dict-section)\n - [Section family](#section-family)\n- [Specs of the Dict Section](#specs-of-the-dict-section)\n - [Scalars](#scalars)\n - [Null value](#null-value)\n - [Single-line strings](#single-line-strings)\n - [Multi-line strings](#multi-line-strings)\n - [Date and time](#date-and-time)\n - [Embedded binary data](#embedded-binary-data)\n - [Comments and whitespaces](#comments-and-whitespaces)\n - [Collections (nested dict and list)](#collections-nested-dict-and-list)\n- [Application programming interface](#application-programming-interface)\n- [Jesth Schema for data validation](#jesth-schema-for-data-validation)\n- [Available implementations](#available-implementations)\n- [Miscellaneous](#miscellaneous)\n- [Contributing](#contributing)\n- [Installation](#installation)\n\n\n# Overview\n**Jesth** (pronounced [/d\u0292est/](https://dictionary.cambridge.org/pronunciation/english/jest)), which stands for **Just Extract Sections Then Hack'em**, is a human-readable data format whose reference parser is an eponymous lightweight [Python](https://www.python.org/) library available on [PyPI](#installation).\n\n> Read the **backstory** in this [HN discussion](https://news.ycombinator.com/item?id=35991018) !\n\n## Versatile format\nA Jesth **document** is made up of **sections**, each defined by a **header** (between two square brackets) and a **body** which is just lines of text. \n\nInstead of interpreting these lines upstream after extracting the sections, the Jesth parser lays low and lets the programmer decide which hack to apply to them.\n\nBecause of this parsing policy, a single Jesth document can contain an **eclectic set of sections** representing a poem, server configs, help text, [prompts](https://github.com/f/awesome-chatgpt-prompts) for a chatbot, [directed graph](https://en.wikipedia.org/wiki/Directed_graph), [ascii artwork](https://en.wikipedia.org/wiki/ASCII_art), and more.\n\n> **Jesth** is used by **Shared** for data persistence and exchange. Visit [Shared](https://github.com/pyrustic/shared#readme) !\n\n## Shipped with a proper and extensively tested hack\nFor convenience, the Jesth library includes a proper and extensively tested [hack]((https://en.wikipedia.org/wiki/Hacker_culture)) to convert a compatible section to a [dictionary](https://en.wikipedia.org/wiki/Associative_array) data structure. In such a section that allows comments, one can encode **strings** (single or multi-line), **scalars** (integer, float, decimal float, complex, booleans), **date** and **time**, **null** value, **binary** data and nested **collections** (list and dictionary).\n\nTherefore, Jesth can be used for configuration files, to create a [scripting language](https://github.com/pyrustic/backstage), to act as a markup language to write [docstrings](https://en.wikipedia.org/wiki/Docstring#Python) so that a tool can generate reference documentation after parsing them, etc.\n\nIn the next few sections, we'll dive deep into Jesth, but first, why not [TOML](https://en.wikipedia.org/wiki/TOML), [YAML](https://fr.wikipedia.org/wiki/YAML), or [JSON](https://en.wikipedia.org/wiki/JSON) ?\n\n# Why not TOML, YAML, or JSON ?\nJesth offers a high level of **versatility** that allows the programmer to freely decide what a section can contain and how to interpret its contents. \n\nMoreover, whenever the programmer decides to adopt a dictionary as a data structure to formalize some or all sections of a Jesth document, he/she is offered a [very readable and strict syntax](#specs-of-the-dict-section) to encode the data. \n\nThe library exposes an intuitive API for loading and dumping Dict sections with an option to preserve comments and whitespaces.\n\nThe following subsections will briefly compare **TOML**, **YAML**, and **JSON** to **Jesth's dictionary data structure support**.\n\n## Why not TOML ?\nJesth shares with **TOML** the use of square brackets to define sections (or tables in TOML terminology), therefore, among other formats, the TOML syntax is closest to Jesth. However, when dealing with nested structures, TOML becomes very confusing, revealing the limitations of its syntax.\n\nBesides intuitiveness and readability, the design decisions behind Jesth [accidentally](https://en.wikipedia.org/wiki/Serendipity) created an **unlimited reserved word pool**. For example, the `[[END]]` header (note the double brackets on either side) is reserved as a document separator, allowing you to mark the end of a Jesth stream or store multiple Jesth documents in a single file.\n\n## Why not YAML ?\n**YAML** tightly embraces complexity and ambiguity, which are probably not useful for most programmers' needs. Jesth keeps it minimalist, unambiguous and very readable.\n\n## Why not JSON ?\n**JSON** is great because it mimics the way we define data structures in source code. Therefore, it can be minified and is also a reasonable choice for machine-to-machine communication, but it cannot beat the readability of formats that natively support line breaks and/or indentation. In comparison to Jesth, JSON has fewer native data types and no native support for comments.\n\n# Why use Jesth ?\nJesth is ideal for configuration files, whether intended for use by a standard application or by a chatbot/AI. Personal notes can also benefit from being entered into a Jesth file, so they can be parsed later. As mentioned in the [Overview](#overview) section, Jesth was used to design a [scripting language](https://github.com/pyrustic/backstage) and currently serves as the markup language for [docstrings](https://en.wikipedia.org/wiki/Docstring#Python) in all projects that are part of the [Pyrustic ecosystem](https://pyrustic.github.io).\n\nLoading a Jesth document is stress-free because exceptions are only thrown when you try to convert a non-compatible section to a dictionary. But even that is the strict default behavior of the dictionary converter since you can customize the converter to string invalid values. This specific customization is as simple as this: `value_converter.fallback_decoder = str`\n\nThere's more to say, for example the Jesth Dict Section syntax is text editor/IDE-friendly, so it may benefit from supporting some cool features like the folding mechanism.\n\n# Demo\nThe demo is a clonable repository containing a JesthFile and a Python script to load its contents. \n\n> [Open the Demo](https://github.com/pyrustic/jesth-demo#readme) !\n\n# Code snippets for everyday scenarios\n**Load a JesthFile:**\n```python\nfrom jesth import read\n\npath = \"/home/alex/jesthfile\"\ndocument = read(path)\n```\n\n**Get sections:**\n```python\n...\n\n# get the \"config\" section\nconfig_section = document.get(\"config\")\n\n# get the list of all \"user\" sections\nuser_sections = document.get_all(\"user\")\n\n# show the body of a section\nprint(config_section.body) # list of lines\n\n# show the body as a text (concatenated lines)\nprint(config_section.to_text())\n\n# update the body of a section\nnew_body = [r\"# comment\", \n r\"server1 = '1.1.1.1'\",\n r\"port = 80\"]\nconfig_section.update(new_body)\n```\n\n**Convert a compatible section into a Python Dict:**\n\n```python\n...\n\n# won't raise an exception if the section can't be converted to dict\ndict_object = section.to_dict()\n\n# will raise an exception if the section can't be converted to dict\ndict_object = section.create_dict()\n\n# Note that `.create_dict` accepts `default` and `strict` arguments\n# If the body can be converted to a dict but is empty, you get `default`\n# Set True to `strict` to preserve comments and whitespaces\n\n# Note that `.to_dict` accepts `default`, `strict`, and `fallback`\n# If the body can't be converted to a dict, you get `fallback`\n```\n\n**Edit a document:**\n```python\n...\n\n# append a new \"user\" section at the end of document\nbody = {\"name\": \"alex\", \"id\": 420}\ndocument.append(\"user\", body) # body as a regular Python dict\n\n# insert a new \"user\" section at the beginning of document\nbody = [r\"# comment\", \n r\"name = 'alex'\", \n r\"id = 420\"]\ndocument.insert(0, \"user\", body) # body as a list of lines\n```\n\n**Save a document:**\n```python\nfrom jesth import write\n...\n\n# if the document is linked to a JesthFile, call its `save` method\ndocument.save()\n\n# or save to a new path\ndocument.save_to(path)\n\n# Under the hood, the Document class actually uses the `write` function\nwrite(document, path)\n```\n\n**Render a document:**\n```python\nfrom jesth import render\n...\n\n# return the string representation of the data\n# this jesth text can be stored as it in a JesthFile\njesth_text = document.render()\n\n# Under the hood, the Document class actually uses the `render` function\njesth_text = render(document)\n```\n\n# Anatomy of a Section\nA **section** is made up of two parts: a **header** and a **body**. The header is defined on its own line, surrounded by two single brackets (opening and closing brackets respectively). \n\nThe body is what is between two consecutive headers or between a header and the end of the document.\n\n**Example of headers:**\n\n```\n[this is a header]\n\n[this_is_another_header]\n\n\\[this is not a header at all]\n\n[[this is very illegal]]\n\n\\[[this is not illegal at all but still not a header]]\n```\n\n> **Note:** A header may be empty and referenced from the source code with an empty string. Sections with empty headers are called `anonymous sections`.\n\n**Example of two anonymous sections:**\n```\nThis text actually belongs to an anonymous section,\nwhich happens to also be the first section of this document\n\n[]\nThis text actually belongs to the second anonymous section\nof this document.\nIf the very first section of a document is anonymous, it can\nignore to define its header.\n\n```\n\n> **Note:** As you noticed, two sections may have similar headers. Read more about this in the 'Section family' section.\n\n## Base section\nIs called Base section a section as defined in the document, without any transformation, i.e., just text lines. From the source code perspective, the body of such section is represented by default as a list of lines (string). \n\nThe alternative representation of such body is just a concatenated version of this list.\n\n## Dict section\nIs called Dict section, a section that is intended to be converted into a dictionary object by the parser. \n\nBy default, the library uses the regular `dict` type to represent the body of such section. This default behavior can be customized through the `ValueConverter` object.\n\n## Section family\nDifferent sections can have the same header. In this case, they belong to a **section family**. By default, the `.get(header)` method from the `Document` class returns the first defined section with the given header argument. This method actually accepts a `sub_index` argument which is set to `0` by default to access the first item of the section family.\n\n**Get the 2nd section in the 'user' section family:**\n\n```python\n...\n\nsection = document.get(\"user\", sub_index=1)\n```\n\n**Get the list of all sections in the 'user' section family:**\n```python\n...\n\nsections = document.get_all(\"user\")\n```\n\n# Specs of the Dict Section\nThe specs for a Dictionary Section are mostly aligned with Python specs. For example, a Jesth Integer follows exactly the definition of Python Integer. \n\n> **Note:** The Jesth library allows the customization of each type of data as well in the context of parsing or in the context of rendering. This customization is made via the [ValueConverter](#the-valueconverter-object) object\n\n## Scalars\nJesth supports integers (base 10, hexadecimal, octal, binary), float (approximate binary floating-point, fixed-precision decimal floating-point), complex numbers, and booleans.\n\n> **Note:** Jesth Scalars specs are aligned with their respective Python equivalent specs.\n\n### Integer\nJesth Dict section supports Integer whose specs is aligned with Python `int` type specs.\n\n**Example:**\n```\nint = -4_200_000\nhex_int = 0xAE1F\noct_int = 0o3427\nbin_int = 0b101_0011_1010_1010\n```\n> **Note:** By default, Jesth tidy up numbers by using the underscore convention.\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|integer|`int`|\n|hexadecimal int|`jesth.box.HexInt` subclassing `int`|\n|octal int|`jesth.box.OctInt` subclassing `int`|\n|binary int|`jesth.box.BinInt` subclassing `int`|\n\n### Float\nJesth Dict section supports approximate binary floating-point and fixed-precision decimal floating-point which should be represented in the scientific notation.\n\n**Approximate binary floating-point:**\n```\nfloat = 3.14\n```\n\n**fixed-precision decimal floating-point:**\n```\ndecimal_float = 3.14E0\n```\n\n> **Note:** By default, Jesth tidy up float numbers by using the underscore convention.\n\n**Decimal float with underscores:**\n```\ndecimal_float = 3.141_592_653_589_793_238_46E0\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|binary floating-point|`float`|\n|decimal floating-point|`decimal.Decimal`|\n\n### Complex\nJesth Dict section supports complex numbers.\n\n**Example:**\n```\nmy_complex = -4.2+6.9i\n```\n\nThe number in the example above is equivalent to this in Python:\n```python\nmy_complex = -4.2+6.9j\n\n# or\nmy_complex = complex(-4.2, 6.9)\n```\n> **Note:** The only difference between the Jesth complex and Python complex is the usage of `i` as suffix by Jesth instead of `j` like Python.\n\n\n### Bool\nJesth Dict section supports booleans.\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|true|`True`|\n|false|`False`|\n\n## Null value\nJesth Dict section supports the `null` value that translates to Python `None` singleton.\n\n## Single-line strings\nJesth Dict Section supports single-line strings. Such strings may embed unicode escape sequence and should be enclosed with double-quotes. Raw strings (WYSIWYG) are also supported and in these ones should be enclosed with single-quotes.\n\n**String embedding unicode escape sequences:**\n```\nmy_string = \"this is a \\n single-line string \\u000A with line breaks\"\nanother_string = \"string containing \"double quotes\" and a \\\\ backslash\"\n```\n> **Note:** The Backslash character is a special one here. It is used to escape a backslash or to make a unicode escape sequence. It shouldn't be used to escape a quote.\n\n**Raw string (WYSIWYG):**\n```\nmy_string = 'this is a \\n single-line string \\u000A with no line break'\nanother_string = 'string containing 'single quotes' and a \\ backslash'\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|Double quotes String|`str`|\n|Single quotes String|`jesth.box.RawString`|\n\n> The Jesth single-line String type is aligned with Python `str` type.\n\n## Multi-line strings\nMulti-line strings work like single-line strings with one major difference: they can actally span on multilines !\n\nCreating multi-line strings implies using tags and indentation. The Jesth name for a multi-line string that supports unicode escape sequences is `Text` and its tag is `(text)`. For raw multi-line string, the Jesth name is `Raw` and its tag is `(raw)`.\n\nA multi-line string is written with a 4-spaces indent under its tag. This indented block under the tag is then closed with a `---` separator symbol placed on its own line.\n\n\n**Multi-line string that supports unicode escape sequences:**\n```\nmultiline = (text)\n This is a multi-line\n string that supports \\\\\n unicode escape sequences \\u000A\n ---\n \nempty_text = (text)\n ---\n```\n\n**Raw multi-line string:**\n```\nmultiline = (raw)\n This is a raw multi-line \\\n string \\u000A\n ---\n \nempty_raw = (raw)\n ---\n```\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|Text|`jesth.box.Text` subclassing `str`|\n|Raw|`jesth.box.Raw` subclassing `str`| \n\n## Date and time\nJesth Dict Section supports date and time (ISO 8601).\n**Date and time:**\n```\ndatetime = 2020-10-20T15:35:57Z+01:00\ndate = 2020-10-20\ntime = 15:35:57\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|datetime|`datetime.datetime`|\n|date|`datetime.date`|\n|time|`datetime.time`| \n\n## Embedded binary data\nJesth Dict Section supports binary data. Under the hood, standard base64 (RFC 4648) is used. For convenience, the Jesth parser allows for line breaks and the Jesth renderer outputs base64 lines of 76 characters.\n\nEmbedding binary data encoded with base64 implies the use of a specific tag and indentation. The Jesth name for binary data is `Bin` and its tag is `(bin)`.\n\nBinary data encoded with base64 is written with a 4-spaces indent under the `(bin)` tag. This indented block under the tag is then closed with a `---` separator symbol placed on its own line.\n\n```\nbin = (bin)\n TG9yZW0gaXBzdW0gZG9sb3Igc2l0IGFtZXQsIGNvbnNlY3RldHVyIGFkaXBpc2NpbmcgZWxpdCwg\n c2VkIGRvIGVpdXNtb2QgdGVtcG9yIGluY2lkaWR1bnQgdXQgbGFib3JlIGV0IGRvbG9yZSBtYWduY\n SBhbGlxdWEu\n ---\n \nempty_bin = (bin)\n ---\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|bin (base64)|`bytes`|\n\n\n## Comments and whitespaces\nJesth Dict Section supports comments which are strings starting with `#`. Comments and Whitespaces can be preserved while converting a Jesth Dict Section into a Python Dictionary.\n**Example with comments and whitespaces:**\n```\n# this is a comment\nkey = \"value\"\n\n# another comment\nnumber = 42\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth|Python|\n|---|---|\n|comment|`jesth.box.Comment` subclassing ``str|\n|whitespace|`jesth.box.Whitespace` subclassing ``str|\n\n> **Note:** Along with `jesth.box.Comment` and `jesth.box.Whitespace`, the classes `jesth.box.CommentID` and `jesth.box.WhitespaceID` are also used to help embed comments and whitespace in a Python dictionary by playing the role of keys. They don't accept any argument, subclass `int`. Each instanciation of one of these classes create a new unique ID. The creation of this unique ID is thread-safe and handled by `jesth.box.N`.\n\n\n## Collections (nested dict and list)\nJesth Dict Section supports nested dictionaries and lists.\n\nNesting a collection implies the use of a specific tag and indentation. The Jesth name for a nested dictionary is `Dict` and its tag is `(dict)`. In the case of a nested list, its Jesth name is `List` and its tag is `(list)`.\n\n**Nested collections:**\n```\nmy_dict = (dict)\n key_1 = \"value\"\n # Nested list\n key_2 = (list)\n \"item i\"\n \"item ii\"\n # nested dict\n (dict)\n project = \"jesth\"\n number = 42\n```\n\n**Mapping with actual Python objects:**\n\n|Jesth| Python |\n|---|--------|\n|dict| `dict` |\n|list| `list` |\n\n# Application programming interface\nThe library exposes functions and classes to load and dump Jesth document with optional customization at each stage. Loaded documents and their sections are encapsulated into classes exposing methods and properties with a high emphasis on ergonomy.\n\n## Loading Jesth\nTo load Jesth data, one may use the `jesth.parser.read` function when the source is a file (file-like object, path string, or pathlib.Path object), or `jesth.parser.parse` function when the source is a string.\n\nThese functions returns a `Document` object that is expected to contain `Section` objects. Under the hood, these functions use the `jesth.parser.Parser` class that consume line by line the contents and stops when `[[END]]` is encountered.\n\nBoth functions accept the `ValueConverter` object for customization.\n\n**Examples:**\n```python\nfrom decimal import Decimal\nfrom jesth import read, parse\n\n# Using 'read'\npath = \"/path/to/jesthfile\"\ndocument = read(path) # path may also be a pathlib.Path object\n\n# Using 'parse'\nwith open(path, \"r\") as file:\n text = file.read()\ndocument = parse(text)\n\n# Force floats to use decimal.Decimal\nvc = ValueConverter(float_decoder=Decimal)\ndocument = parse(text, value_converter=vc)\n```\n\nAs showed in the example above, references to these functions are available from the root module.\n\n## Dumping Jesth\nTo save the contents of a `Document` in a file, use the `jesth.renderer.write` function. This function uses another function `jesth.renderer.render` which takes a document as argument then return its textual representation (with the square brackets and all).\n\n> **Note:** The `Document` class itself already exposes the `render` and `save` methods which use under the hood the functions `render` and `write` that has been described above.\n\n\n**Example:**\n```python\nfrom jesth import read, write, render\n\npath = \"/path/to/jesthfile\"\ndocument = read(path)\n\n# Save to same file\ndocument.save()\n# Save to another file\nnew_path = \"/path/to/copy.txt\"\ndocument.save_to(new_path)\n\n# render (convert the document into text)\ntext = document.render()\n# note that the 'render' method accepts *headers\n# and spacing arguments. \n# By default the spacing is 1 (space) between sections\n\n# Save to a file with 'write' function\nwrite(document, path) # path may be a pathlib.Path instance\n\n# render with the 'render' function\ntext = render(document)\n```\n\nAs showed in the example above, references to the `write` and `render` functions are available from the root module.\n\n## The Document object\nThe `read` and `parse` functions returns a document object which contains the list of sections objects and exposes methods to edit the document, save it in a file, render its contents etc.\n\n> **Properties:** `path`, `sections`, `value_converter`, and `headers`. \n\n\n### Edit a document\nThe document object exposes `append`, `insert`, `remove`, and `remove_all` methods to add and remove sections.\n\n**Example:**\n\n```python\nfrom jesth import read\n\npath = \"/path/to/jestfile\"\ndocument = read(path)\n\n# append a new section to the document\nbody = [\"line 1\", \"line 2\", \"line 3\"]\ndocument.append(\"header\", body=body) # body may be a dict or string\n\n# insert an empty section at a specific index\ndocument.insert(0, \"header\") # therefore, first section\n# override the first section with a new empty one\ndocument.set(0, \"header\") # 'body' is still set to None here\n\n# remove the last section with a specific header\ndocument.remove(\"header\") # sub_index defaults to -1\n# remove the first section with a specific header\ndocument.remove(\"header\", sub_index=0)\n# note that the index argument is meant to be relative\n# to this 'section family', i.e., sections with same header.\n\n# remove all sections by header\ndocument.remove_all(\"header\")\n```\n\n### Query a document\nThe document object exposes `get`, `get_all`, and `count` methods to retrieve sections or their count.\n\n**Example:**\n\n```python\nfrom jesth import read\n\npath = \"/path/to/jestfile\"\ndocument = read(path)\n\n# get the first section with a specific header\nsection = document.get(\"header\") # the index arg defaults to 0\n# get the last section with a specific header\nsection = document.get(\"header\", sub_index=-1)\n# note that the index argument is meant to be relative\n# to this 'section family', i.e., sections with same header.\n\n# get all sections with a specific header\nsections = document.get_all(\"header\")\n\n# count the number of sections sharing a specific header\nn = document.count(\"header\")\n```\n\n## The Section object\nThe Section object contains the header and the body of a section. The body is stored as a list of string that may be converted on demand, if compatible, into a Python dictionary.\n\n> **Properties:** `header`, `body`, and `value_converter`.\n\n\n**Example:**\n\n```python\nfrom jesth import read\n\npath = \"/path/to/jestfile\"\ndocument = read(path)\nsection = document.get(\"header\")\n\n# Convert the body of this section into a dict then return it.\n# Beware, this method won't raise an exception at all\n# instead, it may return a fallback value\ndata = section.to_dict() # accepts \"default\", \"fallback\", \"strict\"\n\n# Concatenate the lines of the body to form a text string\ndata = section.to_text()\n\n# Convert the body of this section into a dict then return it.\n# Beware, this method will raise an exception if the conversion fails\ndata = section.create_dict() # accepts \"default\" and \"strict\"\n# the default parameter is for default data if the new created dict\n# is empty. The strict parameter is to tell whether comment/whitespace\n# should be preserved and inserted in the new dict !\n\n# update the body of this section\nbody = [\"line 1\", \"line 2\", \"line 3\"]\nsection.update(body) # here, body may be a string or a dict\n```\n\n## The ValueConverter object\nThis object has methods to encode and decode values, and also properties to customize conversion types. \n\nMost time, you won't need to set or edit this object. The encode and decode methods of this object are only useful for test purpose.\n\n**Parameters:**\n- **dict_type**: the Python type in which a Jesth Dict should be converted into. Defaults to Python dict type.\n\n- **list_type**: the Python type in which a Jesth List should be converted into. Defaults to Python list type.\n\n- **XXX_types**: this represents a group of parameters. Here, XXX is a placeholder for a Jesth data type. Valid types are: dict, list, bin, bool, complex, date, datetime, float, integer, raw, string, text, time. Examples: dict_types, float_types, and time_types. Use this parameter to set a list of Python types that may be encoded in the Jesth type (the same used as prefix). Example: dict_types defaults to [dict, OrderedDict], i.e., while encoding some Python data, an OrderedDict instance or a regular dict instance will be encoded as a Jesth dict.\n\n- **XXX_encoder**: this represents a group of parameters. Here, XXX is a placeholder for one of: bin, bool, complex, date, datetime, float, integer, null, raw, string, text, time, fallback. Use these parameters to set a callable to encode Python values into Jesth values. Example: float_encoder = decimal.Decimal\n\n- **XXX_decoder**: this represents a group of parameters. Here, XXX is a placeholder for one of: bin, bool, complex, date, datetime, float, integer, null, raw, string, text, time, fallback. Use these parameters to set a callable to decode Jesth values into Python values. Example: integer_decoder = int\n\n> **Note:** the parameters of ValueConverter are mirrored as read and write properties.\n\n\n# Jesth Schema for data validation\nComing soon... ;)\n\n# Available implementations\nFor the moment, the only available implementation of Jesth is this reference library written in Python. \nIf you are interested into writing a Jesth library for another language (Rust, C#, Java, JavaScript, PHP, etc), feel free to get in touch with [me](https://pyrustic.github.io/#contact) for any question !\n\n# Miscellaneous\nCollection of miscellaneous notes.\n\n## Acronymy\nJesth stands for **Just Extract Sections Then Hack'em**. Here, the word `hack` is a direct reference to the [Hacker culture](https://en.wikipedia.org/wiki/Hacker_culture).\n\n## End of document\nTo tell the parser to stop parsing a document, input in the document the `[[END]]` (case-insensitive) tag on its own line.\n\n## Opening square bracket at the beginning of a line\nYou must escape the opening square brackets at the start of a line of a body with a backslash `\\` character. And whenever your line needs to start with `n` backslash(es) followed by an opening square bracket, you need to prepend an extra backslash to the line, resulting in `n+1` backslash(es).\n\n> **Note:** This does not apply when programmatically submitting the body of a section as an argument to a function or class.\n\n**Example:**\n\n```\n[header]\n\\[hello world !\n\\\\\\[Here there is one extra line to escape the bracket. Backslashes themselves shouldn't be escaped\n```\n\n## Indentation\nIndentation in a Jesth file is aligned with the de-facto indentation convention in Python. Four spaces represent one indent.\n\n## Beautiful cover image\nThe beautiful cover image is made with the amazing [carbon app](https://carbon.now.sh).\n\n# Contributing\nFeel free to open an issue to report a bug, suggest some changes, show some useful code snippets, or discuss anything related to this project. You can also directly email [me](https://pyrustic.github.io/#contact).\n\n> **Note:** I would like to automate a decent Git workflow (branching, merging patches, all the basic cool things cool kids do) with [Backstage](https://github.com/pyrustic/backstage). In the meantime, I will manually incorporate the fixes into the only existing branch (`master`). Thank you for your understanding !\n\n## Setup your development environment\nFollowing are instructions to setup your development environment\n\n```bash\n# create and activate a virtual environment\npython -m venv venv\nsource venv/bin/activate\n\n# clone the project then change into its directory\ngit clone https://github.com/pyrustic/jesth.git\ncd jesth\n\n# install the package locally (editable mode)\npip install -e .\n\n# run tests\npython -m unittest discover -f -s tests -t .\n\n# deactivate the virtual environment\ndeactivate\n```\n\n\n# Installation\n**Jesth** is **cross platform**. It is built on [Ubuntu](https://ubuntu.com/download/desktop) with [Python 3.8](https://www.python.org/downloads/) and should work on **Python 3.5** or **newer**.\n\n\n## Create and activate a virtual environment\n```bash\npython -m venv venv\nsource venv/bin/activate\n```\n\n## Install for the first time\n\n```bash\npip install jesth\n```\n\n## Upgrade the package\n```bash\npip install jesth --upgrade --upgrade-strategy eager\n```\n\n## Deactivate the virtual environment\n```bash\ndeactivate\n```\n\n# About the author\nHi, I'm Alex, a tech enthusiast ! Get in touch with [me](https://pyrustic.github.io/#contact) ! \n\n<br>\n<br>\n<br>\n\n[Back to top](#readme)\n\n\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Human-readable versatile data format",
"version": "0.0.11",
"project_urls": {
"Homepage": "https://github.com/pyrustic/jesth"
},
"split_keywords": [
"serialization",
"library",
"format",
"collections",
"data",
"markup",
"parser",
"config",
"configuration",
"renderer",
"language",
"typed",
"data-structure",
"pyrustic"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d9ddda7e02a9049165eb1e19f765449e3da00fd5d3a5492b242db057c8365a6b",
"md5": "04525a67c39dc2013a00a1578ef3f73b",
"sha256": "e5b73291361f520d6af308bd640a4e70f79f745722d7d1022880a51fe3549b3f"
},
"downloads": -1,
"filename": "jesth-0.0.11-py3-none-any.whl",
"has_sig": false,
"md5_digest": "04525a67c39dc2013a00a1578ef3f73b",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.5",
"size": 47509,
"upload_time": "2024-03-13T23:02:00",
"upload_time_iso_8601": "2024-03-13T23:02:00.250184Z",
"url": "https://files.pythonhosted.org/packages/d9/dd/da7e02a9049165eb1e19f765449e3da00fd5d3a5492b242db057c8365a6b/jesth-0.0.11-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "d60423b68f98d4cbbf90f66af50d066030429b29926db0dee59ba59fd4d38761",
"md5": "3037e7835430de98cc57f40f4c762386",
"sha256": "8fa5f78065d43c790eb26b33ef690518412f5cbe8a69a61637da2f04761efc46"
},
"downloads": -1,
"filename": "jesth-0.0.11.tar.gz",
"has_sig": false,
"md5_digest": "3037e7835430de98cc57f40f4c762386",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.5",
"size": 59318,
"upload_time": "2024-03-13T23:02:05",
"upload_time_iso_8601": "2024-03-13T23:02:05.427971Z",
"url": "https://files.pythonhosted.org/packages/d6/04/23b68f98d4cbbf90f66af50d066030429b29926db0dee59ba59fd4d38761/jesth-0.0.11.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-03-13 23:02:05",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "pyrustic",
"github_project": "jesth",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "jesth"
}