edgar-analytics


Nameedgar-analytics JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/zoharbabin/edgar_analytics
SummaryA library and CLI tool for analyzing SEC EDGAR filings with financial metrics and forecasting.
upload_time2025-01-20 09:43:48
maintainerNone
docs_urlNone
authorZohar Babin
requires_python>=3.10
licenseNone
keywords
VCS
bugtrack_url
requirements pandas numpy statsmodels edgartools click pytest pytz pytest-xdist
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # EDGAR Analytics

**A Python library for analyzing SEC EDGAR filings, computing financial metrics, generating forecasts (with a strategy-based architecture), and producing clear summary reports.**

---

## Table of Contents
1. [Overview](#overview)
2. [Features](#features)
3. [Installation](#installation)
    - [Using `pip`](#using-pip)
    - [Setting Up a Local Virtual Environment](#setting-up-a-local-virtual-environment)
4. [Quick Start](#quick-start)
5. [Usage](#usage)
    - [CLI Usage](#cli-usage)
    - [Programmatic Usage](#programmatic-usage)
6. [Core Modules](#core-modules)
7. [Detailed Method Reference](#detailed-method-reference)
8. [Logging](#logging)
9. [Extensibility](#extensibility)
10. [Testing](#testing)
11. [Contributing](#contributing)
12. [License](#license)

---

## Overview

**EDGAR Analytics** is a Python library designed to:

- Retrieve and parse financial statements from the SEC’s [EDGAR](https://www.sec.gov/edgar.shtml) system (via [edgartools](https://pypi.org/project/edgartools/)).
- Compute key financial metrics (including GAAP, IFRS, intangible, goodwill, net debt, etc.).
- Optionally **forecast revenue** using a **strategy-based** architecture (ARIMA by default).
- Summarize results (logging + CSV outputs).

It handles complexities such as:

- Multi-year and multi-quarter filing retrieval (10-K and 10-Q).
- Flexible synonyms to accommodate IFRS/GAAP labeling differences.
- Automatic sign-flipping of negative expenses.
- Common ratio calculations (Current Ratio, Debt-to-Equity, Free Cash Flow, etc.).
- Alert detection for negative margins, high leverage, negative free cash flow streaks, spikes in working capital.
- **Strategy-based forecasting** with a default ARIMA approach or customizable forecasting strategy.
- A fully scriptable **CLI**.

**IFRS & GAAP Filers**  
Supports both U.S. GAAP and IFRS filers. Foreign private issuers that file IFRS-based statements (e.g., Form 20-F) are automatically handled if synonyms match IFRS-labeled line items.

---

## Features

- **Annual & Quarterly Snapshots**: Retrieve the latest 10-K (annual) and 10-Q (quarterly) metrics for a given company.
- **Multi-Year Analysis**: Pull multiple 10-K and 10-Q statements to compute YoY growth, QoQ growth, and CAGR.
- **Key Ratios & Metrics**: Current Ratio, Debt-to-Equity, Free Cash Flow, margins, intangible ratios, net debt, lease liabilities, and more.
- **Alerts**:
  - Negative net margin
  - High leverage (debt-to-equity above threshold)
  - Consecutive quarters of negative free cash flow
  - Significant quarterly spikes in inventory/receivables
- **Revenue Forecasting**:
  - By default, uses an **ARIMA-based** strategy for annual or quarterly data.
  - Easily swap in your **own forecasting logic** by implementing a custom strategy (see [Extensibility](#extensibility)).
- **Command-Line Interface (CLI)**: Analyze one or more tickers with a single command.
- **Optional CSV Output**: Save summarized metrics to CSV.

---

## Installation

You can install EDGAR Analytics using `pip` or set it up within a local virtual environment for development.

### Using `pip`

```bash
pip install edgar-analytics
```

### Setting Up a Local Virtual Environment

1. **Create a Virtual Environment**  

   ```bash
   python3 -m venv venv
   source venv/bin/activate   # or venv\Scripts\activate on Windows
   pip install --upgrade pip setuptools wheel
   ```

2. **Install in Editable Mode (for contributing)**  

   ```bash
   pip install -e .[test]
   ```

   Make sure you’re in the repo’s root directory (where `setup.py` is located).

---

## Quick Start

Once installed, **analyze a company** via the CLI:

```bash
edgar-analytics AAPL --csv my_report.csv
```

This command:

- Fetches Apple’s latest annual (10-K) and quarterly (10-Q) metrics.
- Retrieves multi-year data for deeper analysis (CAGR, YoY growth, etc.).
- Automatically forecasts next annual and quarterly revenue using the default ARIMA strategy.
- Logs everything to the console and saves a CSV summary to `my_report.csv`.

---

## Usage

### CLI Usage

Use the `edgar-analytics` command, with arguments for the main ticker, optional peers, and an optional `--csv` path:

```bash
edgar-analytics TICKER [PEER1 [PEER2 ...]] [--csv output.csv]
```

**Examples**:

1. **Single Ticker**  
   ```bash
   edgar-analytics AAPL
   ```
   Fetches data for AAPL, logs results to the console.

2. **With Peers + CSV**  
   ```bash
   edgar-analytics AAPL MSFT GOOGL --csv analysis_results.csv
   ```
   Compares AAPL to MSFT & GOOGL, logs aggregated results, and writes a CSV to `analysis_results.csv`.

3. **Invalid Tickers**  
   If any ticker is invalid (e.g., `@BADTICKER`), the CLI logs a warning and skips it.

### Programmatic Usage

You can also **use the library in your own Python scripts**. For example:

```python
from edgar_analytics.orchestrator import TickerOrchestrator

def main():
    orchestrator = TickerOrchestrator()
    orchestrator.analyze_company(
        ticker="AAPL",
        peers=[],
        csv_path="analysis_outputs/aapl_summary.csv"
    )

if __name__ == "__main__":
    main()
```

**Key Steps** (inside `analyze_company`):

1. Validates the ticker symbol (e.g., `AAPL`).
2. Fetches annual (10-K) and quarterly (10-Q) snapshots.
3. Retrieves multi-year data, computing YoY, CAGR, etc.
4. **Forecasts annual & quarterly revenue** using the **default ARIMA strategy** (or a custom strategy if configured).
5. Summarizes everything in the console logs or an optional CSV.

> Check out the example scripts in [examples](examples) to learn more.  

---

## Core Modules

1. **`metrics.py`**  
   Computes financial metrics (Revenue, Net Income, margins, ROE, Free Cash Flow, IFRS expansions, interest coverage, etc.).

2. **`forecasting.py`**  
   Provides a **strategy-based** system for revenue forecasting.  
   - **`ArimaForecastStrategy`**: Uses ARIMA or SARIMAX if enough data is available.  
   - **`forecast_revenue()`**: A convenience function that calls the chosen strategy.

3. **`multi_period_analysis.py`**  
   Gathers multi-year or multi-quarter data, computing growth rates (YoY, QoQ) and CAGR.

4. **`orchestrator.py`**  
   High-level orchestration (`TickerOrchestrator`) to fetch EDGAR data, compute metrics, run multi-year analysis, forecast, and produce final outputs.

5. **`reporting.py`**  
   Summarizes results in a DataFrame, logs them, and optionally saves to CSV.

6. **`data_utils.py`, `synonyms_utils.py`, `synonyms.py`, `config.py`, `logging_utils.py`**  
   - Helpers for data parsing, synonyms, logging, numeric formatting, and config thresholds.

7. **`cli.py`**  
   Click-based command-line interface for `edgar-analytics`.

---

## Detailed Method Reference

Below is a select reference for the most commonly used methods. For a deeper or more technical reference, see the in-code docstrings.

| **Method**                                       | **Purpose**                                                                                                                                                        | **Usage**                                                                                                                                                                                   |
|--------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| **`cli.main()`**                                 | Entry point for the CLI (`edgar-analytics`).                                                                                                                       | **CLI**: Invoked by console script. Not typically called directly in Python code.                                                                                                            |
| **`TickerOrchestrator.analyze_company()`**       | Orchestrates data retrieval, multi-year analysis, forecasting, and final reporting for a main ticker + optional peers.                                            | **Python**: <br/>```python<br/>orch = TickerOrchestrator()<br/>orch.analyze_company("AAPL", ["MSFT"], csv_path="out.csv")<br/>```                                                           |
| **`metrics.get_single_filing_snapshot()`**       | Retrieves the latest 10-K or 10-Q for a Company, parses it into a dictionary of metrics + filing info.                                                             | **Internally** used by `_analyze_ticker_for_metrics()`.                                                                                                                                     |
| **`multi_period_analysis.retrieve_multi_year_data()`** | Fetches multiple 10-K/10-Q filings for multi-year or multi-quarter growth analysis, CAGR, etc.                                                                    | **Python**: <br/>```python<br/>data = retrieve_multi_year_data("AAPL", 3, 8)<br/>print(data["annual_data"], data["quarterly_data"])<br/>```                                                |
| **`forecasting.forecast_revenue()`**             | **Main entry** for forecasting next revenue (1-step) using a specified or default strategy (`ArimaForecastStrategy`). Clamps negatives to 0.0.                    | **CLI**: Called under the hood by the Orchestrator. <br/>**Python**: <br/>```python<br/>from edgar_analytics.forecasting import forecast_revenue<br/>fcst = forecast_revenue(rev_dict)``` |
| **`forecasting.ArimaForecastStrategy`**          | Default ARIMA-based strategy that tries ARIMA/SARIMAX model candidates and picks one by AIC.                                                                       | **Custom**: <br/>```python<br/>strategy = ArimaForecastStrategy()<br/>fcst = forecast_revenue(rev_data, strategy=strategy)<br/>```                                                         |
| **`reporting.ReportingEngine.summarize_metrics_table()`** | Builds a final summary of metrics for all tickers, logs them, and optionally saves to CSV.                                                                         | **Internally** used by `analyze_company()`, though you can call it directly with a valid `metrics_map`.                                                                                      |
| **`TickerDetector.validate_ticker_symbol()`**     | Regex-based validation of ticker format.                                                                                                                           | Used to filter out invalid tickers (e.g., `@@@`).                                                                                                                                           |
---

## Refined FCF Computation

Starting with **v0.1.0+**, EDGAR Analytics applies a smarter fallback when a direct **“capital expenditures”** line item is missing from the cash flow statement. Specifically:

1. **Checks for an explicit “capital expenditures” row** (via synonyms).  
2. **If not found**:  
   - **Takes the net investing outflow** (if negative).  
   - **Subtracts** intangible purchases and business acquisitions (when detected).  
   - The remainder is treated as **approximated “capex.”**  

This helps avoid over-counting large M&A deals or intangible acquisitions as ongoing capital expenditures. You can see this improvement in:
- `synonyms_utils.compute_capex_single_period` and `compute_capex_for_column`
- `metrics.py` for single filing snapshots.
- `multi_period_analysis.py` for multi-quarter data.

**Disclaimer**:  
Even with these refinements, certain unusual transactions may still appear in overall investing outflows. Thus, the library’s fallback remains an approximation for investor/analyst guidance only.

---

## Logging

- **Default Logging Level**: DEBUG (to stdout).
- Customize by editing `logging_utils.py` or adjusting the logger in your own script.
- Key log levels used:
  - **INFO**: Summaries, important progress messages.
  - **WARNING**: Alerts or potential issues (e.g., negative margin, high leverage).
  - **ERROR**: Failures in retrieval or parsing.

---

## Extensibility

**EDGAR Analytics** is modular, letting you add or override behavior easily:

1. **Synonyms**: Extend or override synonyms in `synonyms.py` for custom labeling.
2. **Alert Thresholds**: `config.py` contains thresholds for negative margins, high leverage, and negative FCF streaks.
3. **Additional Metrics**: Add new ratio logic to `metrics.py` or wherever suitable.
4. **Custom Forecast Strategies**:  
   - The library exposes a `ForecastStrategy` abstract base class in `forecasting.py`.
   - By default, we use `ArimaForecastStrategy`. You can **create your own** strategy subclass and pass it to `forecast_revenue()` (or the Orchestrator if you modify its internal logic).
   ```python
   from edgar_analytics.forecasting import ForecastStrategy, forecast_revenue
   from edgar_analytics.data_utils import parse_period_label

   class MyGrowthStrategy(ForecastStrategy):
       def forecast(self, rev_dict: dict, is_quarterly=False) -> float:
           # Example: always increase last known revenue by +5%
           if not rev_dict:
               return 0.0
           sorted_periods = sorted(rev_dict.keys(), key=parse_period_label)
           last_val = rev_dict[sorted_periods[-1]]
           return last_val * 1.05

   # Usage:
   data_map = {"2020": 100, "2021": 200}
   fcst = forecast_revenue(data_map, strategy=MyGrowthStrategy())
   print(fcst)
   ```
5. **Reporting Formats**: `ReportingEngine` logs and optionally writes CSV. Extend it for Excel, JSON, or any other output.

---

## Testing

We use **pytest** to ensure reliability and coverage.  
From the project root (where `pytest` can discover `tests/`):

```bash
pytest -v --cov=edgar_analytics --cov-report=term-missing
```

---

## Contributing

1. **Fork** & **Clone** the repo.
2. **Create a branch** for your feature or fix.
3. **Write Tests** for your changes; ensure existing tests still pass.
4. **Format your code** to comply with PEP8, PEP20, and PEP257.
5. **Push** to your fork and create a pull request.
6. **Discuss** & **refine** in the PR as needed.

### Coding Standards

- **PEP8**: Style guidelines.
- **PEP20**: Zen of Python (readability counts).
- **PEP257**: Docstring guidelines.
- **Security**: Validate all inputs, use robust error handling, do not trust external data blindly.
- **Tests**: Provide tests for new features or modifications to existing features.

---

## License

**MIT License**  
(See [LICENSE](LICENSE) for details.)

**Disclaimer**:  
This tool extracts data from public SEC filings. Always review official filings directly for investment decisions. The library and its maintainers are not responsible for any inaccuracies or any decisions made based on these outputs.

---

**Happy Analyzing!**  

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/zoharbabin/edgar_analytics",
    "name": "edgar-analytics",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": null,
    "author": "Zohar Babin",
    "author_email": "z.babin@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/14/21/55f5fde9ed9e5918dd559c9966896bf9b1af8dd79a7c98e6855a1e74c64a/edgar_analytics-0.1.0.tar.gz",
    "platform": null,
    "description": "# EDGAR Analytics\n\n**A Python library for analyzing SEC EDGAR filings, computing financial metrics, generating forecasts (with a strategy-based architecture), and producing clear summary reports.**\n\n---\n\n## Table of Contents\n1. [Overview](#overview)\n2. [Features](#features)\n3. [Installation](#installation)\n    - [Using `pip`](#using-pip)\n    - [Setting Up a Local Virtual Environment](#setting-up-a-local-virtual-environment)\n4. [Quick Start](#quick-start)\n5. [Usage](#usage)\n    - [CLI Usage](#cli-usage)\n    - [Programmatic Usage](#programmatic-usage)\n6. [Core Modules](#core-modules)\n7. [Detailed Method Reference](#detailed-method-reference)\n8. [Logging](#logging)\n9. [Extensibility](#extensibility)\n10. [Testing](#testing)\n11. [Contributing](#contributing)\n12. [License](#license)\n\n---\n\n## Overview\n\n**EDGAR Analytics** is a Python library designed to:\n\n- Retrieve and parse financial statements from the SEC\u2019s [EDGAR](https://www.sec.gov/edgar.shtml) system (via [edgartools](https://pypi.org/project/edgartools/)).\n- Compute key financial metrics (including GAAP, IFRS, intangible, goodwill, net debt, etc.).\n- Optionally **forecast revenue** using a **strategy-based** architecture (ARIMA by default).\n- Summarize results (logging + CSV outputs).\n\nIt handles complexities such as:\n\n- Multi-year and multi-quarter filing retrieval (10-K and 10-Q).\n- Flexible synonyms to accommodate IFRS/GAAP labeling differences.\n- Automatic sign-flipping of negative expenses.\n- Common ratio calculations (Current Ratio, Debt-to-Equity, Free Cash Flow, etc.).\n- Alert detection for negative margins, high leverage, negative free cash flow streaks, spikes in working capital.\n- **Strategy-based forecasting** with a default ARIMA approach or customizable forecasting strategy.\n- A fully scriptable **CLI**.\n\n**IFRS & GAAP Filers**  \nSupports both U.S. GAAP and IFRS filers. Foreign private issuers that file IFRS-based statements (e.g., Form 20-F) are automatically handled if synonyms match IFRS-labeled line items.\n\n---\n\n## Features\n\n- **Annual & Quarterly Snapshots**: Retrieve the latest 10-K (annual) and 10-Q (quarterly) metrics for a given company.\n- **Multi-Year Analysis**: Pull multiple 10-K and 10-Q statements to compute YoY growth, QoQ growth, and CAGR.\n- **Key Ratios & Metrics**: Current Ratio, Debt-to-Equity, Free Cash Flow, margins, intangible ratios, net debt, lease liabilities, and more.\n- **Alerts**:\n  - Negative net margin\n  - High leverage (debt-to-equity above threshold)\n  - Consecutive quarters of negative free cash flow\n  - Significant quarterly spikes in inventory/receivables\n- **Revenue Forecasting**:\n  - By default, uses an **ARIMA-based** strategy for annual or quarterly data.\n  - Easily swap in your **own forecasting logic** by implementing a custom strategy (see [Extensibility](#extensibility)).\n- **Command-Line Interface (CLI)**: Analyze one or more tickers with a single command.\n- **Optional CSV Output**: Save summarized metrics to CSV.\n\n---\n\n## Installation\n\nYou can install EDGAR Analytics using `pip` or set it up within a local virtual environment for development.\n\n### Using `pip`\n\n```bash\npip install edgar-analytics\n```\n\n### Setting Up a Local Virtual Environment\n\n1. **Create a Virtual Environment**  \n\n   ```bash\n   python3 -m venv venv\n   source venv/bin/activate   # or venv\\Scripts\\activate on Windows\n   pip install --upgrade pip setuptools wheel\n   ```\n\n2. **Install in Editable Mode (for contributing)**  \n\n   ```bash\n   pip install -e .[test]\n   ```\n\n   Make sure you\u2019re in the repo\u2019s root directory (where `setup.py` is located).\n\n---\n\n## Quick Start\n\nOnce installed, **analyze a company** via the CLI:\n\n```bash\nedgar-analytics AAPL --csv my_report.csv\n```\n\nThis command:\n\n- Fetches Apple\u2019s latest annual (10-K) and quarterly (10-Q) metrics.\n- Retrieves multi-year data for deeper analysis (CAGR, YoY growth, etc.).\n- Automatically forecasts next annual and quarterly revenue using the default ARIMA strategy.\n- Logs everything to the console and saves a CSV summary to `my_report.csv`.\n\n---\n\n## Usage\n\n### CLI Usage\n\nUse the `edgar-analytics` command, with arguments for the main ticker, optional peers, and an optional `--csv` path:\n\n```bash\nedgar-analytics TICKER [PEER1 [PEER2 ...]] [--csv output.csv]\n```\n\n**Examples**:\n\n1. **Single Ticker**  \n   ```bash\n   edgar-analytics AAPL\n   ```\n   Fetches data for AAPL, logs results to the console.\n\n2. **With Peers + CSV**  \n   ```bash\n   edgar-analytics AAPL MSFT GOOGL --csv analysis_results.csv\n   ```\n   Compares AAPL to MSFT & GOOGL, logs aggregated results, and writes a CSV to `analysis_results.csv`.\n\n3. **Invalid Tickers**  \n   If any ticker is invalid (e.g., `@BADTICKER`), the CLI logs a warning and skips it.\n\n### Programmatic Usage\n\nYou can also **use the library in your own Python scripts**. For example:\n\n```python\nfrom edgar_analytics.orchestrator import TickerOrchestrator\n\ndef main():\n    orchestrator = TickerOrchestrator()\n    orchestrator.analyze_company(\n        ticker=\"AAPL\",\n        peers=[],\n        csv_path=\"analysis_outputs/aapl_summary.csv\"\n    )\n\nif __name__ == \"__main__\":\n    main()\n```\n\n**Key Steps** (inside `analyze_company`):\n\n1. Validates the ticker symbol (e.g., `AAPL`).\n2. Fetches annual (10-K) and quarterly (10-Q) snapshots.\n3. Retrieves multi-year data, computing YoY, CAGR, etc.\n4. **Forecasts annual & quarterly revenue** using the **default ARIMA strategy** (or a custom strategy if configured).\n5. Summarizes everything in the console logs or an optional CSV.\n\n> Check out the example scripts in [examples](examples) to learn more.  \n\n---\n\n## Core Modules\n\n1. **`metrics.py`**  \n   Computes financial metrics (Revenue, Net Income, margins, ROE, Free Cash Flow, IFRS expansions, interest coverage, etc.).\n\n2. **`forecasting.py`**  \n   Provides a **strategy-based** system for revenue forecasting.  \n   - **`ArimaForecastStrategy`**: Uses ARIMA or SARIMAX if enough data is available.  \n   - **`forecast_revenue()`**: A convenience function that calls the chosen strategy.\n\n3. **`multi_period_analysis.py`**  \n   Gathers multi-year or multi-quarter data, computing growth rates (YoY, QoQ) and CAGR.\n\n4. **`orchestrator.py`**  \n   High-level orchestration (`TickerOrchestrator`) to fetch EDGAR data, compute metrics, run multi-year analysis, forecast, and produce final outputs.\n\n5. **`reporting.py`**  \n   Summarizes results in a DataFrame, logs them, and optionally saves to CSV.\n\n6. **`data_utils.py`, `synonyms_utils.py`, `synonyms.py`, `config.py`, `logging_utils.py`**  \n   - Helpers for data parsing, synonyms, logging, numeric formatting, and config thresholds.\n\n7. **`cli.py`**  \n   Click-based command-line interface for `edgar-analytics`.\n\n---\n\n## Detailed Method Reference\n\nBelow is a select reference for the most commonly used methods. For a deeper or more technical reference, see the in-code docstrings.\n\n| **Method**                                       | **Purpose**                                                                                                                                                        | **Usage**                                                                                                                                                                                   |\n|--------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n| **`cli.main()`**                                 | Entry point for the CLI (`edgar-analytics`).                                                                                                                       | **CLI**: Invoked by console script. Not typically called directly in Python code.                                                                                                            |\n| **`TickerOrchestrator.analyze_company()`**       | Orchestrates data retrieval, multi-year analysis, forecasting, and final reporting for a main ticker + optional peers.                                            | **Python**: <br/>```python<br/>orch = TickerOrchestrator()<br/>orch.analyze_company(\"AAPL\", [\"MSFT\"], csv_path=\"out.csv\")<br/>```                                                           |\n| **`metrics.get_single_filing_snapshot()`**       | Retrieves the latest 10-K or 10-Q for a Company, parses it into a dictionary of metrics + filing info.                                                             | **Internally** used by `_analyze_ticker_for_metrics()`.                                                                                                                                     |\n| **`multi_period_analysis.retrieve_multi_year_data()`** | Fetches multiple 10-K/10-Q filings for multi-year or multi-quarter growth analysis, CAGR, etc.                                                                    | **Python**: <br/>```python<br/>data = retrieve_multi_year_data(\"AAPL\", 3, 8)<br/>print(data[\"annual_data\"], data[\"quarterly_data\"])<br/>```                                                |\n| **`forecasting.forecast_revenue()`**             | **Main entry** for forecasting next revenue (1-step) using a specified or default strategy (`ArimaForecastStrategy`). Clamps negatives to 0.0.                    | **CLI**: Called under the hood by the Orchestrator. <br/>**Python**: <br/>```python<br/>from edgar_analytics.forecasting import forecast_revenue<br/>fcst = forecast_revenue(rev_dict)``` |\n| **`forecasting.ArimaForecastStrategy`**          | Default ARIMA-based strategy that tries ARIMA/SARIMAX model candidates and picks one by AIC.                                                                       | **Custom**: <br/>```python<br/>strategy = ArimaForecastStrategy()<br/>fcst = forecast_revenue(rev_data, strategy=strategy)<br/>```                                                         |\n| **`reporting.ReportingEngine.summarize_metrics_table()`** | Builds a final summary of metrics for all tickers, logs them, and optionally saves to CSV.                                                                         | **Internally** used by `analyze_company()`, though you can call it directly with a valid `metrics_map`.                                                                                      |\n| **`TickerDetector.validate_ticker_symbol()`**     | Regex-based validation of ticker format.                                                                                                                           | Used to filter out invalid tickers (e.g., `@@@`).                                                                                                                                           |\n---\n\n## Refined FCF Computation\n\nStarting with **v0.1.0+**, EDGAR Analytics applies a smarter fallback when a direct **\u201ccapital expenditures\u201d** line item is missing from the cash flow statement. Specifically:\n\n1. **Checks for an explicit \u201ccapital expenditures\u201d row** (via synonyms).  \n2. **If not found**:  \n   - **Takes the net investing outflow** (if negative).  \n   - **Subtracts** intangible purchases and business acquisitions (when detected).  \n   - The remainder is treated as **approximated \u201ccapex.\u201d**  \n\nThis helps avoid over-counting large M&A deals or intangible acquisitions as ongoing capital expenditures. You can see this improvement in:\n- `synonyms_utils.compute_capex_single_period` and `compute_capex_for_column`\n- `metrics.py` for single filing snapshots.\n- `multi_period_analysis.py` for multi-quarter data.\n\n**Disclaimer**:  \nEven with these refinements, certain unusual transactions may still appear in overall investing outflows. Thus, the library\u2019s fallback remains an approximation for investor/analyst guidance only.\n\n---\n\n## Logging\n\n- **Default Logging Level**: DEBUG (to stdout).\n- Customize by editing `logging_utils.py` or adjusting the logger in your own script.\n- Key log levels used:\n  - **INFO**: Summaries, important progress messages.\n  - **WARNING**: Alerts or potential issues (e.g., negative margin, high leverage).\n  - **ERROR**: Failures in retrieval or parsing.\n\n---\n\n## Extensibility\n\n**EDGAR Analytics** is modular, letting you add or override behavior easily:\n\n1. **Synonyms**: Extend or override synonyms in `synonyms.py` for custom labeling.\n2. **Alert Thresholds**: `config.py` contains thresholds for negative margins, high leverage, and negative FCF streaks.\n3. **Additional Metrics**: Add new ratio logic to `metrics.py` or wherever suitable.\n4. **Custom Forecast Strategies**:  \n   - The library exposes a `ForecastStrategy` abstract base class in `forecasting.py`.\n   - By default, we use `ArimaForecastStrategy`. You can **create your own** strategy subclass and pass it to `forecast_revenue()` (or the Orchestrator if you modify its internal logic).\n   ```python\n   from edgar_analytics.forecasting import ForecastStrategy, forecast_revenue\n   from edgar_analytics.data_utils import parse_period_label\n\n   class MyGrowthStrategy(ForecastStrategy):\n       def forecast(self, rev_dict: dict, is_quarterly=False) -> float:\n           # Example: always increase last known revenue by +5%\n           if not rev_dict:\n               return 0.0\n           sorted_periods = sorted(rev_dict.keys(), key=parse_period_label)\n           last_val = rev_dict[sorted_periods[-1]]\n           return last_val * 1.05\n\n   # Usage:\n   data_map = {\"2020\": 100, \"2021\": 200}\n   fcst = forecast_revenue(data_map, strategy=MyGrowthStrategy())\n   print(fcst)\n   ```\n5. **Reporting Formats**: `ReportingEngine` logs and optionally writes CSV. Extend it for Excel, JSON, or any other output.\n\n---\n\n## Testing\n\nWe use **pytest** to ensure reliability and coverage.  \nFrom the project root (where `pytest` can discover `tests/`):\n\n```bash\npytest -v --cov=edgar_analytics --cov-report=term-missing\n```\n\n---\n\n## Contributing\n\n1. **Fork** & **Clone** the repo.\n2. **Create a branch** for your feature or fix.\n3. **Write Tests** for your changes; ensure existing tests still pass.\n4. **Format your code** to comply with PEP8, PEP20, and PEP257.\n5. **Push** to your fork and create a pull request.\n6. **Discuss** & **refine** in the PR as needed.\n\n### Coding Standards\n\n- **PEP8**: Style guidelines.\n- **PEP20**: Zen of Python (readability counts).\n- **PEP257**: Docstring guidelines.\n- **Security**: Validate all inputs, use robust error handling, do not trust external data blindly.\n- **Tests**: Provide tests for new features or modifications to existing features.\n\n---\n\n## License\n\n**MIT License**  \n(See [LICENSE](LICENSE) for details.)\n\n**Disclaimer**:  \nThis tool extracts data from public SEC filings. Always review official filings directly for investment decisions. The library and its maintainers are not responsible for any inaccuracies or any decisions made based on these outputs.\n\n---\n\n**Happy Analyzing!**  \n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A library and CLI tool for analyzing SEC EDGAR filings with financial metrics and forecasting.",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/zoharbabin/edgar_analytics"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "76c2ef63ab9e995b059d16f2c11b7ac32041c582b25c845abe49e81f0788bc2f",
                "md5": "7ac52e18d0a3f9cec155ec7ffa363ce7",
                "sha256": "1822b91285cb7587907f45f08dded923e1dbbe1295ebe4169d12e42d1b9a9a59"
            },
            "downloads": -1,
            "filename": "edgar_analytics-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "7ac52e18d0a3f9cec155ec7ffa363ce7",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 49236,
            "upload_time": "2025-01-20T09:43:46",
            "upload_time_iso_8601": "2025-01-20T09:43:46.605495Z",
            "url": "https://files.pythonhosted.org/packages/76/c2/ef63ab9e995b059d16f2c11b7ac32041c582b25c845abe49e81f0788bc2f/edgar_analytics-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "142155f5fde9ed9e5918dd559c9966896bf9b1af8dd79a7c98e6855a1e74c64a",
                "md5": "ea4770dd88f190b5d15cee82b0ba40ea",
                "sha256": "2a068a8764e4196eb735f5de135bf5a8e775af40c6a7ccc6c64d19290909cfcf"
            },
            "downloads": -1,
            "filename": "edgar_analytics-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "ea4770dd88f190b5d15cee82b0ba40ea",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 50257,
            "upload_time": "2025-01-20T09:43:48",
            "upload_time_iso_8601": "2025-01-20T09:43:48.622873Z",
            "url": "https://files.pythonhosted.org/packages/14/21/55f5fde9ed9e5918dd559c9966896bf9b1af8dd79a7c98e6855a1e74c64a/edgar_analytics-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-20 09:43:48",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "zoharbabin",
    "github_project": "edgar_analytics",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "pandas",
            "specs": []
        },
        {
            "name": "numpy",
            "specs": []
        },
        {
            "name": "statsmodels",
            "specs": []
        },
        {
            "name": "edgartools",
            "specs": []
        },
        {
            "name": "click",
            "specs": []
        },
        {
            "name": "pytest",
            "specs": []
        },
        {
            "name": "pytz",
            "specs": []
        },
        {
            "name": "pytest-xdist",
            "specs": []
        }
    ],
    "lcname": "edgar-analytics"
}
        
Elapsed time: 1.14065s