Changelog

Unreleased

Added

Changed

  • Converted the README badges section from HTML <p> tags to Markdown for better compatibility across platforms.

[0.1.4] - 02-12-2025

Added

  • New function clean_data() added for data cleaning and preprocessing, which is now a separate step rather than part of the data extraction function.

  • New documentation page for Data Cleaning added:
      • docs/usage/data-cleaning.md
      • Added to mkdocs.yml navigation.

  • New API overview documentation page added:
      • docs/api.md
      • Added to mkdocs.yml navigation.

  • New mkdocstrings configuration added to mkdocs.yml for automatic API documentation generation.

  • New tests added for remaining utils functions.

  • Added test coverage tracking (50%) with pytest-cov, and coverage report generation with Codecov.

Fixed

  • Tests updated to reflect changes in the data cleaning process.

Removed

  • Arguments related to data cleaning removed from the data extraction function.

Changed

[0.1.3] - 04-11-2025

Fixed

  • Updated the RecursiveCharacterTextSplitter import for the latest langchain version to avoid import errors:
      • Changed from: from langchain.text_splitter import RecursiveCharacterTextSplitter
      • To: from langchain.text_splitter.recursive_character import RecursiveCharacterTextSplitter
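
For downstream code that has to run against more than one langchain version, a fallback import is one way to tolerate both module layouts named in this entry. The sketch below is illustrative only and is not how ComProScanner itself resolves the import: import_first and SPLITTER_MODULES are hypothetical names, and the two module paths are copied from the entry above rather than verified against any particular langchain release.

```python
import importlib


def import_first(candidates, attr):
    """Return `attr` from the first module in `candidates` that provides it.

    Hypothetical helper, not part of ComProScanner or langchain.
    """
    errors = []
    for module_name in candidates:
        try:
            module = importlib.import_module(module_name)
            return getattr(module, attr)
        except (ImportError, AttributeError) as exc:
            # Remember why this candidate failed, then try the next one.
            errors.append(f"{module_name}: {exc}")
    raise ImportError(f"could not import {attr!r}; tried: " + "; ".join(errors))


# Module paths as listed in this changelog entry, newest first (assumed, not verified).
SPLITTER_MODULES = [
    "langchain.text_splitter.recursive_character",
    "langchain.text_splitter",
]
```

With langchain installed, import_first(SPLITTER_MODULES, "RecursiveCharacterTextSplitter") would return the class from whichever path resolves first; if neither does, the raised ImportError lists every path that was tried.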

[0.1.2] - 24-10-2025

Added

  • Link to ComProScanner preprint on arXiv in the documentation index page and README.md: arXiv:2510.20362

[0.1.1] - 22-10-2025

Fixed

[0.1.0] - 22-10-2025

Added

  • Initial release of ComProScanner.