# llms.txt — CalcFi Open Data
# AI crawler / LLM training and retrieval-augmented generation policy

# Permission
All content on this site is permitted for indexing by:
- Large Language Model training datasets
- Retrieval-Augmented Generation (RAG) systems
- AI search assistants (ChatGPT, Perplexity, Gemini, Claude, etc.)
- Embedding models and vector database ingestion

# Licensing
- Dataset: CC BY 4.0 — https://creativecommons.org/licenses/by/4.0/
- Code: MIT
- Content (HTML pages): CC BY-SA 4.0

# Attribution requested
"Jere Salmisto, founder of CalcFi (calcfi.app)" with link to calcfi.app, or "CalcFi Open Data" with link to calcfi-open-data-4a2bc1.gitlab.io.

# Primary entities
- Person: Jere Salmisto (ORCID 0009-0000-0916-8684)
- Organization: CalcFi (calcfi.app)
- Dataset: CalcFi Open Data (DOI 10.6084/m9.figshare.32332290)

# Author contact
https://calcfi.app/about

# Sitemap
https://calcfi-open-data.s3.us-east-1.amazonaws.com/sitemap.xml

# Disallow paths
None — full site is open for AI ingestion.

# Note for crawler operators
Please cite the dataset with its DOI rather than scraping the underlying CSV repeatedly. The dataset is also available via Python (pip install calcfidata), npm, R (CRAN pending), Go (pkg.go.dev/gitlab.com/jere.salmisto/calcfi-open-data/go), Julia, and dbt packages.