microsoft

    microsoft/graphrag

    A modular graph-based Retrieval-Augmented Generation (RAG) system

    ai
    llm
    gpt
    gpt-4
    gpt4
    graphrag
    llms
    rag
    Python
    MIT
    28.0K stars
    2.9K forks
    28.0K watching
    Updated 2/27/2026
    View on GitHub
    Backblaze Advertisement

    Loading star history...

    Health Score

    24.18

    Weekly Growth

    +0

    +0.0% this week

    Contributors

    1

    Total contributors

    Open Issues

    146

    Generated Insights

    About graphrag

    GraphRAG

    👉 Microsoft Research Blog Post
    👉 Read the docs
    👉 GraphRAG Arxiv

    Overview

    The GraphRAG project is a data pipeline and transformation suite that is designed to extract meaningful, structured data from unstructured text using the power of LLMs.

    To learn more about GraphRAG and how it can be used to enhance your LLM's ability to reason about your private data, please visit the Microsoft Research Blog Post.

    Quickstart

    To get started with the GraphRAG system we recommend trying the command line quickstart.

    Repository Guidance

    This repository presents a methodology for using knowledge graph memory structures to enhance LLM outputs. Please note that the provided code serves as a demonstration and is not an officially supported Microsoft offering.

    ⚠️ Warning: GraphRAG indexing can be an expensive operation, please read all of the documentation to understand the process and costs involved, and start small.

    Diving Deeper

    Prompt Tuning

    Using GraphRAG with your data out of the box may not yield the best possible results. We strongly recommend to fine-tune your prompts following the Prompt Tuning Guide in our documentation.

    Versioning

    Please see the breaking changes document for notes on our approach to versioning the project.

    Always run graphrag init --root [path] --force between minor version bumps to ensure you have the latest config format. Run the provided migration notebook between major version bumps if you want to avoid re-indexing prior datasets. Note that this will overwrite your configuration and prompts, so backup if necessary.

    Responsible AI FAQ

    See RAI_TRANSPARENCY.md

    Trademarks

    This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

    Privacy

    Microsoft Privacy Statement

    Discover Repositories

    Search across tracked repositories by name or description