With GraphRAG emerging as the most advanced Retrieval-Augmented Generation (RAG) approach, this competition challenges you to develop an intelligent agent that can query and reason over graph data with precision and efficiency.
Traditional RAG systems rely solely on vector-based retrieval, which can lead to hallucinations, context fragmentation, and a lack of structured reasoning. GraphRAG addresses these shortcomings by integrating graph-based retrieval, preserving contextual relationships between entities and enabling more accurate, structured, and interpretable AI-generated responses.
By leveraging GraphRAG, participants can minimize hallucinations, improve knowledge retrieval, and enhance AI-generated insights - a fundamental breakthrough for enterprises using generative AI in business-critical applications.
At the highest level, the Hackathon involves the following steps/deliverables:
Choose a dataset that is relevant to something of interest to you or your organization. ArangoDB recommends a few public datasets if you don’t have your own.
Convert/load the dataset into NetworkX.
Persist the NetworkX data to a graph within ArangoDB.
Build an Agentic App on top of the graph that processes natural language queries.
The bulk of the creative effort in the Hackathon will relate to Step 4 - Building the Agentic App. The previous 3 steps are simply a lead up to that effort.
Bring Your Own Data (BYOD) – Use open-source datasets relevant to specific industries or a use case of interest to you or your team (e.g., social networks, cybersecurity, supply chain, healthcare, transportation, etc.). This dataset should be compatible with a graph structure (either already in a graph or convertible to a graph). This data will be loaded into NetworkX (and then persisted to ArangoDB) as one of the steps in the Hackathon. For instance, if you want to explore some publicly-available datasets, consider the following sites:
Stanford Large Network Dataset Collection
Netzschleuder: the network catalogue, repository and centrifuge
Use one of ArangoDB’s Provided Datasets – These are pre-configured graph datasets provided by ArangoDB. Note that these datasets can be loaded directly into ArangoDB, thereby allowing you to skip the “Data Preparation into NetworkX” stage:
Feb. 10, 2025 - March 10, 2025
ArangoDB
Online
$29,750