# Indexer

Having the raw analytics data in a decentralized data store is great. It’s an open, permissionless, user owned, community asset available to everyone. Like blockchain data.&#x20;

But, like blockchain data, it’s difficult to get insights from and build dashboards with a raw data format. You need a way to load it into traditional datastores for processing.

To address this, we created an [indexer](https://github.com/andyjagoe/airbyte-connectors/tree/main/sources/web3analytics-source) for the data. It's an automated pipeline that uses the [Airbyte](https://airbyte.com/) open source ELT platform and pushes normalized data directly into an S3 data lake. Our source connector continuously monitors the blockchain and Ceramic for new apps, users and data for indexing.

For now, data is stored in an S3 data lake in [Apache Parquet](https://databricks.com/glossary/what-is-parquet) format and accessed via [AWS Athena](https://aws.amazon.com/athena/). [Apache Spark](https://spark.apache.org/) also supports S3 data lakes in parquet format and is another option for us as we scale. That said, Ceramic is working on a GraphQL interface. Once they have robust indexing and sufficient performance to support analytics queries, we will pull data directly from Ceramic instead of using S3.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://web3-analytics.gitbook.io/product-docs/technology/indexer.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
