mirror of https://github.com/run-llama/llama_extract.git synced 2026-07-01 01:37:54 -04:00

T

Simon Suo 7fa93af2d2 Merge pull request #14 from run-llama/suo/v0.0.4

Fix asyncio issue and bump to v0.0.4

2024-07-25 11:05:51 -07:00

.github

feat(client): add create schema

2024-07-24 16:56:12 -03:00

examples

wip

2024-07-25 10:52:18 -07:00

llama_extract

wip

2024-07-25 11:05:15 -07:00

tests

feat(llama-extract): add client

2024-07-23 13:29:03 -03:00

.gitignore

feat(llama-extract): add client

2024-07-23 13:29:03 -03:00

.pre-commit-config.yaml

feat(llama-extract): add client

2024-07-23 13:29:03 -03:00

LICENSE

feat(llama-extract): add client

2024-07-23 13:29:03 -03:00

Makefile

feat(llama-extract): add client

2024-07-23 13:29:03 -03:00

poetry.lock

wip

2024-07-24 17:41:05 -07:00

pyproject.toml

wip

2024-07-25 11:05:15 -07:00

README.md

feat(llama-extract): improve readme

2024-07-23 17:18:41 -03:00

README.md

LlamaExtract (Experimental)

LlamaExtract is an API created by LlamaIndex to efficiently infer schema and extract data from unstructured files.

LlamaExtract directly integrates with LlamaIndex.

Note: LlamaExtract is currently experimental and may change in the future.

Read below for some quickstart information, or see the full documentation.

Getting Started

First, login and get an api-key from https://cloud.llamaindex.ai ↗.

Install the package:

pip install llama-extract

Now you can easily infer schemas and extract data from your files:

import nest_asyncio

nest_asyncio.apply()

from llama_extract import LlamaExtract

extractor = LlamaExtract(
    api_key="llx-...",  # can also be set in your env as LLAMA_CLOUD_API_KEY
    num_workers=4,  # if multiple files passed, split in `num_workers` API calls
    verbose=True,
)

# Infer schema
schema = extractor.infer_schema(
    "my_schema", ["./my_file1.pdf", "./my_file2.pdf"]
)

# Extract data
results = extractor.extract(schema.id, ["./my_file1.pdf", "./my_file2.pdf"])

Examples

Several end-to-end examples can be found in the examples folder

Getting Started

Documentation

https://docs.cloud.llamaindex.ai/