Simon Suo 7fa93af2d2 Merge pull request #14 from run-llama/suo/v0.0.4
Fix asyncio issue and bump to v0.0.4
2024-07-25 11:05:51 -07:00
2024-07-24 16:56:12 -03:00
wip
2024-07-25 10:52:18 -07:00
wip
2024-07-25 11:05:15 -07:00
2024-07-23 13:29:03 -03:00
2024-07-23 13:29:03 -03:00
2024-07-23 13:29:03 -03:00
2024-07-23 13:29:03 -03:00
wip
2024-07-24 17:41:05 -07:00
wip
2024-07-25 11:05:15 -07:00
2024-07-23 17:18:41 -03:00

LlamaExtract (Experimental)

LlamaExtract is an API created by LlamaIndex to efficiently infer schema and extract data from unstructured files.

LlamaExtract directly integrates with LlamaIndex.

Note: LlamaExtract is currently experimental and may change in the future.

Read below for some quickstart information, or see the full documentation.

Getting Started

First, login and get an api-key from https://cloud.llamaindex.ai ↗.

Install the package:

pip install llama-extract

Now you can easily infer schemas and extract data from your files:

import nest_asyncio

nest_asyncio.apply()

from llama_extract import LlamaExtract

extractor = LlamaExtract(
    api_key="llx-...",  # can also be set in your env as LLAMA_CLOUD_API_KEY
    num_workers=4,  # if multiple files passed, split in `num_workers` API calls
    verbose=True,
)

# Infer schema
schema = extractor.infer_schema(
    "my_schema", ["./my_file1.pdf", "./my_file2.pdf"]
)

# Extract data
results = extractor.extract(schema.id, ["./my_file1.pdf", "./my_file2.pdf"])

Examples

Several end-to-end examples can be found in the examples folder

Documentation

https://docs.cloud.llamaindex.ai/

S
Description
No description provided
Readme 2.3 MiB
Latest
2025-01-28 23:47:46 -05:00