fix: add client-side resilience for S3 transaction race condition

Address the S3 race condition (LI-5775) where the backend DB may commit a SUCCESS status before the S3 data write fully propagates, causing transient 500 errors or null data when fetching extraction runs. Changes: - Add _get_run_with_data_check() wrapper in _wait_for_job_result that detects SUCCESS runs with null data and retries with exponential backoff (2s, 4s, 8s) to allow S3 writes to propagate - Increase run_retry_attempts from 3 to 5 and run_max_wait from 4/20 to 30 for more resilient handling of transient 500 errors from the backend's S3 read failures - Add warning messages when the race condition pattern is detected to aid debugging Co-authored-by: George He <georgewho96@gmail.com>
More robust extract tests with pytest xdist (#1117 )
2026-07-01 21:44:37 -04:00 · 2026-02-17 21:33:10 +00:00 · 2026-02-16 16:16:15 -08:00 · 2026-02-14 21:03:05 -06:00 · 2026-02-13 15:29:09 -08:00 · 2026-02-13 15:20:52 -08:00
77 changed files with 910 additions and 207 deletions
@@ -0,0 +1,162 @@
+name: Extract E2E Tests (every 4 hours)
+
+on:
+  schedule:
+    - cron: "0 */4 * * *"
+  workflow_dispatch:
+    # Allows manual triggering
+    inputs:
+      environment:
+        description: "Environment to run the tests in"
+        required: false
+        default: staging
+        type: choice
+        options:
+          - staging
+          - production
+      notify_slack:
+        description: "Notify Slack"
+        required: false
+        default: false
+        type: boolean
+  workflow_call:
+
+env:
+  UV_VERSION: "0.7.20"
+  PYTHON_VERSION: "3.12"
+  SLACK_CHANNEL_ID: C078PHNTF44 # Extract channel ID
+  API_E2E_LOG_PATH: ${{ github.workspace }}/extract-e2e.log
+
+jobs:
+  extract-e2e:
+    name: "Extract E2E Tests (${{ matrix.environment }})"
+    runs-on: ubuntu-latest
+    timeout-minutes: 30
+    concurrency:
+      group: ${{ github.workflow }}-${{ github.ref }}-${{ matrix.environment }}
+      cancel-in-progress: true
+    strategy:
+      fail-fast: false
+      matrix:
+        environment: ${{ github.event_name == 'schedule' && fromJson('["staging", "production"]') || fromJson(format('["{0}"]', github.event.inputs.environment || 'staging')) }}
+    steps:
+      - name: Set runtime inputs
+        id: runtime
+        run: |
+          environment=${{ matrix.environment }}
+          notify_slack=${{ github.event.inputs.notify_slack || github.event_name == 'schedule' }}
+          echo "environment=${environment}" >> $GITHUB_OUTPUT
+          echo "notify_slack=${notify_slack}" >> $GITHUB_OUTPUT
+
+          if [ "${environment}" = "production" ]; then
+            echo "LLAMA_CLOUD_BASE_URL=https://api.cloud.llamaindex.ai" >> $GITHUB_ENV
+            api_key_secret="${{ secrets.LLAMA_CLOUD_API_KEY }}"
+            project_id_secret="${{ secrets.LLAMA_CLOUD_PROJECT_ID }}"
+          else
+            echo "LLAMA_CLOUD_BASE_URL=https://api.staging.llamaindex.ai" >> $GITHUB_ENV
+            api_key_secret="${{ secrets.LLAMA_CLOUD_API_KEY_STAGING }}"
+            project_id_secret="${{ secrets.LLAMA_CLOUD_PROJECT_ID_STAGING }}"
+          fi
+
+          if [ -n "$api_key_secret" ]; then
+            echo "LLAMA_CLOUD_API_KEY=$api_key_secret" >> $GITHUB_ENV
+          fi
+
+          if [ -n "$project_id_secret" ]; then
+            echo "LLAMA_CLOUD_PROJECT_ID=$project_id_secret" >> $GITHUB_ENV
+          fi
+
+      - uses: actions/checkout@v5
+        with:
+          fetch-depth: 0
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v7
+        with:
+          version: ${{ env.UV_VERSION }}
+
+      - name: Set up Python
+        run: uv python install ${{ env.PYTHON_VERSION }} && uv python pin ${{ env.PYTHON_VERSION }}
+
+      - name: Run Extract E2E tests
+        id: extract-tests
+        continue-on-error: true
+        working-directory: py
+        run: |
+          set -o pipefail
+          rm -f "$API_E2E_LOG_PATH"
+          uv run pytest -v -n 8 --timeout=300 --session-timeout=1740 tests/extract/ 2>&1 | tee "$API_E2E_LOG_PATH"
+
+      - name: Extract pytest failure summary
+        id: failed-tests
+        if: steps.extract-tests.outcome == 'failure' || cancelled()
+        run: |
+          summary="$(python3 - <<'PY'
+          import os
+          import re
+          from pathlib import Path
+
+          log_path = Path(os.environ["API_E2E_LOG_PATH"])
+          if not log_path.exists():
+              print("Test log not found.")
+              raise SystemExit(0)
+
+          lines = log_path.read_text(errors="ignore").splitlines()
+
+          # Find the "short test summary info" section
+          start = None
+          for i, line in enumerate(lines):
+              if line.startswith("=") and "short test summary info" in line:
+                  start = i + 1
+                  break
+
+          if start is None:
+              print("No test summary found.")
+              raise SystemExit(0)
+
+          # Extract just the FAILED/ERROR lines (test name + short reason)
+          failed_tests = []
+          for line in lines[start:]:
+              if line.startswith("="):
+                  break  # End of section
+              if line.startswith("FAILED ") or line.startswith("ERROR "):
+                  # Extract test name and truncate the error message
+                  match = re.match(r"(FAILED|ERROR) ([\w/:.\[\]_-]+)", line)
+                  if match:
+                      failed_tests.append(f"{match.group(1)}: {match.group(2)}")
+
+          if failed_tests:
+              print("\n".join(failed_tests[:20]))  # Limit to 20 tests max
+          else:
+              print("No failed tests found in summary.")
+          PY
+          )"
+          if [ -z "$summary" ]; then
+            summary="Failed test summary not available. Review the full run logs."
+          fi
+          {
+            printf 'summary<<EOF\n%s\nEOF\n' "$summary"
+          } >> "$GITHUB_OUTPUT"
+
+      - name: Check test results
+        if: always()
+        run: |
+          if [ "${{ steps.extract-tests.outcome }}" == "failure" ]; then
+            echo "Extract E2E tests failed"
+            exit 1
+          fi
+
+      - name: Post to Extract Slack channel
+        id: slack
+        if: (failure() || cancelled()) && steps.runtime.outputs.notify_slack == 'true'
+        uses: slackapi/slack-github-action@v2.1.1
+        with:
+          channel-id: ${{ env.SLACK_CHANNEL_ID }}
+          slack-message: |
+            :red_circle: *Extract E2E Failed* (${{ steps.runtime.outputs.environment }})
+            ```
+            ${{ steps.failed-tests.outputs.summary }}
+            ```
+            <${{ github.server_url }}/${{ github.repository }}/actions/runs/${{ github.run_id }}|View Run>
+        env:
+          SLACK_BOT_TOKEN: ${{ secrets.SLACK_BOT_TOKEN }}
@@ -22,7 +22,7 @@ repos:
    hooks:
      - id: ruff
        args: [--fix, --exit-non-zero-on-fix]
-        exclude: ".*uv.lock"
+        exclude: ".*uv.lock|examples/"
  - repo: https://github.com/psf/black-pre-commit-mirror
    rev: 23.10.1
    hooks:
@@ -3,6 +3,16 @@
 [![Discord](https://img.shields.io/discord/1059199217496772688)](https://discord.gg/dGcwcsnxhU)

 # Llama Cloud Services
+> **⚠️ DEPRECATION NOTICE**
+>
+> This repository and its packages are deprecated and will be maintained until **May 1, 2026**.
+>
+> **Please migrate to the new packages:**
+> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))
+> - **TypeScript**: `npm install @llamaindex/llama-cloud` ([GitHub](https://github.com/run-llama/llama-cloud-ts))
+>
+> The new packages provide the same functionality with improved performance, better support, and active development.
+

 This repository contains the code for hand-written SDKs and clients for interacting with LlamaCloud.

@@ -1,4 +1,14 @@
 # LlamaCloud Services Examples - Python
+> **⚠️ DEPRECATION NOTICE**
+>
+> This repository and its packages are deprecated and will be maintained until **May 1, 2026**.
+>
+> **Please migrate to the new packages:**
+> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))
+> - **TypeScript**: `npm install @llamaindex/llama-cloud` ([GitHub](https://github.com/run-llama/llama-cloud-ts))
+>
+> The new packages provide the same functionality with improved performance, better support, and active development.
+

 In this folder you will find several python notebooks that contain examples regarding:

@@ -17,6 +17,14 @@
    "This is useful when you need to parse many documents at once, as the batch API handles the orchestration and provides progress tracking."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "0c2b5e1a",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "cell-1",
@@ -16,6 +16,14 @@
    "![](asset_manager_fund_analysis.png)\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "cbafd7ee",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "cda2e5e9-fe9d-42d9-9387-f529d970ff7b",
@@ -20,6 +20,14 @@
    "This workflow is designed for equity research analysts and investment professionals."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "e7979faf",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -19,6 +19,13 @@
    "The example we go through below is also replicable within Llama Cloud as well, where you will also be able to pick between a number of pre-defined schemas, instead of building your own."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -15,6 +15,13 @@
    "Dow Jones Industrial Average (DJIA) is a stock market index that consists of 30 large companies listed on the New York Stock Exchange and the NASDAQ and is considered a good proxy for the overall US stock market. For this exercise, we will extract the insider transactions for all the companies in the DJIA. Let's first get the list of tickers in the Dow Jones Industrial Average using Wikipedia."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -16,6 +16,14 @@
    "This approach reduces manual data entry, improves extraction accuracy and standardization, and provides traceability for each technical detail."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "8d1efe6e",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "a3b8c8d5-ff3e-48ce-b0b8-29b6b1f517f8",
@@ -11,6 +11,13 @@
    "Take a look at one of the resumes in the `data/resumes` directory. "
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -20,6 +20,14 @@
    "> **Note:** This principle of what fields generalize across your target documents and what might be optional is an important one to keep in mind when designing your schema. \n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "355adfd4",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -21,6 +21,14 @@
    "The following notebook uses the event‑driven syntax (with custom events, steps, and a workflow class) adapted from the technical datasheet and contract review examples."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "ab7be988",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "36d8e34e-ed98-46ac-b744-1642f6e253d5",
@@ -35,6 +35,14 @@
    "📖 For more details, see the [Extraction Target documentation](https://developers.llamaindex.ai/python/cloud/llamaextract/features/concepts/#extraction-target)."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "cb760594",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -31,6 +31,13 @@
    "| Sep-02-2025  | 0.6.62  | Active     |\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -19,6 +19,13 @@
    "The workflow is implemented as a proper LlamaIndex Workflow with separate steps for parsing, classification, and extraction, connected by typed events. This provides modularity, observability, and type safety."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -27,6 +27,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "e2b422f5",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "2e4f707a-c7b5-473f-b4a6-881e2245e82d",
@@ -14,6 +14,13 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -1,4 +1,9 @@
 """
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
+"""
 Example: Batch Processing a Folder of PDFs with LlamaParse

 This script demonstrates how to process multiple PDFs from a folder
@@ -17,6 +17,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "0cb82ca8",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "ef115dbe-b834-4639-828e-e2c11aef710b",
@@ -18,6 +18,13 @@
    "| Aug-18-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -14,6 +14,13 @@
    "| Aug-18-2025   | N/A     | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -14,6 +14,13 @@
    "| Aug-18-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -18,6 +18,13 @@
    "| Aug-18-2025   | 0.6.61  | Maintained |\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -19,6 +19,14 @@
    "| Aug-18-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "bb595498",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "a004db48-8d3f-421c-915a-477692f71b90",
@@ -16,6 +16,13 @@
    "| Aug-19-2025   | 0.6.61  | Deprecated |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "8b937443",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "a004db48-8d3f-421c-915a-477692f71b90",
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "037cc6d9",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "a004db48-8d3f-421c-915a-477692f71b90",
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "7aa3be47",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -21,6 +21,13 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -8,6 +8,14 @@
    "<a href=\"https://colab.research.google.com/github/run-llama/llama_cloud_services/blob/main/examples/parse/demo_starter_multimodal.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "da52cfa3",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "4e081457",
@@ -7,6 +7,13 @@
    "<a href=\"https://colab.research.google.com/github/run-llama/llama_cloud_services/blob/main/examples/parse/demo_starter_parse_selected_pages.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -14,6 +14,13 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -17,6 +17,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "a3636937",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "5f7d99ad-6ebd-47d0-92a7-566630b0c22a",
@@ -7,6 +7,13 @@
    "<a href=\"https://colab.research.google.com/github/run-llama/llama_cloud_services/blob/main/examples/parse/excel/o1_excel_rag.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -17,6 +17,14 @@
    "| Before Feb 2025   | N/A  | Deprecated |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "0facb0b9",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "e8db8ac2-5221-44de-a53e-cb5ab37ac8f5",
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "bb943339",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "17e62444",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -19,6 +19,14 @@
    "| Aug-19-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "fe7e837a",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "15e60ecf-519c-41fc-911b-765adaf8bad4",
@@ -9,6 +9,13 @@
    "<a href=\"https://colab.research.google.com/github/run-llama/llama_cloud_services/blob/main/examples/parse/multimodal/insurance_rag.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -23,6 +23,13 @@
    "- [US Immigration Case](https://github.com/user-attachments/files/16536446/us_immigration_case.pdf)"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -27,6 +27,14 @@
    "![mm_rag_diagram](./multimodal_contextual_retrieval_rag_img.png)"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "93d4f9ab",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "54e8d9a7-5036-4d32-818f-00b2e888521f",
@@ -27,6 +27,14 @@
    "![mm_rag_diagram](./multimodal_rag_slide_deck_img.png)"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "fc1b5803",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "54e8d9a7-5036-4d32-818f-00b2e888521f",
@@ -19,6 +19,14 @@
    "| Aug-20-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "7dafd458",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "54e8d9a7-5036-4d32-818f-00b2e888521f",
@@ -21,6 +21,14 @@
    "We use our workflow abstraction to define an agentic system that contains two main phases: a research phase that pulls in relevant files through chunk-level or file-level retrieval, and then a blog generation phase that synthesizes the final report."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "8c881021",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "54e8d9a7-5036-4d32-818f-00b2e888521f",
@@ -9,6 +9,13 @@
    "<a href=\"https://colab.research.google.com/github/run-llama/llama_cloud_services/blob/main/examples/parse/multimodal/product_manual_rag.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -19,6 +19,14 @@
    "| Prior to Feb-2025   | N/A  | Deprecated |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "b27f0e78",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -14,6 +14,13 @@
    "| Prior to Feb-2025   | N/A  | Deprecated |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -29,6 +29,13 @@
    "In this demonstration, we showcase how parsing instructions can be used to extract specific information from unstructured documents. Using a McDonald's Receipt, we show how to ignore parts of the document and only parse the price of each order and the final amount to be paid."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -18,6 +18,13 @@
    "Many documents can have varying complexity across pages - some pages have text, and other pages have images. The text-only pages only require cheap parsing modes, whereas the image-based pages require more advanced modes. In this notebook we show you how to take advantage of \"auto mode\" in LlamaParse which adaptively parses different pages according to different modes, which lets you get optimal performance at the cheapest cost.\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -37,6 +37,13 @@
    "With visual references, you can build applications that preserve document structure and provide users with trustworthy, traceable visual citations. We will now leverage this feature to build our query engine."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -24,6 +24,13 @@
    "| Aug-18-2025   | 0.6.61  | Maintained |"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -26,6 +26,14 @@
    "We use LlamaParse to parse the context documents as well as the RFP document itself."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "ad140aef",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -22,6 +22,14 @@
    "**NOTE**: The pricing for LlamaParse + gpt4o is an order more expensive than using LlamaParse by default. Currently, every page parsed with gpt4o counts for 10 pages in the LlamaParse usage tracker.\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "211c52fe",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -1,4 +1,9 @@
 """
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
+"""
 Generate sample spreadsheets for LlamaSheets + Claude workflows.

 This script creates example Excel files that demonstrate different use cases:
@@ -1,3 +1,8 @@
+"""
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
 """Helper script to extract spreadsheets using LlamaSheets."""

 import asyncio
@@ -1,4 +1,9 @@
 """
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
+"""
 Generate sample spreadsheets for LlamaSheets + LlamaIndex Agent workflows.

 This script creates example Excel files that demonstrate different use cases:
@@ -1,4 +1,9 @@
 """
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
+"""
 LlamaSheets Agent with LlamaIndex

 This example shows how to build an agent that can work with spreadsheet data
@@ -1,3 +1,8 @@
+"""
+⚠️ DEPRECATION NOTICE:
+This example uses the deprecated llama-cloud-services package, which will be maintained until May 1, 2026.
+Please migrate to: pip install llama-cloud>=1.0 (https://github.com/run-llama/llama-cloud-py)
+"""
 """Helper script to extract spreadsheets using LlamaSheets."""

 import asyncio
@@ -26,6 +26,13 @@
    "We'll split this into segments categorized as either `essay` or `research_paper`.\n"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> **⚠️ DEPRECATION NOTICE**>> This example uses the deprecated `llama-cloud-services` package, which will be maintained until **May 1, 2026**.>> **Please migrate to:**> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))> - **New Package Documentation**: https://docs.cloud.llamaindex.ai/>> The new package provides the same functionality with improved performance and support."
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
@@ -1,5 +1,23 @@
 # llama-cloud-services-py

+## 0.6.94
+
+### Patch Changes
+
+- 232c55b: Include xlsx files in extract input
+
+## 0.6.93
+
+### Patch Changes
+
+- da1916c: Add more warnings
+
+## 0.6.92
+
+### Patch Changes
+
+- 2358df1: add deprecation notices
+
 ## 0.6.91

 ### Patch Changes
@@ -4,6 +4,16 @@

 # Llama Cloud Services

+> **⚠️ DEPRECATION NOTICE**
+>
+> This repository and its packages are deprecated and will be maintained until **May 1, 2026**.
+>
+> **Please migrate to the new packages:**
+> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))
+> - **TypeScript**: `npm install @llamaindex/llama-cloud` ([GitHub](https://github.com/run-llama/llama-cloud-ts))
+>
+> The new packages provide the same functionality with improved performance, better support, and active development.
+
 This repository contains the code for hand-written SDKs and clients for interacting with LlamaCloud.

 This includes:
@@ -1,3 +1,5 @@
+import warnings
+
 from llama_cloud_services.parse import LlamaParse
 from llama_cloud_services.extract import LlamaExtract, ExtractionAgent
 from llama_cloud_services.utils import SourceText, FileInput
@@ -8,6 +10,16 @@ from llama_cloud_services.index import (
    LlamaCloudRetriever,
 )

+# Emit deprecation warning once when package is imported
+warnings.warn(
+    "This package (llama-cloud-services) is deprecated and will be maintained until May 1, 2026. "
+    "Please migrate to the new package: pip install llama-cloud>=1.0 "
+    "(https://github.com/run-llama/llama-cloud-py). "
+    "The new package provides the same functionality with improved performance and support.",
+    DeprecationWarning,
+    stacklevel=2,
+)
+
 __all__ = [
    "LlamaParse",
    "LlamaExtract",
@@ -4,10 +4,11 @@ import os
 import time
 from io import BufferedIOBase, TextIOWrapper
 from pathlib import Path
-from typing import List, Optional, Type, Union, Coroutine, Any, TypeVar
+from typing import Callable, List, Optional, Type, Union, Coroutine, Any, TypeVar
 import warnings
 import httpx
 from pydantic import BaseModel
+from functools import wraps
 from tenacity import (
    retry_if_exception,
    stop_after_attempt,
@@ -54,7 +55,7 @@ DEFAULT_EXTRACT_CONFIG = ExtractConfig(
 def _is_retryable_error(exception: BaseException) -> bool:
    """Check if an exception is retryable."""
    if isinstance(exception, ApiError):
-        return exception.status_code in (502, 503, 504, 425, 408)
+        return exception.status_code in (429, 500, 502, 503, 504, 425, 408)
    elif isinstance(
        exception, (httpx.HTTPStatusError, httpx.RequestError, httpx.TimeoutException)
    ):
@@ -62,6 +63,33 @@ def _is_retryable_error(exception: BaseException) -> bool:
    return False


+def _async_retry(
+    max_attempts: int = 5,
+    initial_wait: float = 1,
+    max_wait: float = 30,
+    jitter: float = 3,
+) -> Callable:
+    """Decorator for async functions with retry logic for rate limiting and transient errors."""
+
+    def decorator(func: Callable) -> Callable:
+        @wraps(func)
+        async def wrapper(*args: Any, **kwargs: Any) -> Any:
+            async for attempt in AsyncRetrying(
+                retry=retry_if_exception(_is_retryable_error),
+                stop=stop_after_attempt(max_attempts),
+                wait=wait_exponential_jitter(
+                    initial=initial_wait, max=max_wait, jitter=jitter
+                ),
+                reraise=True,
+            ):
+                with attempt:
+                    return await func(*args, **kwargs)
+
+        return wrapper
+
+    return decorator
+
+
 async def _validate_schema(
    client: AsyncLlamaCloud, data_schema: SchemaInput
 ) -> JSONObjectType:
@@ -82,50 +110,6 @@ async def _validate_schema(
    return validated_schema.data_schema


-async def _get_job_with_retry(
-    client: AsyncLlamaCloud,
-    job_id: str,
-    max_attempts: int = 5,
-    initial_wait: float = 1,
-    max_wait: float = 60,
-    jitter: float = 5,
-) -> ExtractJob:
-    """Get extraction job with retry logic."""
-    async for attempt in AsyncRetrying(
-        retry=retry_if_exception(_is_retryable_error),
-        stop=stop_after_attempt(max_attempts),
-        wait=wait_exponential_jitter(initial=initial_wait, max=max_wait, jitter=jitter),
-        reraise=True,
-    ):
-        with attempt:
-            return await client.llama_extract.get_job(job_id=job_id)
-
-
-async def _get_run_with_retry(
-    client: AsyncLlamaCloud,
-    job_id: str,
-    project_id: Optional[str] = None,
-    organization_id: Optional[str] = None,
-    max_attempts: int = 3,
-    initial_wait: float = 1,
-    max_wait: float = 20,
-    jitter: float = 3,
-) -> ExtractRun:
-    """Get extraction run with retry logic."""
-    async for attempt in AsyncRetrying(
-        retry=retry_if_exception(_is_retryable_error),
-        stop=stop_after_attempt(max_attempts),
-        wait=wait_exponential_jitter(initial=initial_wait, max=max_wait, jitter=jitter),
-        reraise=True,
-    ):
-        with attempt:
-            return await client.llama_extract.get_run_by_job_id(
-                job_id=job_id,
-                project_id=project_id,
-                organization_id=organization_id,
-            )
-
-
 async def _wait_for_job_result(
    client: AsyncLlamaCloud,
    job_id: str,
@@ -137,35 +121,68 @@ async def _wait_for_job_result(
    job_retry_attempts: int = 5,
    job_max_wait: float = 60,
    job_jitter: float = 5,
-    run_retry_attempts: int = 3,
-    run_max_wait: float = 20,
+    run_retry_attempts: int = 5,
+    run_max_wait: float = 30,
    run_jitter: float = 3,
+    data_availability_retries: int = 3,
+    data_availability_initial_delay: float = 2.0,
 ) -> Optional[ExtractRun]:
-    """Wait for and return the results of an extraction job."""
+    """Wait for and return the results of an extraction job.
+
+    Includes resilience against a known S3 race condition on the backend where
+    the database may be updated with a SUCCESS status before the S3 data write
+    fully propagates. This can cause transient 500 errors or runs with null data.
+    The retry parameters and data availability checks mitigate this.
+    """
+
+    @_async_retry(
+        max_attempts=job_retry_attempts, max_wait=job_max_wait, jitter=job_jitter
+    )
+    async def _get_job() -> ExtractJob:
+        return await client.llama_extract.get_job(job_id=job_id)
+
+    @_async_retry(
+        max_attempts=run_retry_attempts, max_wait=run_max_wait, jitter=run_jitter
+    )
+    async def _get_run() -> ExtractRun:
+        return await client.llama_extract.get_run_by_job_id(
+            job_id=job_id,
+            project_id=project_id,
+            organization_id=organization_id,
+        )
+
+    async def _get_run_with_data_check() -> ExtractRun:
+        """Fetch the extraction run, retrying if data is missing due to S3 race condition.
+
+        When the backend has status SUCCESS but the S3 data write hasn't propagated,
+        the run may come back with data=None or the API may return a 500 error.
+        The 500 case is handled by the @_async_retry on _get_run(). This wrapper
+        handles the data=None case by retrying with exponential backoff.
+        """
+        for attempt in range(data_availability_retries):
+            run = await _get_run()
+            if run.data is not None or run.status != StatusEnum.SUCCESS:
+                return run
+            delay = data_availability_initial_delay * (2**attempt)
+            warnings.warn(
+                f"Extraction run for job {job_id} has status SUCCESS but data is "
+                f"not yet available (possible S3 race condition). "
+                f"Retrying in {delay:.1f}s "
+                f"(attempt {attempt + 1}/{data_availability_retries})..."
+            )
+            await asyncio.sleep(delay)
+        return await _get_run()
+
    start = time.perf_counter()
    poll_count = 0

    while True:
        await asyncio.sleep(check_interval)
        poll_count += 1
-        job = await _get_job_with_retry(
-            client,
-            job_id,
-            max_attempts=job_retry_attempts,
-            max_wait=job_max_wait,
-            jitter=job_jitter,
-        )
+        job = await _get_job()

        if job.status == StatusEnum.SUCCESS:
-            return await _get_run_with_retry(
-                client,
-                job_id,
-                project_id,
-                organization_id,
-                max_attempts=run_retry_attempts,
-                max_wait=run_max_wait,
-                jitter=run_jitter,
-            )
+            return await _get_run_with_data_check()
        elif job.status == StatusEnum.PENDING:
            end = time.perf_counter()
            if end - start > max_timeout:
@@ -177,15 +194,7 @@ async def _wait_for_job_result(
            warnings.warn(
                f"Failure in job: {job_id}, status: {job.status}, error: {job.error}"
            )
-            return await _get_run_with_retry(
-                client,
-                job_id,
-                project_id,
-                organization_id,
-                max_attempts=run_retry_attempts,
-                max_wait=run_max_wait,
-                jitter=run_jitter,
-            )
+            return await _get_run()


 def run_in_thread(
@@ -344,8 +353,8 @@ class ExtractionAgent:
            job_retry_attempts=5,
            job_max_wait=60,
            job_jitter=5,
-            run_retry_attempts=3,
-            run_max_wait=20,
+            run_retry_attempts=5,
+            run_max_wait=30,
            run_jitter=3,
        )

@@ -498,9 +507,12 @@ class ExtractionAgent:
        Args:
            run_id (str): The ID of the extraction run to delete
        """
-        self._run_in_thread(
-            self._client.llama_extract.delete_extraction_run(run_id=run_id)
-        )
+
+        @_async_retry()
+        async def _delete() -> None:
+            return await self._client.llama_extract.delete_extraction_run(run_id=run_id)
+
+        self._run_in_thread(_delete())

    def list_extraction_runs(
        self, page: int = 0, limit: int = 100
@@ -510,13 +522,16 @@ class ExtractionAgent:
        Returns:
            PaginatedExtractRunsResponse: Paginated list of extraction runs
        """
-        return self._run_in_thread(
-            self._client.llama_extract.list_extract_runs(
+
+        @_async_retry()
+        async def _list() -> PaginatedExtractRunsResponse:
+            return await self._client.llama_extract.list_extract_runs(
                extraction_agent_id=self.id,
                skip=page * limit,
                limit=limit,
            )
-        )
+
+        return self._run_in_thread(_list())

    def __repr__(self) -> str:
        return f"ExtractionAgent(id={self.id}, name={self.name})"
@@ -658,15 +673,17 @@ class LlamaExtract(BaseComponent):
                "data_schema must be either a dictionary or a Pydantic model"
            )

-        agent = self._run_in_thread(
-            self._async_client.llama_extract.create_extraction_agent(
+        @_async_retry()
+        async def _create() -> CloudExtractAgent:
+            return await self._async_client.llama_extract.create_extraction_agent(
                project_id=self._project_id,
                organization_id=self._organization_id,
                name=name,
                data_schema=data_schema,
                config=config,
            )
-        )
+
+        agent = self._run_in_thread(_create())

        return ExtractionAgent(
            client=self._async_client,
@@ -702,19 +719,27 @@ class LlamaExtract(BaseComponent):
            )

        if id:
-            agent = self._run_in_thread(
-                self._async_client.llama_extract.get_extraction_agent(
+
+            @_async_retry()
+            async def _get_by_id() -> CloudExtractAgent:
+                return await self._async_client.llama_extract.get_extraction_agent(
                    extraction_agent_id=id,
                )
-            )
+
+            agent = self._run_in_thread(_get_by_id())

        elif name:
-            agent = self._run_in_thread(
-                self._async_client.llama_extract.get_extraction_agent_by_name(
-                    name=name,
-                    project_id=self._project_id,
+
+            @_async_retry()
+            async def _get_by_name() -> CloudExtractAgent:
+                return (
+                    await self._async_client.llama_extract.get_extraction_agent_by_name(
+                        name=name,
+                        project_id=self._project_id,
+                    )
                )
-            )
+
+            agent = self._run_in_thread(_get_by_name())
        else:
            raise ValueError("Either name or extraction_agent_id must be provided.")

@@ -734,11 +759,14 @@ class LlamaExtract(BaseComponent):

    def list_agents(self) -> List[ExtractionAgent]:
        """List all available extraction agents."""
-        agents = self._run_in_thread(
-            self._async_client.llama_extract.list_extraction_agents(
+
+        @_async_retry()
+        async def _list() -> List[CloudExtractAgent]:
+            return await self._async_client.llama_extract.list_extraction_agents(
                project_id=self._project_id,
            )
-        )
+
+        agents = self._run_in_thread(_list())

        return [
            ExtractionAgent(
@@ -763,11 +791,14 @@ class LlamaExtract(BaseComponent):
        Args:
            agent_id (str): ID of the extraction agent to delete
        """
-        self._run_in_thread(
-            self._async_client.llama_extract.delete_extraction_agent(
-                extraction_agent_id=agent_id
+
+        @_async_retry()
+        async def _delete() -> None:
+            return await self._async_client.llama_extract.delete_extraction_agent(
+                extraction_agent_id=agent_id,
            )
-        )
+
+        self._run_in_thread(_delete())

    async def _wait_for_job_result(self, job_id: str) -> Optional[ExtractRun]:
        """Wait for and return the results of an extraction job."""
@@ -782,8 +813,8 @@ class LlamaExtract(BaseComponent):
            job_retry_attempts=3,
            job_max_wait=4,
            job_jitter=5,
-            run_retry_attempts=3,
-            run_max_wait=4,
+            run_retry_attempts=5,
+            run_max_wait=30,
            run_jitter=3,
        )

@@ -805,6 +836,7 @@ class LlamaExtract(BaseComponent):
            # Document files
            ".pdf": "application/pdf",
            ".docx": "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
+            ".xlsx": "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet",
            # Image files
            ".png": "image/png",
            ".jpg": "image/jpeg",
@@ -1,5 +1,28 @@
 # llama_parse

+## 0.6.94
+
+### Patch Changes
+
+- 232c55b: Include xlsx files in extract input
+- Updated dependencies [232c55b]
+  - llama-cloud-services-py@0.6.94
+
+## 0.6.93
+
+### Patch Changes
+
+- da1916c: Add more warnings
+- Updated dependencies [da1916c]
+  - llama-cloud-services-py@0.6.93
+
+## 0.6.92
+
+### Patch Changes
+
+- Updated dependencies [2358df1]
+  - llama-cloud-services-py@0.6.92
+
 ## 0.6.91

 ### Patch Changes
@@ -1,5 +1,15 @@
 # LlamaParse

+> **⚠️ DEPRECATION NOTICE**
+>
+> This repository and its packages are deprecated and will be maintained until **May 1, 2026**.
+>
+> **Please migrate to the new packages:**
+> - **Python**: `pip install llama-cloud>=1.0` ([GitHub](https://github.com/run-llama/llama-cloud-py))
+> - **TypeScript**: `npm install @llamaindex/llama-cloud` ([GitHub](https://github.com/run-llama/llama-cloud-ts))
+>
+> The new packages provide the same functionality with improved performance, better support, and active development.
+
 [![PyPI - Downloads](https://img.shields.io/pypi/dm/llama-parse)](https://pypi.org/project/llama-parse/)
 [![GitHub contributors](https://img.shields.io/github/contributors/run-llama/llama_parse)](https://github.com/run-llama/llama_parse/graphs/contributors)
 [![Discord](https://img.shields.io/discord/1059199217496772688)](https://discord.gg/dGcwcsnxhU)
@@ -1,8 +1,18 @@
-from llama_cloud_services.parse import (
+import warnings
+from llama_cloud_services.parse import (  # type: ignore[attr-defined]
    LlamaParse,
    ResultType,
    ParsingMode,
    FailedPageMode,
 )

+warnings.warn(
+    "The 'llama-parse' package is deprecated and will no longer receive updates. "
+    "Please migrate to the new unified SDK. "
+    "See https://developers.llamaindex.ai/python/cloud/llamaparse/getting_started/ "
+    "and https://github.com/run-llama/llama-cloud-py/blob/main/README.md for migration instructions.",
+    DeprecationWarning,
+    stacklevel=2,
+)
+
 __all__ = ["LlamaParse", "ResultType", "ParsingMode", "FailedPageMode"]
@@ -1,6 +1,6 @@
 {
  "name": "llama_parse",
-  "version": "0.6.91",
+  "version": "0.6.94",
  "description": "",
  "main": "index.js",
  "private": false,
@@ -11,13 +11,13 @@ dev = [

 [project]
 name = "llama-parse"
-version = "0.6.91"
+version = "0.6.94"
 description = "Parse files into RAG-Optimized formats."
 authors = [{name = "Logan Markewich", email = "logan@llamaindex.ai"}]
 requires-python = ">=3.9,<4.0"
 readme = "README.md"
 license = "MIT"
-dependencies = ["llama-cloud-services>=0.6.91"]
+dependencies = ["llama-cloud-services>=0.6.94"]

 [project.scripts]
 llama-parse = "llama_parse.cli.main:parse"
@@ -1,6 +1,6 @@
 {
  "name": "llama-cloud-services-py",
-  "version": "0.6.91",
+  "version": "0.6.94",
  "private": false,
  "license": "MIT",
  "scripts": {},
@@ -23,7 +23,7 @@ dev = [

 [project]
 name = "llama-cloud-services"
-version = "0.6.91"
+version = "0.6.94"
 description = "Tailored SDK clients for LlamaCloud services."
 authors = [{name = "Logan Markewich", email = "logan@runllama.ai"}]
 requires-python = ">=3.9,<4.0"
@@ -1,5 +1,4 @@
-import os
-from typing import Any, Dict, List, Optional, Union
+from typing import Any, Dict, Optional, Union

 from llama_cloud.core.api_error import ApiError
 from llama_cloud.types import ExtractConfig
@@ -13,9 +12,6 @@ from tenacity import (

 from llama_cloud_services.extract import ExtractionAgent, LlamaExtract

-# Global storage for agents to cleanup
-_TEST_AGENTS_TO_CLEANUP: List[str] = []
-

 def _is_rate_limit_error(exception: BaseException) -> bool:
    """Check if the exception is a rate limit error (429)."""
@@ -42,38 +38,3 @@ def pytest_configure(config):
    """Register custom markers for extract tests."""
    config.addinivalue_line("markers", "agent_name: custom agent name for test")
    config.addinivalue_line("markers", "agent_schema: custom agent schema for test")
-
-
-def pytest_sessionfinish(session, exitstatus):
-    """Hook that runs after all tests complete - cleanup agents here"""
-    print(
-        f"pytest_sessionfinish hook called! Agents to cleanup: {_TEST_AGENTS_TO_CLEANUP}"
-    )
-
-    if _TEST_AGENTS_TO_CLEANUP:
-        print("Creating cleanup client...")
-        # Create a fresh client just for cleanup
-        cleanup_client = LlamaExtract(
-            api_key=os.getenv("LLAMA_CLOUD_API_KEY"),
-            base_url=os.getenv("LLAMA_CLOUD_BASE_URL"),
-            project_id=os.getenv("LLAMA_CLOUD_PROJECT_ID"),
-            verbose=True,
-        )
-
-        for agent_id in _TEST_AGENTS_TO_CLEANUP:
-            try:
-                print(f"Deleting agent {agent_id}...")
-                cleanup_client.delete_agent(agent_id)
-                print(f"Cleaned up agent {agent_id}")
-            except Exception as e:
-                print(f"Warning: Failed to delete agent {agent_id}: {e}")
-
-        _TEST_AGENTS_TO_CLEANUP.clear()
-        print("Agent cleanup completed")
-    else:
-        print("No agents to cleanup")
-
-
-def register_agent_for_cleanup(agent_id: str):
-    """Register an agent ID for cleanup at the end of the test session"""
-    _TEST_AGENTS_TO_CLEANUP.append(agent_id)
@@ -1,4 +1,6 @@
 import os
+import shutil
+import uuid
 import pytest
 from pathlib import Path
 from pydantic import BaseModel
@@ -6,7 +8,7 @@ from pydantic import BaseModel
 from llama_cloud_services.extract import LlamaExtract, ExtractionAgent, SourceText
 from llama_cloud.types import ExtractConfig, ExtractMode, ExtractRun
 from tests.extract.util import load_test_dotenv
-from .conftest import register_agent_for_cleanup, create_agent_with_retry
+from .conftest import create_agent_with_retry

 load_test_dotenv()

@@ -59,17 +61,27 @@ def test_schema_dict():


@pytest.fixture
-def test_agent(llama_extract, test_agent_name, test_schema_dict, request):
-    """Creates a test agent and collects it for cleanup at the end of all tests"""
-    test_id = request.node.nodeid
-    test_hash = hex(hash(test_id))[-8:]
-    base_name = test_agent_name
+def unique_test_pdf(tmp_path):
+    """Copy test PDF to a unique path to avoid file deduplication across parallel tests.

+    Uses a UUID in the filename so that external_file_id is unique regardless of
+    whether the full path or just the filename is sent to the backend.
+    """
+    unique_name = f"{TEST_PDF.stem}-{uuid.uuid4().hex[:8]}{TEST_PDF.suffix}"
+    unique_pdf = tmp_path / unique_name
+    shutil.copy2(TEST_PDF, unique_pdf)
+    return unique_pdf
+
+
+@pytest.fixture
+def test_agent(llama_extract, test_agent_name, test_schema_dict, request):
+    """Creates a test agent with a unique name and cleans it up after the test."""
+    unique_id = uuid.uuid4().hex[:8]
    base_name = next(
        (marker.args[0] for marker in request.node.iter_markers("agent_name")),
-        base_name,
+        test_agent_name,
    )
-    name = f"{base_name}_{test_hash}"
+    name = f"{base_name}_{unique_id}"

    schema = next(
        (
@@ -79,21 +91,20 @@ def test_agent(llama_extract, test_agent_name, test_schema_dict, request):
        test_schema_dict,
    )

-    # Cleanup existing agent
-    try:
-        for agent in llama_extract.list_agents():
-            if agent.name == name:
-                llama_extract.delete_agent(agent.id)
-    except Exception as e:
-        print(f"Warning: Failed to cleanup existing agent: {e}")
-
-    agent = create_agent_with_retry(llama_extract, name=name, data_schema=schema)
-
-    # Add agent to cleanup list via conftest helper
-    register_agent_for_cleanup(agent.id)
+    # Use config with cache invalidation to ensure fresh results in tests
+    config = ExtractConfig(invalidate_cache=True)
+    agent = create_agent_with_retry(
+        llama_extract, name=name, data_schema=schema, config=config
+    )

    yield agent

+    # Inline cleanup -- each worker cleans up its own agents
+    try:
+        llama_extract.delete_agent(agent.id)
+    except Exception as e:
+        print(f"Warning: Failed to cleanup agent {agent.id}: {e}")
+

 class TestLlamaExtract:
    def test_init_without_api_key(self):
@@ -134,34 +145,38 @@ class TestLlamaExtract:

 class TestExtractionAgent:
    @pytest.mark.asyncio
-    async def test_extract_single_file(self, test_agent):
-        result = await test_agent.aextract(TEST_PDF)
+    async def test_extract_single_file(self, test_agent, unique_test_pdf):
+        result = await test_agent.aextract(unique_test_pdf)
        assert result.status == "SUCCESS"
        assert result.data is not None
        assert isinstance(result.data, dict)
        assert "title" in result.data
        assert "summary" in result.data

-    def test_sync_extract_single_file(self, test_agent):
-        result = test_agent.extract(TEST_PDF)
+    def test_sync_extract_single_file(self, test_agent, unique_test_pdf):
+        result = test_agent.extract(unique_test_pdf)
        assert result.status == "SUCCESS"
        assert result.data is not None
        assert isinstance(result.data, dict)
        assert "title" in result.data
        assert "summary" in result.data

-    def test_extract_file_from_buffered_io(self, test_agent):
-        result = test_agent.extract(SourceText(file=open(TEST_PDF, "rb")))
+    def test_extract_file_from_buffered_io(self, test_agent, unique_test_pdf):
+        result = test_agent.extract(
+            SourceText(file=open(unique_test_pdf, "rb"), filename=unique_test_pdf.name)
+        )
        assert result.status == "SUCCESS"
        assert result.data is not None
        assert isinstance(result.data, dict)
        assert "title" in result.data
        assert "summary" in result.data

-    def test_extract_file_from_bytes(self, test_agent):
-        with open(TEST_PDF, "rb") as f:
+    def test_extract_file_from_bytes(self, test_agent, unique_test_pdf):
+        with open(unique_test_pdf, "rb") as f:
            file_bytes = f.read()
-        result = test_agent.extract(SourceText(file=file_bytes, filename=TEST_PDF.name))
+        result = test_agent.extract(
+            SourceText(file=file_bytes, filename=unique_test_pdf.name)
+        )
        assert result.status == "SUCCESS"
        assert result.data is not None
        assert isinstance(result.data, dict)
@@ -177,7 +192,10 @@ class TestExtractionAgent:
        weight for 8 to 13 km (5–8 miles).[3] The name llama (also historically spelled
        "glama") was adopted by European settlers from native Peruvians.
        """
-        result = test_agent.extract(SourceText(text_content=TEST_TEXT))
+        unique_name = f"text-{uuid.uuid4().hex[:8]}.txt"
+        result = test_agent.extract(
+            SourceText(text_content=TEST_TEXT, filename=unique_name)
+        )
        assert result.status == "SUCCESS"
        assert result.data is not None
        assert isinstance(result.data, dict)
@@ -185,8 +203,8 @@ class TestExtractionAgent:
        assert "summary" in result.data

    @pytest.mark.asyncio
-    async def test_extract_multiple_files(self, test_agent):
-        files = [TEST_PDF, TEST_PDF]  # Using same file twice for testing
+    async def test_extract_multiple_files(self, test_agent, unique_test_pdf):
+        files = [unique_test_pdf, unique_test_pdf]  # Using same file twice for testing
        response = await test_agent.aextract(files)

        assert len(response) == 2
@@ -215,15 +233,15 @@ class TestExtractionAgent:
        updated_agent = llama_extract.get_agent(name=test_agent.name)
        assert "new_field" in updated_agent.data_schema["properties"]

-    def test_list_extraction_runs(self, test_agent: ExtractionAgent):
+    def test_list_extraction_runs(self, test_agent: ExtractionAgent, unique_test_pdf):
        assert test_agent.list_extraction_runs().total == 0
-        test_agent.extract(TEST_PDF)
+        test_agent.extract(unique_test_pdf)
        runs = test_agent.list_extraction_runs()
        assert runs.total > 0

-    def test_delete_extraction_run(self, test_agent: ExtractionAgent):
+    def test_delete_extraction_run(self, test_agent: ExtractionAgent, unique_test_pdf):
        assert test_agent.list_extraction_runs().total == 0
-        run: ExtractRun = test_agent.extract(TEST_PDF)
+        run: ExtractRun = test_agent.extract(unique_test_pdf)
        test_agent.delete_extraction_run(run.id)
        runs = test_agent.list_extraction_runs()
        assert runs.total == 0
@@ -237,7 +255,7 @@ class TestStatelessExtraction:

    @pytest.fixture
    def test_config(self):
-        return ExtractConfig(extraction_mode=ExtractMode.FAST)
+        return ExtractConfig(extraction_mode=ExtractMode.FAST, invalidate_cache=True)

    @pytest.fixture
    def test_schema_dict(self):
@@ -1,14 +1,16 @@
 import os
+from pathlib import Path
 import pytest

 from llama_cloud_services.extract import LlamaExtract, ExtractionAgent
+from llama_cloud_services.utils import SourceText
 from collections import namedtuple
 import json
 import uuid
 from llama_cloud.types import ExtractConfig, ExtractMode
 from deepdiff import DeepDiff
 from tests.extract.util import json_subset_match_score, load_test_dotenv
-from .conftest import register_agent_for_cleanup, create_agent_with_retry
+from .conftest import create_agent_with_retry

 load_test_dotenv()

@@ -56,10 +58,16 @@ def get_test_cases():
            input_files.append(file_path)

        settings = [
-            ExtractConfig(extraction_mode=ExtractMode.FAST),
-            ExtractConfig(extraction_mode=ExtractMode.BALANCED),
-            ExtractConfig(extraction_mode=ExtractMode.MULTIMODAL),
-            ExtractConfig(extraction_mode=ExtractMode.PREMIUM),
+            ExtractConfig(extraction_mode=ExtractMode.FAST, invalidate_cache=True),
+            ExtractConfig(extraction_mode=ExtractMode.BALANCED, invalidate_cache=True),
+            ExtractConfig(
+                extraction_mode=ExtractMode.MULTIMODAL, invalidate_cache=True
+            ),
+            ExtractConfig(
+                extraction_mode=ExtractMode.PREMIUM,
+                invalidate_cache=True,
+                parse_model="anthropic-sonnet-4.5",
+            ),
        ]

        for input_file in sorted(input_files):
@@ -101,32 +109,24 @@ def extractor():
@pytest.fixture
 def extraction_agent(test_case: ExtractionTestCase, extractor: LlamaExtract):
    """Fixture to create and cleanup extraction agent for each test."""
-    # Create unique name with random UUID (important for CI to avoid conflicts)
    unique_id = uuid.uuid4().hex[:8]
    agent_name = f"{test_case.name}_{unique_id}"

    with open(test_case.schema_path, "r") as f:
        schema = json.load(f)

-    # Clean up any existing agents with this name
-    try:
-        agents = extractor.list_agents()
-        for agent in agents:
-            if agent.name == agent_name:
-                extractor.delete_agent(agent.id)
-    except Exception as e:
-        print(f"Warning: Failed to cleanup existing agent: {str(e)}")
-
-    # Create new agent with retry logic for rate limiting
    agent = create_agent_with_retry(
        extractor, name=agent_name, data_schema=schema, config=test_case.config
    )

-    # Register agent for cleanup at the end of the test session
-    register_agent_for_cleanup(agent.id)
-
    yield agent

+    # Inline cleanup -- each worker cleans up its own agents
+    try:
+        extractor.delete_agent(agent.id)
+    except Exception as e:
+        print(f"Warning: Failed to cleanup agent {agent.id}: {e}")
+

@pytest.mark.skipif(
    os.environ.get("LLAMA_CLOUD_API_KEY", "") == "",
@@ -136,7 +136,12 @@ def extraction_agent(test_case: ExtractionTestCase, extractor: LlamaExtract):
 def test_extraction(
    test_case: ExtractionTestCase, extraction_agent: ExtractionAgent
 ) -> None:
-    result = extraction_agent.extract(test_case.input_file).data  # type: ignore
+    # Use a unique external_file_id per upload to avoid cross-test collisions.
+    input_path = Path(test_case.input_file)
+    unique_filename = f"{input_path.stem}-{uuid.uuid4().hex}{input_path.suffix}"
+    result = extraction_agent.extract(
+        SourceText(file=str(input_path), filename=unique_filename)
+    ).data  # type: ignore
    with open(test_case.expected_output, "r") as f:
        expected = json.load(f)
    # TODO: fix the saas_slide test
@@ -1,5 +1,11 @@
 # llama-cloud-services

+## 0.5.4
+
+### Patch Changes
+
+- 2358df1: add deprecation notices
+
 ## 0.5.3

 ### Patch Changes
@@ -1,6 +1,6 @@
 {
  "name": "llama-cloud-services",
-  "version": "0.5.3",
+  "version": "0.5.4",
  "type": "module",
  "license": "MIT",
  "scripts": {
@@ -1,3 +1,13 @@
+// Emit deprecation warning once when package is imported
+if (typeof console !== "undefined" && console.warn) {
+  console.warn(
+    "⚠️  DEPRECATION WARNING: This package (llama_cloud_services) is deprecated and will be maintained until May 1, 2026. " +
+      "Please migrate to the new package: npm install @llamaindex/llama-cloud " +
+      "(https://github.com/run-llama/llama-cloud-ts). " +
+      "The new package provides the same functionality with improved performance and support.",
+  );
+}
+
 export { LLamaCloudFileService } from "./LLamaCloudFileService.js";
 export { LlamaCloudIndex } from "./LlamaCloudIndex.js";
 export {
Author	SHA1	Message	Date
Cursor Agent	a87743df0f	fix: add client-side resilience for S3 transaction race condition Address the S3 race condition (LI-5775) where the backend DB may commit a SUCCESS status before the S3 data write fully propagates, causing transient 500 errors or null data when fetching extraction runs. Changes: - Add _get_run_with_data_check() wrapper in _wait_for_job_result that detects SUCCESS runs with null data and retries with exponential backoff (2s, 4s, 8s) to allow S3 writes to propagate - Increase run_retry_attempts from 3 to 5 and run_max_wait from 4/20 to 30 for more resilient handling of transient 500 errors from the backend's S3 read failures - Add warning messages when the race condition pattern is detected to aid debugging Co-authored-by: George He <georgewho96@gmail.com>	2026-02-17 21:33:10 +00:00
Neeraj Pradhan	5ea758b853	More robust extract tests with pytest xdist (#1117 )	2026-02-16 16:16:15 -08:00
dependabot[bot]	208b6f2fa5	build(deps): bump slackapi/slack-github-action from 1.27.0 to 2.1.1 (#1092 ) Bumps [slackapi/slack-github-action](https://github.com/slackapi/slack-github-action) from 1.27.0 to 2.1.1. - [Release notes](https://github.com/slackapi/slack-github-action/releases) - [Commits](https://github.com/slackapi/slack-github-action/compare/v1.27.0...v2.1.1) --- updated-dependencies: - dependency-name: slackapi/slack-github-action dependency-version: 2.1.1 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-02-14 21:03:05 -06:00
github-actions[bot]	e1b9143f79	chore: version packages (#1116 ) Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2026-02-13 15:29:09 -08:00
Neeraj Pradhan	232c55bd6a	Bump up patch version (#1115 )	2026-02-13 15:20:52 -08:00
Neeraj Pradhan	ab6f2f8da5	Allows xlsx files in the sdk for extract (#1114 )	2026-02-13 14:44:25 -08:00
github-actions[bot]	66c2639ec8	chore: version packages (#1112 )	2026-02-11 15:18:43 -06:00
Logan	da1916c69f	more loudly deprecate ancient llama-parse package (#1111 )	2026-02-11 15:16:01 -06:00
Neeraj Pradhan	345e272573	Lower frequency for e2e tests (#1110 )	2026-02-11 09:07:15 -08:00
github-actions[bot]	d70fbac1ce	chore: version packages (#1103 )	2026-02-02 11:46:39 -06:00
Logan	2358df10c6	add notice (don't merge until ready) (#1065 )	2026-02-02 11:42:47 -06:00
Neeraj Pradhan	829628cc86	Use unique filenames when running dist tests (#1101 )	2026-01-30 14:00:27 -08:00
Neeraj Pradhan	42b7bbd1ae	Use sonnet when testing premium mode in extract e2e (#1098 ) * Use sonnet when testing premium mode in extract e2e * fix parse model	2026-01-27 16:16:48 -08:00
Neeraj Pradhan	38da9a52d7	Invalidate cache when running extract tests (#1097 )	2026-01-26 17:33:23 -08:00
Neeraj Pradhan	1e7ec40ee7	Fix verbose logging on slack channel (#1096 )	2026-01-26 17:12:50 -08:00
Neeraj Pradhan	dd83c1a9d0	Add retries to all extract sdk functions uniformly (#1095 )	2026-01-26 12:05:16 -08:00
Neeraj Pradhan	7cb83f5cd3	Change cron schedule for hourly extract tests (#1094 )	2026-01-26 10:15:34 -08:00
Neeraj Pradhan	b05266be6d	Try to reparse scheduled workflow (#1093 )	2026-01-26 09:56:22 -08:00
Neeraj Pradhan	eab4798165	Force github reparse of the workflow (#1090 )	2026-01-23 11:36:28 -08:00
Neeraj Pradhan	b174fa8fab	Run hourly extract tests to catch SDK schema drifts (#1089 ) * Run hourly extract tests to catch SDK schema drifts * fix url * fix prod/staging env	2026-01-22 18:18:45 -08:00