Commit Graph

  • 84ea07029f chore(deps): bump maturin from 1.10.2 to 1.11.5 in /docs (#19740) dependabot[bot] 2026-01-11 22:30:42 +11:00
  • 30c6ff1982 chore: bump dependabot PR limit for cargo from 5 to 15 (#19730) Jeffrey Vo 2026-01-11 20:16:24 +09:00
  • 0c5c97b22a fix(functions-aggregate): drain CORR state vectors for streaming aggregation (#19669) Geoffrey Claude 2026-01-11 08:36:07 +01:00
  • 9fa7500bbc Fix internal error "Physical input schema should be the same as the one converted from logical input schema." (#18412) Andrew Lamb 2026-01-10 22:30:58 -05:00
  • 41a0b85afa Add support for additional numeric types in to_timestamp functions (#19663) Goksel Kabadayi 2026-01-11 06:25:26 +03:00
  • 013efb4fe1 docs: Refine Communication documentation to highlight Discord (#19714) Andrew Lamb 2026-01-10 22:21:26 -05:00
  • 4e0161d99c fix: Don't treat quoted column names as placeholder variables in SQL (#19339) pmallex 2026-01-10 19:19:49 -08:00
  • 458b49109a perf: optimize left function by eliminating double chars() iteration (#19571) Liang-Chi Hsieh 2026-01-10 16:34:41 +08:00
  • 45fb0b4b98 fix(accumulators): preserve state in evaluate() for window frame queries (#19618) Ganesh Patil 2026-01-10 08:13:04 +05:30
  • afc912106b Optimize concat/concat_ws scalar path by pre-allocating memory (#19547) lyne 2026-01-10 10:32:18 +08:00
  • 3087ca8a92 perf: optimize NthValue when ignore_nulls is true (#19496) Mikhail Zabaluev 2026-01-10 04:19:53 +02:00
  • 821d410fc0 feat(spark): Implement collect_list/collect_set aggregate functions (#19699) cht42 2026-01-10 06:17:45 +04:00
  • 5c2b1236b5 feat(spark): implement array_repeat function (#19702) cht42 2026-01-10 06:17:01 +04:00
  • 20870da20d infer parquet file order from metadata and use it to optimize scans (#19433) Adrian Garcia Badaracco 2026-01-09 14:34:36 -05:00
  • 209a0a2e86 fix: unnest struct field with an alias failed with internal error (#19698) Kumar Ujjawal 2026-01-09 21:27:58 +05:30
  • 8ba46466d3 docs: Fix two small issues in introduction.md (#19712) Adam Gutglick 2026-01-09 13:39:02 +00:00
  • 07e63edfa0 Fix TopK aggregation for UTF-8/Utf8View group keys and add safe fallback for unsupported string aggregates (#19285) kosiew 2026-01-09 21:27:31 +08:00
  • b7091c0d2f Optimize Nullstate / accumulators (#19625) Daniël Heres 2026-01-09 11:28:32 +01:00
  • c98fa5616e perfect hash join (#19411) UBarney 2026-01-09 18:08:53 +08:00
  • 62658cd624 implement var distinct (#19706) iamthinh 2026-01-09 01:21:08 -08:00
  • a55b77e7de fix: DynamicFilterPhysicalExpr violates Hash/Eq contract (#19659) Kumar Ujjawal 2026-01-09 04:03:38 +05:30
  • b9a3b9f94f Record sort order when writing Parquet with WITH ORDER (#19595) Adrian Garcia Badaracco 2026-01-08 13:04:17 -05:00
  • 0cf45cae9c Refactor cache APIs to support ordering information (#19597) Adrian Garcia Badaracco 2026-01-08 12:56:41 -05:00
  • 5194fd5eb0 chore(deps): bump taiki-e/install-action from 2.65.13 to 2.65.15 (#19676) dependabot[bot] 2026-01-08 08:29:05 -08:00
  • e6049de5a7 Make default ListingFilesCache table scoped (#19616) jizezhang 2026-01-08 06:34:10 -08:00
  • 1d5d63c413 Feat: Allow pow with negative & non-integer exponent on decimals (#19369) Yuvraj 2026-01-08 19:37:32 +05:30
  • d18e670e78 feat: Allow log with non-integer base on decimals (#19372) Yuvraj 2026-01-08 19:36:34 +05:30
  • 102caeb226 minor: More comments to ParquetOpener::open() (#19677) Yongting You 2026-01-08 11:14:09 +08:00
  • 646213ec7c feat: add Time type support to date_trunc function (#19640) Kumar Ujjawal 2026-01-08 07:12:17 +05:30
  • 35ff4ab0a0 Allow logical optimizer to be run without evaluating now() & refactor SimplifyInfo (#19505) Adrian Garcia Badaracco 2026-01-07 19:38:46 -05:00
  • 142f5972d5 Store example data directly inside the datafusion-examples (#19141) (#19319) Sergey Zhukov 2026-01-08 03:23:05 +03:00
  • 3a0ca4ef79 Remove dependency on rust_decimal, remove ignore of RUSTSEC-2026-0001 (#19666) Andrew Lamb 2026-01-07 14:15:42 -05:00
  • 566bcde9e1 Parquet: Push down supported list predicates (array_has/any/all) during decoding (#19545) kosiew 2026-01-07 20:08:50 +08:00
  • 680ddcc6cb feat: split BatchPartitioner::try_new into hash and round-robin constructors (#19668) Mohit rao 2026-01-07 15:12:11 +05:30
  • ce08307a4c refactor: Use Signature::coercible for isnan/iszero (#19604) Kumar Ujjawal 2026-01-07 12:50:29 +05:30
  • 1f654bbe62 feat: implement metrics for AsyncFuncExec (#19626) feniljain 2026-01-07 07:43:33 +05:30
  • 166ef81121 Perf: Optimize substring_index via single-byte fast path and direct indexing (#19590) lyne 2026-01-07 05:02:13 +08:00
  • e8196f462f Remove coalesce batches rule and deprecate CoalesceBatchesExec (#19622) feniljain 2026-01-07 02:25:33 +05:30
  • ed01b67f2d Refactor PartitionedFile: add ordering field and new_from_meta constructor (#19596) Adrian Garcia Badaracco 2026-01-06 15:51:09 -05:00
  • 7942e751cb Update dependencies (#19667) Andrew Lamb 2026-01-06 15:50:14 -05:00
  • a2f02f069a fix: Return Int for Date - Date instead of duration (#19563) Kumar Ujjawal 2026-01-07 02:10:54 +05:30
  • 924037ea04 perf: Improve performance of split_part (#19570) Andy Grove 2026-01-06 09:39:23 -07:00
  • 1037f0aa20 feat: add list_files_cache table function for datafusion-cli (#19388) jizezhang 2026-01-06 05:23:39 -08:00
  • c3e1c3644a Downgrade aws-smithy-runtime, update rust_decimal, ignore RUSTSEC-2026-0001 to get clean CI (#19657) Andrew Lamb 2026-01-06 07:55:54 -05:00
  • 5c2ee3650d perf: optimize HashTableLookupExpr::evaluate (#19602) UBarney 2026-01-06 18:05:36 +08:00
  • adf00a6495 Add a protection to release candidate branch 52 (#19660) xudong.w 2026-01-06 15:08:14 +08:00
  • aee5cd9f35 fix(functions): Make translate function postgres compatible (#19630) Devanshu 2026-01-06 06:57:08 +05:30
  • ff38480f22 Refactor percentile_cont to clarify support input types (#19611) Jeffrey Vo 2026-01-06 10:23:03 +09:00
  • 2d56253893 fix(spark): Use wrapping addition/subtraction in SparkDateAdd and SparkDateSub (#19377) Mikhail Zabaluev 2026-01-06 03:14:01 +02:00
  • 9b2505ce6a fix(doc): close #19393, make upgrading guide match v51 api (#19648) mag1c1an1 2026-01-06 09:11:58 +08:00
  • ada0923a39 Respect execution timezone in to_timestamp and related functions (#19078) Bruce Ritchie 2026-01-05 16:03:42 -05:00
  • fd79241633 chore(deps): bump syn from 2.0.111 to 2.0.113 (#19645) dependabot[bot] 2026-01-05 15:57:38 -05:00
  • c8620129f3 chore(deps): bump tracing from 0.1.43 to 0.1.44 (#19644) dependabot[bot] 2026-01-05 15:56:52 -05:00
  • 47df535d2c chore(deps): bump taiki-e/install-action from 2.65.11 to 2.65.13 (#19646) dependabot[bot] 2026-01-05 22:11:30 +11:00
  • 52bbc8afcf chore(deps): bump insta from 1.45.0 to 1.46.0 (#19643) dependabot[bot] 2026-01-05 10:50:53 +01:00
  • 418f62ae3b fix: NULL handling in arrow_intersect and arrow_union (#19415) feniljain 2026-01-05 05:58:52 +05:30
  • 45d4948b35 Validate parquet writer version (#19515) Aly Abdelmoneim 2026-01-05 02:22:48 +02:00
  • e0b4e8d82f feat: Implement partition_statistics API for SortMergeJoinExec (#19567) Kumar Ujjawal 2026-01-05 03:09:45 +05:30
  • 7e049749eb feat: Implement Spark function space (#19610) Kazantsev Maksim 2026-01-04 13:34:57 -08:00
  • 955fd41d8e docs: fix typos in PartitionEvaluator trait documentation (#19631) Solari Systems 2026-01-03 22:27:53 -08:00
  • 7fde30a8aa fix: handle invalid byte ranges in calculate_range for single-line files (#19607) Michele Vigilante 2026-01-04 04:35:10 +01:00
  • 09455f1814 chore: bump testcontainers-modules to 0.14 and remove testcontainers dep (#19620) Jeffrey Vo 2026-01-04 04:31:21 +09:00
  • 70daf88251 feat: plan-time SQL expression simplifying (#19311) theirix 2026-01-03 10:53:32 +00:00
  • 2ac032b40d fix: emit empty RecordBatch for empty file writes (#19370) nlimpid 2026-01-03 17:22:49 +08:00
  • 8809dae280 perf: Improve performance of ltrim, rtrim, btrim (#19551) Andy Grove 2026-01-02 19:02:55 -07:00
  • 715962c80b perf: optimize factorial function performance (#19575) Namgung Chan 2026-01-03 10:59:40 +09:00
  • a295698592 chore(deps): bump taiki-e/install-action from 2.65.10 to 2.65.11 (#19601) dependabot[bot] 2026-01-02 12:36:56 -08:00
  • 1320069246 Fix typo in contributor guide architecture section (#19613) Clement de Groc 2026-01-02 17:29:36 +01:00
  • 987b94ca47 chore: Add TPCDS benchmark comparison for PR (#19552) Oleks V 2026-01-02 08:15:14 -08:00
  • 56fec71c7b Add left function benchmark (#19600) Liang-Chi Hsieh 2026-01-02 23:19:16 +08:00
  • 9a9ff8d616 perf: Improve performance of hex encoding in spark functions (#19586) Shashidhar B M 2026-01-02 05:06:47 +05:30
  • 195d3d64bb perf: optimize strpos by eliminating double iteration for UTF-8 (#19572) Liang-Chi Hsieh 2026-01-02 05:24:30 +08:00
  • bc753c201f chore(deps): bump taiki-e/install-action from 2.65.8 to 2.65.10 (#19578) dependabot[bot] 2026-01-01 11:04:04 -08:00
  • 90f5bfe309 feat: Implement Spark functions hour, minute, second (#19512) Andy Grove 2026-01-01 09:00:16 -07:00
  • 0db668bc99 Refactor duplicate code in type_coercion/functions.rs (#19518) Jeffrey Vo 2026-01-02 00:30:21 +09:00
  • 818706ab78 feat: to_time function (#19540) Kumar Ujjawal 2026-01-01 20:56:45 +05:30
  • cd12d51039 Replace deprecated structopt with clap in datafusion-benchmarks (#19492) Yuvraj 2026-01-01 20:09:23 +05:30
  • db7b8cc4db Implement partition_statistics API for NestedLoopJoinExec (#19468) Kumar Ujjawal 2026-01-01 20:08:41 +05:30
  • 8959b3d11e feat: output statistics for constant columns in projections (#19419) Shashidhar B M 2026-01-01 19:34:21 +05:30
  • fd263216c9 feat: integrate batch coalescer with async fn exec (#19342) feniljain 2026-01-01 19:19:26 +05:30
  • b818f93416 perf: Improve performance of md5 (#19568) Andy Grove 2025-12-31 02:24:34 -07:00
  • 79f67b8ef2 feat: implement partition_statistics for WindowAggExec (#18534) Dongpo Liu 2025-12-31 08:12:43 +01:00
  • 27de50d055 fix: Reverse row selection should respect the row group index (#19557) Qi Zhu 2025-12-31 14:35:33 +08:00
  • 1ce4b51a48 Various refactors to string functions (#19402) Jeffrey Vo 2025-12-31 11:45:15 +09:00
  • f1e5c94f3a Compute Dynamic Filters only when a consumer supports them (#19546) Lía Adriana 2025-12-31 02:04:33 +01:00
  • 4e45c19d10 Enables DefaultListFilesCache by default (#19366) Blake Orth 2025-12-30 17:45:15 -07:00
  • 56a2be17d0 perf: optimize regexp_count to avoid String allocation when start position is provided (#19553) Liang-Chi Hsieh 2025-12-31 05:13:05 +08:00
  • 9690f958e2 perf: improve performance of lpad/rpad by reusing buffers (#19558) Liang-Chi Hsieh 2025-12-31 04:29:10 +08:00
  • 1704d1e740 refactor: extract the data generate out of aggregate_topk benchmark (#19523) Huaijin 2025-12-31 03:39:38 +08:00
  • d13d89129b feat: Add DELETE/UPDATE hooks to TableProvider trait and to MemTable implementation (#19142) Ethan Urbanski 2025-12-30 10:35:33 -05:00
  • 34addca6ba bugfix: preserve schema metadata for record batch in FFI (#19293) Tim Saucer 2025-12-30 08:38:37 -05:00
  • 3420a2d4ad chore(deps): bump taiki-e/install-action from 2.65.6 to 2.65.8 (#19559) dependabot[bot] 2025-12-30 17:25:05 +08:00
  • a51e3a079e minor : add crypto function benchmark (#19539) Namgung Chan 2025-12-30 14:52:46 +09:00
  • a6fd5cc840 Upgrade hashbrown to 0.16 (#19554) Daniël Heres 2025-12-30 06:07:22 +01:00
  • 13f38435a2 Introduce TypeSignatureClass::Any (#19485) Jeffrey Vo 2025-12-30 12:54:41 +09:00
  • 1d2b389599 perf: Optimize contains for scalar search arg (#19529) Andy Grove 2025-12-29 20:41:05 -07:00
  • 8469aa1dcd minor: implement more arms for get_data_types() for NativeType (#19449) Jeffrey Vo 2025-12-30 11:20:52 +09:00
  • 8ac500bf07 Revert Spark Elt nullability change (#19510) Jeffrey Vo 2025-12-30 04:16:22 +09:00
  • f9cdfea7fe docs: Improve config tables' readability (#19522) Nuno Faria 2025-12-29 19:13:42 +00:00
  • 673d7c93a0 Refactor TopKHashTable to use HashTable API (#19464) Daniël Heres 2025-12-29 19:38:01 +01:00