Commit Graph

  • b80bf2ca8e fix: filter pushdown when merge filter (#20110) Huaijin 2026-02-04 05:26:51 +08:00
  • 5981d62c28 chore: remove datatype check functions in favour of upstream versions (#20104) Jeffrey Vo 2026-02-04 06:26:14 +09:00
  • c3eb9ef05f Make session state builder clonable (#20136) Albert Skalt 2026-02-04 00:23:42 +03:00
  • a41f9081e9 chore(deps): bump bytes from 1.11.0 to 1.11.1 (#20141) dependabot[bot] 2026-02-03 15:06:15 -05:00
  • 4dfc193cbf Improve performance of CASE WHEN x THEN y ELSE NULL expressions (#20097) Pepijn Van Eeckhoudt 2026-02-03 20:40:34 +01:00
  • 35e78ca10a Optimize the evaluation of date_part(<col>) == <constant> when pushed down (#19733) Kosta Tarasov 2026-02-03 11:55:53 -05:00
  • 81f7a871a9 Add BufferExec execution plan (#19760) Gabriel 2026-02-03 14:48:35 +01:00
  • 29d63c1964 Optimize PhysicalExprSimplifier (#20111) Adam Gutglick 2026-02-03 12:22:33 +00:00
  • 4557033c30 chore(deps): bump clap from 4.5.54 to 4.5.56 (#20131) dependabot[bot] 2026-02-03 06:45:14 -05:00
  • 5be5259221 chore(deps): bump flate2 from 1.1.8 to 1.1.9 (#20130) dependabot[bot] 2026-02-03 11:41:30 +00:00
  • a462b79a95 chore(deps): bump insta from 1.46.1 to 1.46.3 (#20129) dependabot[bot] 2026-02-03 06:41:04 -05:00
  • 9f33776c2c chore(deps): bump tonic from 0.14.2 to 0.14.3 (#20127) dependabot[bot] 2026-02-03 11:40:24 +00:00
  • 60f7a3cdae chore(deps): bump actions/setup-node from 4 to 6 (#20125) dependabot[bot] 2026-02-03 11:39:55 +00:00
  • 40762f4710 chore(deps): bump taiki-e/install-action from 2.67.13 to 2.67.18 (#20124) dependabot[bot] 2026-02-03 06:39:40 -05:00
  • 1825f97b76 Add duplicate name error reproducer (#20106) Gabriel 2026-02-03 10:09:11 +01:00
  • b2a689300a chore: reduce production noise by using debug macro (#19885) Alan Tang 2026-02-03 12:36:16 +08:00
  • 8c478e9452 Disallow positional struct casting when field names don’t overlap (#19955) kosiew 2026-02-03 10:22:13 +08:00
  • 96a6bd71ed Runs-on for linux-build-lib and linux-test (2X faster CI) (#20107) Dmitrii Blaginin 2026-02-03 02:07:31 +00:00
  • 39da29f5ee Add ScalarValue::RunEndEncoded variant (#19895) Jeffrey Vo 2026-02-03 08:49:13 +09:00
  • dab903e668 Improve memory accounting for ArrowBytesViewMap (#20077) Vignesh 2026-02-03 05:05:47 +05:30
  • 030621c6b6 refactor: Rename FileSource::try_reverse_output to FileSource::try_pushdown_sort (#20043) Kumar Ujjawal 2026-02-03 05:05:15 +05:30
  • d28a03c583 Adjust case_when DivideByZeroProtection benchmark so that "percentage of zeroes" corresponds to "number of times protection is needed" (#20105) Pepijn Van Eeckhoudt 2026-02-03 00:33:58 +01:00
  • 7388eed29f add ability to customize tokens in parser (#19978) Albert Skalt 2026-02-03 02:33:03 +03:00
  • 9962911ee5 feat: add ExpressionPlacement enum for optimizer expression placement decisions (#20065) Adrian Garcia Badaracco 2026-02-02 10:58:48 -06:00
  • 66ee0afcab Preserve PhysicalExpr graph in proto round trip using Arc pointers as unique identifiers (#20037) Adrian Garcia Badaracco 2026-02-02 08:04:11 -06:00
  • 545c37fbca perf: optimise right for byte access and StringView (#20069) theirix 2026-02-02 04:30:10 +00:00
  • 1a0c2e05c4 Use return_field_from_args in information schema and date_trunc (#20079) Andrea Bozzo 2026-02-02 05:05:52 +01:00
  • c849374723 Refactor iszero() and isnan() to accept all numeric types (#20093) Kumar Ujjawal 2026-02-02 09:35:16 +05:30
  • 51c04754ee Implement preimage for floor function to enable predicate pushdown (#20059) Devanshu 2026-02-01 11:07:13 +07:00
  • 3797995960 perf: improve performance of array_repeat function (#20049) lyne 2026-02-01 10:14:16 +08:00
  • f779e96680 perf: Optimize scalar path for chr function (#20073) Kumar Ujjawal 2026-02-01 07:14:56 +05:30
  • 9de192af29 docs: update data_types.md to reflect current Arrow type mappings (#20072) karuppuchamysuresh 2026-01-31 09:50:23 -08:00
  • bc4c245035 Automatically generate examples documentation adv (#19294) (#19750) Sergey Zhukov 2026-01-31 17:05:41 +03:00
  • a02e6836e7 Reduce number of traversals per node in PhysicalExprSimplifier (#20082) Adam Gutglick 2026-01-30 21:47:10 +00:00
  • 4a6365979e Minor: update tests in limit_pushdown.rs to insta (#20066) Andrew Lamb 2026-01-30 16:16:54 -05:00
  • b50e1f755b chore: Add microbenchmark (compared to ExprOrExpr) (#20076) Yu-Chuan Hung 2026-01-31 03:07:37 +08:00
  • 2860adaf4b fix: The limit_pushdown physical optimization rule removes limits in some cases leading to incorrect results (#20048) Mason 2026-01-30 08:35:37 -05:00
  • cad38654c7 fix: correct weight handling in approx_percentile_cont_with_weight (#19941) Sergio Esteves 2026-01-30 12:44:37 +00:00
  • f0de02fd66 Use BooleanBufferBuilder rather than Vec<bool> in ArrowBytesViewMap (#20064) Eesh Sagar Singh 2026-01-30 16:04:43 +05:30
  • 7c3ea0540c feat: add AggregateMode::PartialReduce for tree-reduce aggregation (#20019) Nathaniel J. Smith 2026-01-30 01:54:50 -08:00
  • f997169b27 Improve documentation for ScalarUDFImpl::preimage (#20008) Andrew Lamb 2026-01-29 13:30:37 -05:00
  • ed0a060185 replace private is_volatile_expression_tree with equivalent public is_volatile (#20056) Adrian Garcia Badaracco 2026-01-29 04:33:12 -06:00
  • 41bfb794c8 feat: support Spark-compatible abs math function part 2 - ANSI mode (#18828) hsiang-c 2026-01-28 19:05:39 -08:00
  • 8152b44430 feat: optimise copying in left for Utf8 and LargeUtf8 (#19980) theirix 2026-01-29 03:04:18 +00:00
  • 92f60ad513 docs: Automatically update DataFusion version in docs (#20001) Nuno Faria 2026-01-29 03:02:37 +00:00
  • 36c0cda206 fix: respect DataFrameWriteOptions::with_single_file_output for paths without extensions (#19931) Kumar Ujjawal 2026-01-29 02:27:37 +05:30
  • 20d5f06a55 perf: Push down join key filters for LEFT/RIGHT/ANTI joins (#19918) Nuno Faria 2026-01-28 14:59:58 +00:00
  • a77e5a5c50 Further refactoring of type coercion function code (#19603) Jeffrey Vo 2026-01-28 22:54:04 +09:00
  • 568f19fa39 Simplify Spark sha2 implementation (#19475) Jeffrey Vo 2026-01-28 20:50:42 +09:00
  • 4b31aaa14c Fix constant value from stats (#20042) Gabriel 2026-01-28 12:45:05 +01:00
  • 5f9bacddcd Enforce clippy::allow_attributes globally across workspace (#19576) Jeffrey Vo 2026-01-28 16:15:10 +09:00
  • 16368983bd minor: add tests for coercible signature considering nulls/dicts/ree (#19459) Jeffrey Vo 2026-01-28 13:07:11 +09:00
  • 47bf17859c fix: return correct length array for scalar null input to calculate_binary_math (#19861) Jeffrey Vo 2026-01-28 13:00:27 +09:00
  • 81e972328d Fix/parquet opener page index policy (#19890) Aviral Garg 2026-01-28 03:40:38 +05:30
  • 0a68b81ade [main] Update version to 52.1.0 (#19878) (#20028) Andrew Lamb 2026-01-27 16:51:24 -05:00
  • bef1368c33 Simplify wait_complete function (#19937) Lía Adriana 2026-01-27 22:13:48 +01:00
  • f819061833 Tweak adapter serialization example (#20035) Adrian Garcia Badaracco 2026-01-27 14:49:20 -06:00
  • 41d48b3f6c feat: implement protobuf converter trait to allow control over serialization and deserialization processes (#19437) Tim Saucer 2026-01-27 13:09:59 -05:00
  • aae3e0f4a1 refactor: make PhysicalExprAdapterFactory::create fallible (#20017) niebayes 2026-01-28 01:20:37 +08:00
  • adb8c8a716 minor: Move metric page_index_rows_pruned to verbose level in EXPLAIN ANALYZE (#20026) Yongting You 2026-01-28 00:31:14 +08:00
  • 1a48d580c8 add more projection pushdown slt tests (#20015) Adrian Garcia Badaracco 2026-01-27 10:13:54 -06:00
  • aca7169016 feat(spark): add unix date and timestamp functions (#19892) cht42 2026-01-27 16:18:14 +04:00
  • af771970b7 chore(deps): bump taiki-e/install-action from 2.67.9 to 2.67.13 (#20020) dependabot[bot] 2026-01-27 20:08:01 +11:00
  • 82f27e9af1 refactor: extract pushdown test utilities to shared module (#20010) Adrian Garcia Badaracco 2026-01-27 03:45:46 -05:00
  • 52deee513f chore(deps): bump setuptools from 80.10.1 to 80.10.2 in /docs (#20022) dependabot[bot] 2026-01-27 16:44:24 +08:00
  • f5709e7484 feat(spark): implement StringView for SparkConcat (#19984) Aryan Anand 2026-01-27 12:28:33 +05:30
  • c560beeb1a perf: Optimize repeat function for scalar and array fast (#19976) Kumar Ujjawal 2026-01-27 12:27:17 +05:30
  • 58fb6e19d2 feat(spark): implement from/to_utc_timestamp functions (#19880) cht42 2026-01-27 10:56:23 +04:00
  • 8653851f57 feat(spark): Add SessionStateBuilderSpark to datafusion-spark (#19865) cht42 2026-01-27 10:55:33 +04:00
  • 073174b034 feat: Show the number of matched Parquet pages in DataSourceExec (#19977) Nuno Faria 2026-01-27 06:03:41 +00:00
  • adddd4c32b fix: Make generate_series return an empty set with invalid ranges (#19999) Nuno Faria 2026-01-27 05:21:36 +00:00
  • 5c47d6d13f Fix ClickBench EventDate handling by casting UInt16 days-since-epoch to DATE via hits view (#19881) kosiew 2026-01-27 09:12:00 +08:00
  • cda6425287 perf: Optimize ArrowBytesViewMap with direct view access (#19975) Tushar Das 2026-01-27 02:15:47 +05:30
  • 6524d91938 minor: pull uuid into workspace dependencies (#19997) Jeffrey Vo 2026-01-27 01:39:41 +09:00
  • 27abe5a62d perf: improve performance of array_remove, array_remove_n and array_remove_all functions (#19996) lyne 2026-01-27 00:37:22 +08:00
  • 8efc2b605e feat(spark): add base64 and unbase64 functions (#19968) cht42 2026-01-26 20:36:42 +04:00
  • 50a3e138e3 Speedup statistics_from_parquet_metadata (#20004) Daniël Heres 2026-01-26 14:03:51 +01:00
  • f8a22a51bf docs: Fix some broken / missing links in the DataFusion documentation (#19958) Andrew Lamb 2026-01-26 07:37:27 -05:00
  • b6f7521752 Do not require mut in memory reservation methods (#19759) Gabriel 2026-01-26 12:44:33 +01:00
  • 4d63f8c927 minor: Add favicon (#20000) Nuno Faria 2026-01-26 05:19:56 +00:00
  • 8023947fad fix: maintain inner list nullability for array_sort (#19948) Jeffrey Vo 2026-01-26 07:03:10 +09:00
  • d1eea0755a Fix broken links in the documentation (#19964) Andrew Lamb 2026-01-25 17:02:46 -05:00
  • e062ae5053 minor: Fix doc about write_batch_size (#19979) Nuno Faria 2026-01-25 22:01:27 +00:00
  • 03c2751f19 chore(deps): bump uuid from 1.19.0 to 1.20.0 (#19993) dependabot[bot] 2026-01-26 03:10:42 +11:00
  • d590824ede chore(deps): bump sysinfo from 0.37.2 to 0.38.0 (#19990) dependabot[bot] 2026-01-25 22:11:11 +11:00
  • 355cae870b chore(deps): bump nix from 0.30.1 to 0.31.1 (#19991) dependabot[bot] 2026-01-25 22:10:31 +11:00
  • 853a61716d chore(deps): bump quote from 1.0.43 to 1.0.44 (#19992) dependabot[bot] 2026-01-25 22:09:08 +11:00
  • 6647583296 chore(deps): bump taiki-e/install-action from 2.66.7 to 2.67.9 (#19987) dependabot[bot] 2026-01-25 22:08:07 +11:00
  • 0b60c588be chore(deps): bump setuptools from 80.9.0 to 80.10.1 in /docs (#19988) dependabot[bot] 2026-01-25 20:29:33 +11:00
  • 4127688735 Change GitHub actions dependabot schedule to weekly (#19981) Jeffrey Vo 2026-01-25 18:22:51 +09:00
  • e5e76366a6 export SessionState::register_catalog_list(...) (#19925) Albert Skalt 2026-01-24 16:03:29 +03:00
  • c1352360bd Improve error messages with nicer formatting of Date and Time types (#19954) Emil Ernerfeldt 2026-01-24 13:16:59 +01:00
  • 05802e205b perf: Optimize factorial scalar path (#19949) Kumar Ujjawal 2026-01-24 11:23:20 +05:30
  • 1897924b7a fix: change token consumption to peek to test on EOF in parser (#19927) Albert Skalt 2026-01-24 08:52:11 +03:00
  • c5e99e595a perf: Optimize scalar path for ascii function (#19951) Kumar Ujjawal 2026-01-24 11:18:59 +05:30
  • 17cbff06e0 Misc hash / hash aggregation performance improvements (#19910) Daniël Heres 2026-01-24 06:39:51 +01:00
  • 23f5003ca1 Add struct pushdown query benchmark and projection pushdown tests (#19962) Adrian Garcia Badaracco 2026-01-23 22:10:03 -05:00
  • 35e99b97bc preserve FilterExec batch size during ser/de (#19960) Albert Skalt 2026-01-24 04:05:47 +03:00
  • b463a9f9e3 chore(deps): bump actions/checkout from 6.0.1 to 6.0.2 (#19953) dependabot[bot] 2026-01-23 13:26:39 -05:00
  • 45c8ae43e8 Support LargeUtf8 as partition column (#19942) Dewey Dunnington 2026-01-23 12:22:20 -06:00