Evgenii Khramkov

Senior Software Engineer at Spice AI

View all authors

Spice v2.0-rc.2 (Apr 10, 2026)

April 10, 2026 · 28 min read

Evgenii Khramkov

Senior Software Engineer at Spice AI

Announcing the release of Spice v2.0-rc.2! 🔥

v2.0.0-rc.2 is the second release candidate for advanced testing of v2.0, building on v2.0.0-rc.1.

Highlights in this release candidate include:

Distributed Spice Cayenne Query and Write Improvements with data-local query routing and partition-aware write-through
DataFusion v52.4.0 Upgrade with aligned arrow-rs, datafusion-federation, and datafusion-table-providers
MERGE INTO for Spice Cayenne catalog tables with distributed support across executors
PARTITION BY Support for Cayenne enabling SQL-defined partitioning in CREATE TABLE statements
ADBC Data Connector & Catalog with full query federation, BigQuery support, and schema/table discovery
Databricks Lakehouse Federation Improvements with improved reliability, resilience, DESCRIBE TABLE fallback, and source-native type parsing
Delta Lake Column Mapping supporting Name and Id mapping modes
HTTP Pagination support for paginated API endpoints in the HTTP data connector
New Catalog Connectors for PostgreSQL, MySQL, MSSQL, and Snowflake
JSON Ingestion Improvements with single-object support, soda (Socrata Open Data) format support, json_pointer extraction, and auto-detection
Per-Model Rate-Limited AI UDF Execution for controlling concurrent AI function invocations
Dependency upgrades including Turso v0.5.3, iceberg-rust v0.9, and Vortex improvements

What's New in v2.0.0-rc.2

Distributed Cayenne Query and Write Improvements

Distributed query for Cayenne-backed tables now has better partition awareness for both reads and writes.

Key improvements:

Data-Local Query Routing: Cayenne catalog queries can now be routed to executors that hold the relevant partitions, improving distributed query efficiency.
Partition-Aware Write Through: Scheduler-side Flight DoPut ingestion now splits partitioned Cayenne writes and forwards them to the responsible executors instead of routing through a single raw-forward path.
Dynamic Partition Assignment: Newly observed partitions can be added and assigned atomically as data arrives, with persisted partition metadata for future routing.
Better Cluster Coordination: Partition management is now separated for accelerated and federated tables, improving routing behavior for distributed Cayenne catalog workloads.
Distributed UPDATE/DELETE DML: UPDATE and DELETE statements for Cayenne catalog tables are now forwarded to all executors in distributed mode, with all executors required to succeed.
Distributed runtime.task_history: Task history is now replicated across the distributed cluster for observability.
RefreshDataset Control Stream: Dataset refresh operations are now distributed via the control stream to executors.
Executor DDL Sync: When an executor connects, it receives DDL for all existing tables, ensuring late-joining executors have full table state.

MERGE INTO for Spice Cayenne

Spice now supports MERGE INTO statements for Cayenne catalog tables, enabling upsert-style data operations with full distributed support.

Key improvements:

MERGE INTO Support: Execute MERGE INTO statements against Cayenne catalog tables for combined insert/update/delete operations.
Distributed MERGE: MERGE operations are automatically distributed across executors in cluster mode.
Data Safety: Duplicate source keys are detected and prevented to avoid data loss during MERGE operations.
Chunked Delete Filters: Large MERGE delete filter lists are chunked to prevent stack overflow with Vortex IN-list expressions.

`PARTITION BY` Support for Cayenne

SQL Partition Management: Spice now supports PARTITION BY for Cayenne-backed CREATE TABLE statements, enabling partition definitions to be expressed directly in SQL and persisted in the Cayenne catalog.

Key improvements:

SQL Partition Definition: Define Cayenne table partitioning directly in SQL using CREATE TABLE ... PARTITION BY (...).
Partition Validation: Partition expressions are parsed and validated during DDL analysis before table creation.
Persisted Partition Metadata: Partition metadata is stored in the Cayenne catalog and can be reloaded by the runtime after restart.
Distributed DDL Support: Partition metadata is forwarded when CREATE TABLE is distributed to executors in cluster mode.
Improved Type Support: Partition utilities now support newer string scalar variants such as Utf8View.

Example:

CREATE TABLE events (id INT, region TEXT, ts TIMESTAMP) PARTITION BY (region)

Catalog Connector Enhancements

Spice now includes additional catalog connectors for major database systems, improving schema discovery and federation workflows across external data systems.

Key improvements:

New Catalog Connectors: Added catalog connectors for PostgreSQL, MySQL, MSSQL, and Snowflake.
Schema and Table Discovery: Connectors use native metadata catalogs such as information_schema / INFORMATION_SCHEMA to discover schemas and tables.
Improved Federation Workflows: These connectors make it easier to expose external database metadata through Spice for cross-system federation scenarios.
PostgreSQL Partitioned Tables: Fixed schema discovery for PostgreSQL partitioned tables.

Example PostgreSQL catalog configuration:

catalogs:
  - from: pg
    name: pg
    include:
      - 'public.*'
    params:
      pg_host: localhost
      pg_port: 5432
      pg_user: postgres
      pg_pass: ${secrets:POSTGRES_PASSWORD}
      pg_db: my_database
      pg_sslmode: disable

JSON Ingestion Improvements

JSON ingestion is now more flexible and robust.

Key improvements:

More JSON Formats: Added support for single-object JSON documents, auto-detected JSON formats, and Socrata SODA responses.
json_pointer Extraction: Extract nested payloads before schema inference and reading using RFC 6901 JSON Pointer syntax.
Better Auto-Detection: JSON format detection now handles arrays, objects, JSONL, and BOM-prefixed input more reliably, including single multi-line objects.
SODA Support: Added schema extraction and data conversion for Socrata Open Data API responses.
Broader Compatibility: Improved handling for BOM-prefixed files, CRLF-delimited JSONL, nested payloads, mixed structures, and wrapped documents.

Example using json_pointer to extract nested data from an API response:

datasets:
  - from: https://api.example.com/v1/data
    name: users
    params:
      json_pointer: /data/users

DataFusion v52.4.0 Upgrade

Apache DataFusion has been upgraded from v52.2.0 to v52.4.0, with aligned updates across arrow-rs, datafusion-federation, and datafusion-table-providers.

Key improvements:

DataFusion v52.4.0: Brings the latest fixes and compatibility improvements across query planning and execution.
Strict Overflow Handling: try_cast_to now uses strict cast to return errors on overflow instead of silently producing NULL values.
Federation Fix: Fixed SQL unparsing for Inexact filter pushdown with aliases.
Partial Aggregation Optimization: Improved partial aggregation performance for FlightSQLExec.

Dependency Upgrades

Dependency	Version / Update
Turso (libsql)	v0.5.3 (from v0.4.4)
iceberg-rust	v0.9
Vortex	Map type support, stack-safe IN-lists
arrow-rs	Arrow v57.2.0
datafusion-federation	Updated for DataFusion v52.4.0 alignment
datafusion-table-providers	Updated for DataFusion v52.4.0 alignment
datafusion-ballista	Bumped to fix BatchCoalescer schema mismatch panic

Other Improvements

Cayenne released as RC: Cayenne data accelerator is now promoted to release candidate status.
File Update Acceleration Mode: Added mode: file_update acceleration mode for file-based data refresh.
spice completions Command: New CLI command for generating shell completion scripts, with auto-detection of shell directory.
--endpoint Flag: Added --endpoint flag to spice run with scheme-based routing for custom endpoints.
mTLS Client Auth: Added mTLS client authentication support to the spice sql REPL.
DynamoDB DML: Implemented DML (INSERT, UPDATE, DELETE) support for the DynamoDB table provider.
Caching Retention: Added retention policies for cached query results.
GraphQL Custom Auth Headers: Added custom authorization header support for the GraphQL connector.
ClickHouse Date32 Support: Added Date32 type support for the ClickHouse connector.
AWS IAM Role Source: Added iam_role_source parameter for fine-grained AWS credential configuration.
S3 Metadata Columns: Metadata columns renamed to _location, _last_modified, _size for consistency, with more robust handling in projected queries.
S3 URL Style: Added s3_url_style parameter for S3 connector URL addressing (path-style vs virtual-hosted). Useful for S3-compatible stores like MinIO:
```
params:
  s3_endpoint: https://minio.local:9000
  s3_url_style: path
```
S3 Parquet Performance: Improved S3 parquet read performance.
HTTP Caching: Transient HTTP error responses such as 429 and 5xx are no longer cached, preventing stale error payloads from being served from cache.
HTTP Connector Metadata: Added response_headers as structured map data for HTTP datasets.

Views on_zero_results: Accelerated views now support on_zero_results: use_source to fall back to the source when no results are found:

views:
  - name: sales_summary
    sql: |
      SELECT region, SUM(amount) as total
      FROM sales
      GROUP BY region
    acceleration:
      enabled: true
      on_zero_results: use_source

Flight DoPut Ingestion Metrics: Added rows_written and bytes_written metrics for Flight DoPut / ADBC ETL ingestion.
EXPLAIN ANALYZE Metrics: Added metrics for EXPLAIN ANALYZE in FlightSQLExec.
Scheduler Executor Metrics: Added scheduler_active_executors_count metric for monitoring active executors.
Query Memory Limit: Updated default query memory limit from 70% to 90%, with GreedyMemoryPool for improved memory management.
MetastoreTransaction Support: Added transaction support to prevent concurrent metastore transaction conflicts.
Iceberg REST Catalog: Coerce unsupported Arrow types to Iceberg v2 equivalents in the REST catalog API.
CDC Cache Invalidation: Improved cache invalidation for CDC-backed datasets.
Spice.ai Connector Alignment: Parameter names aligned across catalog and data connectors for Spice.ai Cloud.
Cayenne File Size: Cayenne now correctly respects the configured target file size (defaults to 128MB).
Cayenne Primary Keys: Properly set primary_keys/on_conflict for Cayenne tables.
Turso Metastore Performance: Cached metastore connections and prepared statements for improved Turso and SQLite metastore performance.
Turso SQL Robustness: More robust SQL unparsing and date comparison handling for Turso.
Dictionary Type Normalization: Normalize Arrow Dictionary types for DuckDB and SQLite acceleration.
GitHub Connector Resilience: Improved GraphQL client resilience, performance, and ref filter handling.
ODBC Fix: Fixed ODBC queries silently returning 0 rows on query failure.
Anthropic Fixes: Fixed compatibility issues with Anthropic model provider.
v1/responses API Fix: The /v1/responses API now correctly preserves client instructions when system_prompt is set.
Shared Acceleration Snapshots: Show an error when snapshots are enabled on a shared acceleration file.
Distributed Mode Error Handling: Improved error handling for distributed mode and state_location configuration.
Helm Chart: Added support for ServiceAccount annotations and AWS IRSA example.
Perplexity Removed: Removed Perplexity model provider support.
Rust v1.93.1: Upgraded Rust toolchain to v1.93.1.

Contributors

Breaking Changes

S3 metadata columns renamed: S3 metadata columns renamed from location, last_modified, size to _location, _last_modified, _size.
v1/evals API removed: The /v1/evals endpoint has been removed.
Perplexity removed: Perplexity model provider support has been removed.
Default query memory limit changed: Default query memory limit increased from 70% to 90%.

Upgrading

To upgrade to v2.0.0-rc.2, use one of the following methods:

CLI:

spice upgrade v2.0.0-rc.2

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:2.0.0-rc.2 image:

docker pull spiceai/spiceai:2.0.0-rc.2

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai --version 2.0.0-rc.2

AWS Marketplace:

Spice is available in the AWS Marketplace.

What's Changed

Changelog

ci: fix E2E CLI upgrade test to use latest release for spiced download by @phillipleblanc in #9613
fix(DF): Lazily initialize BatchCoalescer in RepartitionExec to avoid schema type mismatch by @sgrebnov in #9623
feat: Implement catalog connectors for various databases by @lukekim in #9509
Refactor and clean up code across multiple crates by @lukekim in #9620
fix: Improve error handling for distributed mode and state_location configuration by @lukekim in #9611
Properly install postgres in install-postgres action by @krinart in #9629
fix: Use Python venv for schema validation in CI by @phillipleblanc in #9637
Update spicepod.schema.json by @app/github-actions in #9640
Update testoperator dispatch to use release/2.0 branch by @phillipleblanc in #9641
fix: Align CUDA asset names in Dockerfile and install tests with build output by @phillipleblanc in #9639
Fix expect test scripts in E2E Installation AI test by @sgrebnov in #9643
testoperator for partitioned arrow accelerator by @Jeadie in #9635
Remove default 1s refresh_check_interval from spidapter for hive datasets by @phillipleblanc in #9645
Fix scheduler panic and cancel race condition by @phillipleblanc in #9644
Align Spice.ai connector parameter names across catalog/data connectors by @lukekim in #9632
docs: update distribution details and add NAS support in release notes by @lukekim in #9650
Enable postgres-accel in CI builds for benchmarks by @sgrebnov in #9649
perf: Cache Turso metastore connection across operations by @penberg in #9646
Add 'scheduler_state_location' to spidapter by @Jeadie in #9655
Implement Cayenne S3 Express multi-zone live test with data validation by @lukekim in #9631
chore(spidapter): bump default memory limit from 8Gi to 32Gi by @phillipleblanc in #9661
perf: Use prepare_cached() in Turso and SQLite metastore backends by @penberg in #9662
Improve CDC cache invalidation by @krinart in #9651
Refactor Cayenne IDs to use UUIDv7 strings by @lukekim in #9667
fix: add liveness check for dead executors in partition routing by @Jeadie in #9657
fix(s3): Fix metadata column schema mismatches in projected queries by @sgrebnov in #9664
s3_metadata_columns tests: include test for location outside table prefix by @sgrebnov in #9676
docs: Update DuckDB, GCS, Git connector and Cayenne documentation by @lukekim in #9671
Add s3_url_style support for S3 connector URL addressing by @phillipleblanc in #9642
Consolidate E2E workflows and require WSL for Windows runtime by @lukekim in #9660
Upgrade to Rust v1.93.1 by @lukekim in #9669
Security fixes and improvements by @lukekim in #9666
feat(flight): add DoPut rows/bytes written metrics for DoPut ETL ingestion tracking by @phillipleblanc in #9663
Skip caching http error response + add response_headers by @krinart in #9670
refactor: Remove v1/evals functionality by @Jeadie in #9420
Make a test harness for Distributed Spice integration tests by @Jeadie in #9615
Enable on_zero_results: use_source for views by @krinart in #9699
fix(spidapter): Lower memory limit, passthrough AWS secrets, override flight URL by @peasee in #9704
Show an error on a shared acceleration file with snapshots enabled by @krinart in #9698
Fixes for anthropic by @Jeadie in #9707
Use max_partitions_per_executor in allocate_initial_partitions by @Jeadie in #9659
[SpiceDQ] Accelerations must have partition key by @Jeadie in #9711
Upgrade to Turso v0.5 by @lukekim in #9628
feat: Rename metadata columns to _location, _last_modified, _size by @phillipleblanc in #9712
fix: bump datafusion-ballista to fix BatchCoalescer schema mismatch panic by @phillipleblanc in #9716
fix: Ensure Cayenne respects target file size by @peasee in #9730
refactor: Make DDL preprocessing generic from Iceberg DDL processing by @peasee in #9731
[SpiceDQ] Distribute query of Cayenne Catalog to executors with data by @Jeadie in #9727
Properly set primary_keys/on_conflict for Cayenne tables by @krinart in #9739
Add executor resource and replica support to cloud app config by @ewgenius in #9734
feat: Support PARTITION BY in Cayenne Catalog table creation by @peasee in #9741
Update datafusion and related packages to version 52.3.0 by @lukekim in #9708
Route FlightSQL statement updates through QueryBuilder by @phillipleblanc in #9754
JSON file format improvements by @lukekim in #9743
[SpiceDQ] Partition Cayenne catalogs writes through to executors by @Jeadie in #9737
Update to DF v52.3.0 versions of datafusion & datafusion-tableproviders by @lukekim in #9756
Make S3 metadata column handling more robust by @sgrebnov in #9762
Fetch API keys from dedicated endpoint instead of apps response by @phillipleblanc in #9767
Update arrow-rs, datafusion-federation, and datafusion-table-providers dependencies by @phillipleblanc in #9769
Chunk metastore batch inserts to respect SQLite parameter limits by @phillipleblanc in #9770
Improve JSON SODA support by @lukekim in #9795
Add ADBC Data Connector by @lukekim in #9723
docs: Release Cayenne as RC by @peasee in #9766
cli[feat]: cloud mode to use region-specific endpoints by @lukekim in #9803
Include updated JSON formats in HTTPS connector by @lukekim in #9800
Flight DoPut: Partition-aware write-through forwarding by @Jeadie in #9759
Pass through authentication to ADBC connector by @lukekim in #9801
Move scheduler_state_location from adapter metadata to env var by @phillipleblanc in #9802
Fix Cayenne DoPut upsert returning stale data after 3+ writes by @phillipleblanc in #9806
Fix JSON column projection producing schema mismatch by @sgrebnov in #9811
Fix http connector by @krinart in #9818
Fix ADBC Connector build and test by @lukekim in #9813
Support update & delete DML for distributed cayenne catalog by @Jeadie in #9805
Set allow_http param when S3 endpoint uses http scheme by @phillipleblanc in #9834
fix: Cayenne Catalog DDL requires a connected executor in distributed mode by @Jeadie in #9838
fix: Add conditional put support for file:// scheduler state location by @Jeadie in #9842
fix: Require the DDL primary key contain the partition key by @Jeadie in #9844
fix: Databricks SQL Warehouse schema retrieval with INLINE disposition and async retry by @lukekim in #9846
Filter pushdown improvements for SqlTable by @lukekim in #9852
feat: add iam_role_source parameter for AWS credential configuration by @lukekim in #9854
Fix ODBC queries silently returning 0 rows on query failure by @lukekim in #9864
feat(adbc): Add ADBC catalog connector with schema/table discovery by @lukekim in #9865
Make Turso SQL unparsing more robust and fix date comparisons by @lukekim in #9871
Fix Flight/FlightSQL filter precedence and mutable query consistency by @lukekim in #9876
Partial Aggregation optimisation for FlightSQLExec by @lukekim in #9882
fix: v1/responses API preserves client instructions when system_prompt is set by @Jeadie in #9884
feat: emit scheduler_active_executors_count and use it in spidapter by @Jeadie in #9885
feat: Add custom auth header support for GraphQL connector by @krinart in #9899
Add --endpoint flag to spice run with scheme-based routing by @lukekim in #9903
When executor connects, send DDL for existing tables by @Jeadie in #9904
fix: Improve ADBC driver shutdown handling and error classification by @lukekim in #9905
fix: require all executors to succeed for distributed DML (DELETE/UPDATE) forwarding by @Jeadie in #9908
fix(cayenne catalog): fix catalog refresh race condition causing duplicate primary keys by @Jeadie in #9909
Remove Perplexity support by @Jeadie in #9910
Fix refresh_sql support for debezium constraints by @krinart in #9912
Implement DML for DynamoDBTableProvider by @lukekim in #9915
chore: Update iceberg-rust fork to v0.9 by @lukekim in #9917
Run physical optimizer on FallbackOnZeroResultsScanExec fallback plan by @sgrebnov in #9927
Improve Databricks error message when dataset has no columns by @sgrebnov in #9928
Delta Lake: fix data skipping for >= timestamp predicates by @sgrebnov in #9932
fix: Ensure distributed Cayenne DML inserts are forwarded to executors by @Jeadie in #9948
Add full query federation support for ADBC data connector by @lukekim in #9953
Make time_format deserialization case-insensitive by @vyershov in #9955
Hash ADBC join-pushdown context to prevent credential leaks in EXPLAIN plans by @lukekim in #9956
fix: Normalize Arrow Dictionary types for DuckDB and SQLite acceleration by @sgrebnov in #9959
ADBC BigQuery: Improve BigQuery dialect date/time and interval SQL generation by @lukekim in #9967
Make BigQueryDialect more robust and add BigQuery TPC-H benchmark support by @lukekim in #9969
fix: Show proper unauthorized error instead of misleading runtime unavailable by @lukekim in #9972
fix: Enforce target_chunk_size as hard maximum in chunking by @lukekim in #9973
Add caching retention by @krinart in #9984
fix: improve Databricks schema error detection and messages by @lukekim in #9987
fix: Set default S3 region for opendal operator and fix cayenne nextest by @phillipleblanc in #9995
fix(PostgreSQL): fix schema discovery for PostgreSQL partitioned tables by @sgrebnov in #9997
fix: Defer cache size check until after encoding for compressed results by @krinart in #10001
fix: Rewrite numeric BETWEEN to CAST(AS REAL) for Turso by @lukekim in #10003
fix: Handle integer time columns in append refresh for all accelerators by @sgrebnov in #10004
fix: preserve s3a:// scheme when building OpenDalStorageFactory with custom endpoint by @phillipleblanc in #10006
Fix ISO8601 time_format with Vortex/Cayenne append refresh by @sgrebnov in #10009
fix: Address data correctness bugs found in audit by @sgrebnov in #10015
fix(federation): fix SQL unparsing for Inexact filter pushdown with alias by @lukekim in #10017
Improve GitHub connector ref handling and resilience by @lukekim in #10023
feat: Add spice completions command for shell completion generation by @lukekim in #10024
fix: Fix data correctness bugs in DynamoDB decimal conversion and GraphQL pagination by @sgrebnov in #10054
Implement RefreshDataset for distributed control stream by @Jeadie in #10055
perf: Improve S3 parquet read performance by @sgrebnov in #10064
fix: Prevent write-through stalls and preserve PartitionTableProvider during catalog refresh by @Jeadie in #10066
feat: spice completions auto-detects shell directory and writes file by @lukekim in #10068
fix: Bug in DynamoDB, GraphQL, and ISO8601 refresh data handling by @sgrebnov in #10063
fix partial aggregation deduplication on string checking by @lukekim in #10078
fix: add MetastoreTransaction support to prevent concurrent transaction conflicts by @phillipleblanc in #10080
fix: Use GreedyMemoryPool, add spidapter query memory limit arg by @phillipleblanc in #10082
feat: Add metrics for EXPLAIN ANALYZE in FlightSQLExec by @lukekim in #10084
Use strict cast in try_cast_to to error on overflow instead of silent NULL by @sgrebnov in #10104
feat: Implement MERGE INTO for Cayenne catalog tables by @peasee in #10105
feat: Add distributed MERGE INTO support for Cayenne catalog tables by @peasee in #10106
Improve JSON format auto-detection for single multi-line objects by @lukekim in #10107
Add mode: file_update acceleration mode by @krinart in #10108
Coerce unsupported Arrow types to Iceberg v2 equivalents in REST catalog API by @peasee in #10109
fix: Update default query memory limit to 90% from 70% by @phillipleblanc in #10112
feat: Add mTLS client auth support to spice sql REPL by @lukekim in #10113
fix(datafusion-federation): report error on overflow instead of silent NULL by @sgrebnov in #10124
fix: Prevent data loss in MERGE when source has duplicate keys by @peasee in #10126
feat: Add ClickHouse Date32 type support by @sgrebnov in #10132
Add Delta Lake column mapping support (Name/Id modes) by @sgrebnov in #10134
fix: Restore Turso numeric BETWEEN rewrite lost in DML revert by @lukekim in #10139
fix: Enable arm64 Linux builds with fp16 and lld workarounds by @lukekim in #10142
fix: remove double trailing slash in Unity Catalog storage locations by @sgrebnov in #10147
fix: Improve GitHub GraphQL client resilience and performance by @lukekim in #10151
Enable reqwest compression and optimize HTTP client settings by @lukekim in #10154
fix: executor startup failures by @Jeadie in #10155
feat: Distributed runtime.task_history support by @Jeadie in #10156
fix: Preserve timestamp timezone in DDL forwarding to executors by @peasee in #10159
feat: Per-model rate-limited concurrent AI UDF execution by @Jeadie in #10160
fix(Turso): Reject subquery/outer-ref filter pushdown in Turso provider by @lukekim in #10174
Fix linux/macos spice upgrade by @phillipleblanc in #10194
Improve CREATE TABLE LIKE error messages, success output, EXPLAIN, and validation by @peasee in #10203
fix: chunk MERGE delete filters and update Vortex for stack-safe IN-lists by @peasee in #10207
Propagate runtime.params.parquet_page_index to Delta Lake connector by @sgrebnov in #10209
Properly mark dataset as Ready on Scheduler by @Jeadie in #10215
fix: handle Utf8View/LargeUtf8 in GitHub connector ref filters by @lukekim in #10217
fix(databricks): Fix schema introspection and timestamp overflow by @lukekim in #10226
fix(databricks): Fix schema introspection failures for non-Unity-Catalog environments by @lukekim in #10227
feat: Add pagination support to HTTP data connector by @lukekim in #10228
feat(databricks): DESCRIBE TABLE fallback and source-native type parsing for Lakehouse Federation by @lukekim in #10229
fix(databricks): harden HTTP retries, compression, and token refresh by @lukekim in #10232
feat[helm chart]: Add support for ServiceAccount annotations and AWS IRSA example by @peasee in #9833
fix: Log warning and fall back gracefully on Cayenne config change by @krinart in #9092
fix: Handle engine mismatch gracefully in snapshot fallback loop by @krinart in #9187

Full Changelog: https://github.com/spiceai/spiceai/compare/v2.0.0-rc.1...v2.0.0-rc.2

Spice v1.11.0-rc.1 (Jan 6, 2026)

January 7, 2026 · 17 min read

Evgenii Khramkov

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.11.0-rc.1! ⭐

v1.11.0-rc.1 is the first release candidate for early testing of v1.11 features including Distributed Query with mTLS for enterprise-grade secure cluster communication, new SMB and NFS Data Connectors for direct network-attached storage access, Prepared Statements for improved query performance and security, Cayenne Accelerator Enhancements with Key-based deletion vectors and Amazon S3 Express One Zone support, Google LLM Support for expanded AI inference capabilities, and Spice Java SDK v0.5.0 with parameterized query support.

What's New in v1.11.0-rc.1

Distributed Query with mTLS

Enterprise-Grade Secure Cluster Communication: Distributed query cluster mode now enables mutual TLS (mTLS) by default for secure communication between schedulers and executors. Internal cluster communication includes highly privileged RPC calls like fetching Spicepod configuration and expanding secrets. mTLS ensures only authenticated nodes can join the cluster and access sensitive data.

Key Features:

Mutual TLS Authentication: All executor-to-scheduler and executor-to-executor gRPC connections on the internal cluster port (50052) are secured with mTLS, securing communication, and preventing unauthorized nodes from joining the cluster
Certificate Management CLI: New developer spice cluster tls init and spice cluster tls add commands for generating CA certificates and node certificates with proper SANs (Subject Alternative Names)
Simplified CLI Arguments: Renamed cluster arguments for clarity (--role, --scheduler-address, --node-mtls-*) with --scheduler-address implying --role executor
Port Separation: Public services (Flight queries, HTTP API, Prometheus metrics) remain on ports 50051, 8090, and 9090 respectively, while internal cluster services (SchedulerGrpcServer, ClusterService) are isolated on port 50052 with mTLS enforced
Development Mode: Use --allow-insecure-connections flag to disable mTLS requirement for local development and testing

Quick Start:

# Generate certificates for development
spice cluster tls init
spice cluster tls add scheduler1
spice cluster tls add executor1

# Start scheduler
spiced --role scheduler \
  --node-mtls-ca-certificate-file ca.crt \
  --node-mtls-certificate-file scheduler1.crt \
  --node-mtls-key-file scheduler1.key

# Start executor
spiced --role executor \
  --scheduler-address https://scheduler1:50052 \
  --node-mtls-ca-certificate-file ca.crt \
  --node-mtls-certificate-file executor1.crt \
  --node-mtls-key-file executor1.key

For more details, refer to the Distributed Query Documentation.

SMB and NFS Data Connectors

Network-Attached Storage Connectors: New data connectors for SMB (Server Message Block) and NFS (Network File System) protocols enable direct federated queries against network-attached storage without requiring data movement to cloud object stores.

Key Features:

SMB Protocol Support: Connect to Windows file shares and Samba servers with authentication support
NFS Protocol Support: Connect to Unix/Linux NFS exports for direct data access
Federated Queries: Query Parquet, CSV, JSON, and other file formats directly from network storage with full SQL support
Acceleration Support: Accelerate data from SMB/NFS sources using DuckDB, Spice Cayenne, or other accelerators

Example spicepod.yaml configuration:

datasets:
  # SMB share
  - from: smb://fileserver/share/data.parquet
    name: smb_data
    params:
      smb_username: ${secrets:SMB_USER}
      smb_password: ${secrets:SMB_PASS}

  # NFS export
  - from: nfs://nfsserver/export/data.parquet
    name: nfs_data

For more details, refer to the Data Connectors Documentation.

Prepared Statements

Improved Query Performance and Security: Spice now supports prepared statements, enabling parameterized queries that improve both performance through query plan caching and security by preventing SQL injection attacks.

Key Features:

Query Plan Caching: Prepared statements cache query plans, reducing planning overhead for repeated queries
SQL Injection Prevention: Parameters are safely bound, preventing SQL injection vulnerabilities
Arrow Flight SQL Support: Full prepared statement support via Arrow Flight SQL protocol

SDK Support:

SDK	Support	Min Version	Method
gospice (Go)	✅ Full	v8.0.0+	`SqlWithParams()` with typed constructors (`Int32Param`, `StringParam`, `TimestampParam`, etc.)
spice-rs (Rust)	✅ Full	v3.0.0+	`query_with_params()` with `RecordBatch` parameters
spice-dotnet (.NET)	❌ Not yet	-	Coming soon
spice-java (Java)	✅ Full	v0.5.0+	`queryWithParams()` with typed `Param` constructors (`Param.int64()`, `Param.string()`, etc.)
spice.js (JavaScript)	❌ Not yet	-	Coming soon
spicepy (Python)	❌ Not yet	-	Coming soon

Example (Go):

import "github.com/spiceai/gospice/v8"

client, _ := spice.NewClient()
defer client.Close()

// Parameterized query with typed parameters
results, _ := client.SqlWithParams(ctx,
    "SELECT * FROM products WHERE price > $1 AND category = $2",
    spice.Float64Param(10.0),
    spice.StringParam("electronics"),
)

Example (Java):

import ai.spice.SpiceClient;
import ai.spice.Param;
import org.apache.arrow.adbc.core.ArrowReader;

try (SpiceClient client = new SpiceClient()) {
    // With automatic type inference
    ArrowReader reader = client.queryWithParams(
        "SELECT * FROM products WHERE price > $1 AND category = $2",
        10.0, "electronics");

    // With explicit typed parameters
    ArrowReader reader = client.queryWithParams(
        "SELECT * FROM products WHERE price > $1 AND category = $2",
        Param.float64(10.0),
        Param.string("electronics"));
}

For more details, refer to the Parameterized Queries Documentation.

Spice Cayenne Accelerator Enhancements

The Spice Cayenne data accelerator has been improved with several key enhancements:

KeyBased Deletion Vectors: Improved deletion vector support using key-based lookups for more efficient data management and faster delete operations. KeyBased deletion vectors are more memory-efficient than positional vectors for sparse deletions.
S3 Express One Zone Support: Store Cayenne data files in S3 Express One Zone for single-digit millisecond latency, ideal for latency-sensitive query workloads that require persistence.

Example spicepod.yaml configuration:

datasets:
  - from: s3://my-bucket/data.parquet
    name: fast_data
    acceleration:
      enabled: true
      engine: cayenne
      mode: file
      params:
        # Use S3 Express One Zone for data files
        cayenne_s3express_bucket: my-express-bucket--usw2-az1--x-s3

For more details, refer to the Cayenne Documentation.

Google LLM Support

Expanded AI Provider Support: Spice now supports Google embedding and chat models via the Google AI provider, expanding the available LLM options for AI inference workloads alongside existing providers like OpenAI, Anthropic, and AWS Bedrock.

Key Features:

Google Chat Models: Access Google's Gemini models for chat completions
Google Embeddings: Generate embeddings using Google's text embedding models
Unified API: Use the same OpenAI-compatible API endpoints for all LLM providers

Example spicepod.yaml configuration:

models:
  - from: google:gemini-2.0-flash
    name: gemini
    params:
      google_api_key: ${secrets:GOOGLE_API_KEY}

embeddings:
  - from: google:text-embedding-004
    name: google_embeddings
    params:
      google_api_key: ${secrets:GOOGLE_API_KEY}

For more details, refer to the Google LLM Documentation (see docs PR #1286).

Spice Java SDK v0.5.0

Parameterized Query Support for Java: The Spice Java SDK v0.5.0 introduces parameterized queries using ADBC (Arrow Database Connectivity), providing a safer and more efficient way to execute queries with dynamic parameters.

Key Features:

SQL Injection Prevention: Parameters are safely bound, preventing SQL injection vulnerabilities
Automatic Type Inference: Java types are automatically mapped to Arrow types (e.g., double → Float64, String → Utf8)
Explicit Type Control: Use the new Param class with typed factory methods (Param.int64(), Param.string(), Param.decimal128(), etc.) for precise control over Arrow types
Updated Dependencies: Apache Arrow Flight SQL upgraded to 18.3.0, plus new ADBC driver support

Example:

import ai.spice.SpiceClient;
import ai.spice.Param;

try (SpiceClient client = new SpiceClient()) {
    // With automatic type inference
    ArrowReader reader = client.queryWithParams(
        "SELECT * FROM taxi_trips WHERE trip_distance > $1 LIMIT 10",
        5.0);

    // With explicit typed parameters for precise control
    ArrowReader reader = client.queryWithParams(
        "SELECT * FROM orders WHERE order_id = $1 AND amount >= $2",
        Param.int64(12345),
        Param.decimal128(new BigDecimal("99.99"), 10, 2));
}

Maven:

<dependency>
  <groupId>ai.spice</groupId>
  <artifactId>spiceai</artifactId>
  <version>0.5.0</version>
</dependency>

For more details, refer to the Spice Java SDK Repository.

OpenTelemetry Improvements

Unified Telemetry Endpoint: OTel metrics ingestion has been consolidated to the Flight port (50051), simplifying deployment by removing the separate OTel port (50052). The push-based metrics exporter continues to support integration with OpenTelemetry collectors.

Note: This is a breaking change. Update your configurations if you were using the dedicated OTel port 50052. Internal cluster communication now uses port 50052 exclusively.

Developer Experience Improvements

Turso v0.3.2 Upgrade: Upgraded Turso accelerator for improved performance and reliability
Rust 1.91 Upgrade: Updated to Rust 1.91 for latest language features and performance improvements
Spice Cloud CLI: Added spice cloud CLI commands for cloud deployment management
Improved Spicepod Schema: Enhanced JSON schema generation for better IDE support and validation
Acceleration Snapshots: Added configurable snapshots_create_interval for periodic acceleration snapshots independent of refresh cycles
Tiered Caching with Localpod: The Localpod connector now supports caching refresh mode, enabling multi-layer acceleration where a persistent cache feeds a fast in-memory cache
GitHub Data Connector: Added workflows and workflow runs support for GitHub repositories
NDJSON/LDJSON Support: Added support for Newline Delimited JSON and Line Delimited JSON file formats

Additional Improvements & Bug Fixes

Reliability: Fixed DynamoDB IAM role authentication with new dynamodb_auth: iam_role parameter
Reliability: Fixed cluster executors to use scheduler's temp_directory parameter for shuffle files
Reliability: Initialize secrets before object stores in cluster executor mode
Reliability: Added page-level retry with backoff for transient GitHub GraphQL errors
Performance: Improved statistics for rewritten DistributeFileScanOptimizer plans
Developer Experience: Added max_message_size configuration for Flight service

Contributors

Breaking Changes

OTel Ingestion Port Change

OTel ingestion has been moved to the Flight port (50051), removing the separate OTel port 50052. Port 50052 is now used exclusively for internal cluster communication. Update your configurations if you were using the dedicated OTel port.

Distributed Query Cluster Mode Requires mTLS

Distributed query cluster mode now requires mTLS for secure communication between cluster nodes. This is a security enhancement to prevent unauthorized nodes from joining the cluster and accessing secrets.

Migration Steps:

Generate certificates using spice cluster tls init and spice cluster tls add
Update scheduler and executor startup commands with --node-mtls-* arguments
For development/testing, use --allow-insecure-connections to opt out of mTLS

Renamed CLI Arguments:

Old Name	New Name
`--cluster-mode`	`--role`
`--cluster-ca-certificate-file`	`--node-mtls-ca-certificate-file`
`--cluster-certificate-file`	`--node-mtls-certificate-file`
`--cluster-key-file`	`--node-mtls-key-file`
`--cluster-address`	`--node-bind-address`
`--cluster-advertise-address`	`--node-advertise-address`
`--cluster-scheduler-url`	`--scheduler-address`

Removed CLI Arguments:

--cluster-api-key: Replaced by mTLS authentication

Cookbook Updates

No major cookbook updates.

The Spice Cookbook includes 84 recipes to help you get started with Spice quickly and easily.

Upgrading

To try v1.11.0-rc.1, use one of the following methods:

CLI:

spice upgrade --version 1.11.0-rc.1

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.11.0-rc.1 image:

docker pull spiceai/spiceai:1.11.0-rc.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai --version 1.11.0-rc.1

AWS Marketplace:

🎉 Spice is available in the AWS Marketplace!

What's Changed

Changelog

OTel exporter for push metrics by @lukekim in #8442
fix: Update benchmark snapshots by @app/github-actions in #8448
Add TPCH append tests to scheduled dispatch workflow by @sgrebnov in #8451
Add snapshot creation logging by @krinart in #8469
Fix PeriodicReader panic by @krinart in #8471
Benchmarks: increase readiness timeout for turso acceleration (TPC-H) by @sgrebnov in #8470
fix: Pin CUDA build actions to commits by @peasee in #8477
Add Criterion benchmarking to chunking crate. by @Jeadie in #8431
DuckDB agg pushdown: gate behind accelerator parameter by @mach-kernel in #8474
Rename aggregate_pushdown_optimization -> optimizer_duckdb_aggregate_pushdown by @ewgenius in #8485
Handle throttling exception for DynamoDB streams by @phillipleblanc in #8492
docs: Add release notes by @peasee in #8478
Update spicepod.schema.json by @app/github-actions in #8496
Move 'test_projection_pushdown' to runtime-datafusion by @Jeadie in #8490
Fix OTEL metrics HTTP exporter client setup by @phillipleblanc in #8489
Update endgame to include new caching accelerator cookbook by @phillipleblanc in #8487
DynamoDB tests and fixes by @lukekim in #8491
Align make lint-rust-fix with make lint-rust by @Jeadie in #8499
fix: Remove unused Cayenne parameters by @peasee in #8500
Force task history captured_plan outputs to be captured even if they would be filtered out otherwise by @phillipleblanc in #8501
release: post-release updates by @peasee in #8503
CI: Fix E2E models dispatch by @mach-kernel in #8505
Use an isolated Tokio runtime for refresh tasks that is separate from the main query API by @phillipleblanc in #8504
Update openapi.json by @app/github-actions in #8512
Update dependencies by @phillipleblanc in #8513
fix: Avoid double hashing cache key by @peasee in #8511
fix: Eagerly drop cached records for results larger than max by @peasee in #8516
Revert "fix: Move enforce-pulls to hosted runner (#8686)" by @phillipleblanc in #8709
Initial 'testoperator run text-to-sql' by @Jeadie in #8618
Add support for abfss by @krinart in #8706
Add testoperator TPCH dispatch for ABFS with hierarchical namespace disabled + versioning enabled by @phillipleblanc in #8711
Update openapi.json by @app/github-actions in #8692
cluster: validate --role argument by @phillipleblanc in #8717
Upgrade to Turso v0.3.2 by @lukekim in #8716
Rename --insecure to --allow-insecure-connections to be consistent with existing naming by @lukekim in #8720
Remove 'testoperator run http-consistency/http-overhead' by @Jeadie in #8708
refactor: Remove cluster feature flag by @phillipleblanc in #8718
Docs: Distributed query ADR by @mach-kernel in #8608
Use model.datasets to allowlist on tools by @Jeadie in #8714
cluster: quality of life improvements to starting cluster mode locally by @phillipleblanc in #8719
Docs: Ballista extension ADR by @mach-kernel in #8616
Improve deprecation messages when going from prefixed -> non-prefixed. by @Jeadie in #8724
Remove tools from auto-defaults by @Jeadie in #8725
Make distinct providers for vector spilling, vector partitioning. by @Jeadie in #8546
cluster: default scheduler address port by @phillipleblanc in #8728
Add Makefile targets for testoperator by @Jeadie in #8729
text-to-sql dispatch in testoperator by @Jeadie in #8705
DR-006: High Availability Distributed Query with Stateless Schedulers by @lukekim in #8721
DR-007: mTLS for Distributed Query Cluster Communication by @lukekim in #8722
SMB and NFS improvements by @lukekim in #8710
fix: Cluster executors use scheduler's temp_directory for shuffle files by @phillipleblanc in #8733
use 'max_message_size' in flight service too by @Jeadie in #8730
Add page-level retry for transient GraphQL errors with backoff and increase GitHub rate limit buffer up to 100 by @ewgenius in #8726
Make testoperator Dockerfile; CI to build docker image to ghcr.io. by @Jeadie in #8732
cluster: UnionProjectionPushdownOptimizer: Add projection pushdown diagnostics for union children by @phillipleblanc in #8734
Fix column projection order mismatch with location metadata columns by @phillipleblanc in #8738
Fixes for testoperator. by @Jeadie in #8737
Improve Cayenne Deletion Vectors with KeyBased support by @lukekim in #8713
Fix testoperator_dispatch.yaml by @Jeadie in #8740
Add spice cloud CLI commands by @lukekim in #8528
Add FTP, NFS, & SMB TPCH SF1 spicepods by @lukekim in #8739
Prepared Statements by @lukekim in #7588
Schedule dispatch of testoperator run text-to-sql. by @Jeadie in #8745
Fix minio for ai benchmark CI by @Jeadie in #8743
Upgrade to Rust 1.91 by @phillipleblanc in #8749
fix: Update benchmark snapshots by @app/github-actions in #8763
Benchmarks: make row count validation skip logic configurable by scale factor, query set, and overrides by @sgrebnov in #8756
Make benchmark tests more robust by @sgrebnov in #8766
Add parameter to force using iam_role for DynamoDB by @krinart in #8767
fix: Update Search integration test snapshots by @app/github-actions in #8735
v1.10.4 release notes by @phillipleblanc in #8790
Trace metrics export errors by @sgrebnov in #8791
fix: correctly identify deprecated openai_* parameters by @phillipleblanc in #8809
Don't CAST strings which breaks push down optimizer by @lukekim in #8810
Add timezone database to Docker image to fix Cayenne acceleration panic by @sgrebnov in #8799
Update async-openai to latest revision 4dcd633aad6f - brings fix for openai compatible model providers by @ewgenius in #8816
Add auth/iam_role_source to DynamoDB connector by @krinart in #8808
DynamoDB fixes: JSON nesting for Streams, proper batch deletions by @krinart in #8821

Spice v1.5.0 (July 21, 2025)

July 22, 2025 · 14 min read

Evgenii Khramkov

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.5.0! 🔍

Spice v1.5.0 brings major upgrades to search and retrieval. It introduces native support for Amazon S3 Vectors, enabling petabyte scale vector search directly from S3 vector buckets, alongside SQL-integrated vector and tantivy-powered full-text search, partitioning for DuckDB acceleration, and automated refreshes for search indexes and views. It includes the AWS Bedrock Embeddings Model Provider, the Oracle Database connector, and the now-stable Spice.ai Cloud Data Connector, and the upgrade to DuckDB v1.3.2.

What's New in v1.5.0

Amazon S3 Vectors Support: Spice.ai now integrates with Amazon S3 Vectors, launched in public preview on July 15, 2025, enabling vector-native object storage with built-in indexing and querying. This integration supports semantic search, recommendation systems, and retrieval-augmented generation (RAG) at petabyte scale with S3’s durability and elasticity. Spice.ai manages the vector lifecycle—ingesting data, creating embeddings with models like Amazon Titan or Cohere via AWS Bedrock, or others available on HuggingFace, and storing it in S3 Vector buckets.

Spice integration with Amazon S3 Vectors

Example Spicepod.yml configuration for S3 Vectors:

datasets:
  - from: s3://my_data_bucket/data/
    name: my_vectors
    params:
      file_format: parquet
    acceleration:
      enabled: true
    vectors:
      engine: s3_vectors
      params:
        s3_vectors_aws_region: us-east-2
        s3_vectors_bucket: my-s3-vectors-bucket
    columns:
      - name: content
        embeddings:
          - from: bedrock_titan
            row_id:
              - id

Example SQL query using S3 Vectors:

SELECT *
FROM vector_search(my_vectors, 'Cricket bats', 10)
WHERE price < 100
ORDER BY score

For more details, refer to the S3 Vectors Documentation.

SQL-integrated Search: Vector and BM25-scored full-text search capabilities are now natively available in SQL queries, extending the power of the POST v1/search endpoint to all SQL workflows.

Example Vector-Similarity-Search (VSS) using the vector_search UDTF on the table reviews for the search term "Cricket bats":

SELECT review_id, review_text, review_date, score
FROM vector_search(reviews, "Cricket bats")
WHERE country_code="AUS"
LIMIT 3

Example Full-Text-Search (FTS) using the text_search UDTF on the table reviews for the search term "Cricket bats":

SELECT review_id, review_text, review_date, score
FROM text_search(reviews, "Cricket bats")
LIMIT 3

DuckDB v1.3.2 Upgrade: Upgraded DuckDB engine from v1.1.3 to v1.3.2. Key improvements include support for adding primary keys to existing tables, resolution of over-eager unique constraint checking for smoother inserts, and 13% reduced runtime on TPC-H SF100 queries through extensive optimizer refinements. The v1.2.x release of DuckDB was skipped due to a regression in indexes.

Read the DuckDB v1.2.0 announcement.
Read the DuckDB v1.3.0 announcement.

Partitioned Acceleration: DuckDB file-based accelerations now support partition_by expressions, enabling queries to scale to large datasets through automatic data partitioning and query predicate pruning. New UDFs, bucket and truncate, simplify partition logic.

New UDFs useful for partition_by expressions:

bucket(num_buckets, col): Partitions a column into a specified number of buckets based on a hash of the column value.
truncate(width, col): Truncates a column to a specified width, aligning values to the nearest lower multiple (e.g., truncate(10, 101) = 100).

Example Spicepod.yml configuration:

datasets:
  - from: s3://my_bucket/some_large_table/
    name: my_table
    params:
      file_format: parquet
    acceleration:
      enabled: true
      engine: duckdb
      mode: file
      partition_by: bucket(100, account_id) # Partition account_id into 100 buckets

Full-Text-Search (FTS) Index Refresh: Accelerated datasets with search indexes maintain up-to-date results with configurable refresh intervals.

Example refreshing search indexes on body every 10 seconds:

datasets:
  - from: github:github.com/spiceai/docs/pulls
    name: spiceai.doc.pulls
    params:
      github_token: ${secrets:GITHUB_TOKEN}
    acceleration:
      enabled: true
      refresh_mode: full
      refresh_check_interval: 10s
    columns:
      - name: body
        full_text_search:
          enabled: true
          row_id:
            - id

Scheduled View Refresh: Accelerated Views now support cron-based refresh schedules using refresh_cron, automating updates for accelerated data.

Example Spicepod.yml configuration:

views:
  - name: my_view
    sql: SELECT 1
    acceleration:
      enabled: true
      refresh_cron: '0 * * * *' # Every hour

For more details, refer to Scheduled Refreshes.

Multi-column Vector Search: For datasets configured with embeddings on more than one column, POST v1/search and similarity_search perform parallel vector search on each column, aggregating results using reciprocal rank fusion.

Example Spicepod.yml for multi-column search:

datasets:
  - from: github:github.com/apache/datafusion/issues
    name: datafusion.issues
    params:
      github_token: ${secrets:GITHUB_TOKEN}
    columns:
      - name: title
        embeddings:
          - from: hf_minilm
      - name: body
        embeddings:
          - from: openai_embeddings

AWS Bedrock Embeddings Model Provider: Added support for AWS Bedrock embedding models, including Amazon Titan Text Embeddings and Cohere Text Embeddings.

Example Spicepod.yml:

embeddings:
  - from: bedrock:cohere.embed-english-v3
    name: cohere-embeddings
    params:
      aws_region: us-east-1
      input_type: search_document
      truncate: END
  - from: bedrock:amazon.titan-embed-text-v2:0
    name: titan-embeddings
    params:
      aws_region: us-east-1
      dimensions: '256'

For more details, refer to the AWS Bedrock Embedding Models Documentation.

Oracle Data Connector: Use from: oracle: to access and accelerate data stored in Oracle databases, deployed on-premises or in the cloud.

Example Spicepod.yml:

datasets:
  - from: oracle:"SH"."PRODUCTS"
    name: products
    params:
      oracle_host: 127.0.0.1
      oracle_username: scott
      oracle_password: tiger

See the Oracle Data Connector documentation.

GitHub Data Connector: The GitHub data connector supports query and acceleration of members, the users of an organization.

Example Spicepod.yml configuration:

datasets:
  - from: github:github.com/spiceai/members # General format: github.com/[org-name]/members
    name: spiceai.members
    params:
      # With GitHub Apps (recommended)
      github_client_id: ${secrets:GITHUB_SPICEHQ_CLIENT_ID}
      github_private_key: ${secrets:GITHUB_SPICEHQ_PRIVATE_KEY}
      github_installation_id: ${secrets:GITHUB_SPICEHQ_INSTALLATION_ID}
      # With GitHub Tokens
      # github_token: ${secrets:GITHUB_TOKEN}

See the GitHub Data Connector Documentation

Spice.ai Cloud Data Connector: Graduated to Stable.

spice-rs SDK Release: The Spice Rust SDK has updated to v3.0.0. This release includes optimizations for the Spice client API, adds robust query retries, and custom metadata configurations for spice queries.

Contributors

Breaking Changes

Search HTTP API Response: POST v1/search response payload has changed. See the new API documentation for details.
Model Provider Parameter Prefixes: Model Provider parameters use provider-specific prefixes instead of openai_ prefixes (e.g., hf_temperature for HuggingFace, anthropic_max_completion_tokens for Anthropic, perplexity_tool_choice for Perplexity). The openai_ prefix remains supported for backward compatibility but is deprecated and will be removed in a future release.

Cookbook Updates

Added Oracle Data Connector cookbook: Connect to tables in Oracle databases.
Added Hashed Partitioning with DuckDB cookbook: Accelerate data on large datasets by partitioning data into a fixed number of buckets.

The Spice Cookbook now includes 72 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.5.0, download and install the specific binary from github.com/spiceai/spiceai/releases/tag/v1.5.0 or pull the v1.5.0 Docker image (spiceai/spiceai:1.5.0).

What's Changed

Dependencies

delta_kernel: Upgraded to v0.12.1
DuckDB: Upgraded from v1.1.3 to v1.3.2
iceberg-rust: Upgraded from v0.4.0 to v0.5.1

Changelog

fix: openai model endpoint (#6394) by @Sevenannn in #6394
Enable configuring otel endpoint from spice run (#6360) by @Advayp in #6360
Enable Oracle connector in default build configuration (#6395) by @sgrebnov in #6395
fix llm integraion test (#6398) by @Sevenannn in #6398
Promote spice cloud connector to stable quality (#6221) by @Sevenannn in #6221
v1.5.0-rc.1 release notes (#6397) by @lukekim in #6397
Fix model nsql integration tests (#6365) by @Sevenannn in #6365
Fix incorrect UDTF name and SQL query (#6404) by @lukekim in #6404
Update v1.5.0-rc.1.md (#6407) by @sgrebnov in #6407
Improve error messages (#6405) by @lukekim in #6405
build(deps): bump Jimver/cuda-toolkit from 0.2.25 to 0.2.26 (#6388) by @app/dependabot in #6388
Upgrade dependabot dependencies (#6411) by @phillipleblanc in #6411
Fix projection pushdown issues for document based file connector (#6362) by @Advayp in #6362
Add a PartitionedDuckDB Accelerator (#6338) by @kczimm in #6338
Use vector_search() UDTF in HTTP APIs (#6417) by @Jeadie in #6417
add supported types (#6409) by @kczimm in #6409
Enable session time zone override for MySQL (#6426) by @sgrebnov in #6426
Acceleration-like indexing for full text search indexes. (#6382) by @Jeadie in #6382
Provide error message when partition by expression changes (#6415) by @kczimm in #6415
Add support for Oracle Autonomous Database connections (Oracle Cloud) (#6421) by @sgrebnov in #6421
prune partitions for exact and in list with and without UDFs (#6423) by @kczimm in #6423
Fixes and reenable FTS tests (#6431) by @Jeadie in #6431
Upgrade DuckDB to 1.3.2 (#6434) by @phillipleblanc in #6434
Fix issue in limit clause for the Github Data connector (#6443) by @Advayp in #6443
Upgrade iceberg-rust to 0.5.1 (#6446) by @phillipleblanc in #6446
v1.5.0-rc.2 release notes (#6440) by @lukekim in #6440
Oracle: add automated TPC-H SF1 benchmark tests (#6449) by @sgrebnov in #6449
fix: Update benchmark snapshots (#6455) by @app/github-actions in #6455
Preserve ArrowError in arrow_tools::record_batch (#6454) by @mach-kernel in #6454
fix: Update benchmark snapshots (#6465) by @app/github-actions in #6465
Add option to preinstall Oracle ODPI-C library in Docker image (#6466) by @sgrebnov in #6466
Include Oracle connector (federated mode) in automated benchmarks (#6467) by @sgrebnov in #6467
Update crates/llms/src/bedrock/embed/mod.rs by @lukekim in #6468
v1.5.0-rc.3 release notes (#6474) by @lukekim in #6474
Add integration tests for S3 Vectors filters pushdown (#6469) by @sgrebnov in #6469
check for indexedtableprovider when finding tables to search on (#6478) by @Jeadie in #6478
Parse fully qualified table names in UDTFs (#6461) by @Jeadie in #6461
Add integration test for S3 Vectors to cover data update (overwrite) (#6480) by @sgrebnov in #6480
Add 'Run all tests' option for models tests and enable Bedrock tests (#6481) by @sgrebnov in #6481
Add support for a members table type for the GitHub Data Connector (#6464) by @Advayp in #6464
S3 vector data cannot be null (#6483) by @Jeadie in #6483
Don't infer FixedSizeList size during indexing vectors. (#6487) by @Jeadie in #6487
Add support for retention_sql acceleration param (#6488) by @sgrebnov in #6488
Make dataset refresh progress tracing less verbose (#6489) by @sgrebnov in #6489
Use RwLock on tantivy index in FullTextDatabaseIndex for update concurrency (#6490) by @Jeadie in #6490
Add tests for dataset retention logic and refactor retention code (#6495) by @sgrebnov in #6495
Upgade dependabot dependencies (#6497) by @phillipleblanc in #6497
Add periodic tracing of data loading progress during dataset refresh (#6499) by @sgrebnov in #6499
Promote Oracle Data Connector to Alpha (#6503) by @sgrebnov in #6503
Use AWS SDK to provide credentials for Iceberg connectors (#6498) by @phillipleblanc in #6498
Add integration tests for partitioning (#6463) by @kczimm in #6463
Use top-level table in full-text search JOIN ON (#6491) by @Jeadie in #6491
Use accelerated table in vector_search JOIN operations when appropriate (#6516) by @Jeadie in #6516
Fix 'additional_column' for quoted columns (fix for qualified columns broke it) (#6512) by @Jeadie in #6512
Also use AWS SDK for inferring credentials for S3/Delta/Databricks Delta data connectors (#6504) by @phillipleblanc in #6504
Add per-dataset availability monitor configuration (#6482) by @phillipleblanc in #6482
Suppress the warning from the AWS SDK if it can't load credentials (#6533) by @phillipleblanc in #6533
Change default value of check_availability from default to auto (#6534) by @lukekim in #6534
README.md improvements for v1.5.0 (#6539) by @lukekim in #6539
Temporary disable s3_vectors_basic (#6537) by @sgrebnov in #6537
Ensure binder errors show before query and other (#6374) by @suhuruli in #6374
Update spiceai/duckdb-rs -> DuckDB 1.3.2 + index fix (#6496) by @mach-kernel in #6496
Update table-providers to latest version with DuckDB fixes (#6535) by @phillipleblanc in #6535
S3: default to public access if no auth is provided (#6532) by @sgrebnov in #6532

Spice v1.2.0 (Apr 28, 2025)

April 29, 2025 · 16 min read

Evgenii Khramkov

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.2.0! 🚀

Spice v1.2.0 is a significant update. It upgrades DataFusion to v45 and Arrow to v54. This release brings faster query performance, support for parameterized queries in SQL and HTTP APIs, and the ability to accelerate views. Several bugs have been fixed and dependencies updated for better stability and speed.

DataFusion v45 Highlights

Spice.ai is built on the DataFusion query engine. The v45 release brings:

Faster Performance 🚀: DataFusion is now the fastest single-node engine for Apache Parquet files in the clickbench benchmark. Performance improved by over 33% from v33 to v45. Arrow StringView is now on by default, making string and binary data queries much faster, especially with Parquet files.
Better Quality 📋: DataFusion now runs over 5 million SQL tests per push using the SQLite sqllogictest suite. There are new checks for logical plan correctness and more thorough pre-release testing.
New SQL Functions ✨: Added show functions, to_local_time, regexp_count, map_extract, array_distance, array_any_value, greatest, least, and arrays_overlap.

See the DataFusion 45.0.0 release notes for details.

Spice.ai upgrades to the latest minus one DataFusion release to ensure adequate testing and stability. The next upgrade to DataFusion v46 is planned for Spice v1.3.0 in May.

What's New in v1.2.0

Parameterized Queries: Parameterized queries are now supported with the Flight SQL API and HTTP API. Positional and named arguments via $1 and :param syntax are supported, respectively. Logical plans for SQL statements are cached for faster repeated queries.

Example Cookbook recipes:
See the API Documentation for additional details.

Accelerated Views: Views, not just datasets, can now be accelerated. This provides much better performance for views that perform heavy computation.

Example spicepod.yaml:

views:
  - name: accelerated_view
    acceleration:
      enabled: true
      engine: duckdb
      primary_key: id
      refresh_check_interval: 1h
    sql: |
      select * from dataset_a
      union all
      select * from dataset_b

See the Data Acceleration documentation.

Memory Usage Metrics & Configuration: Runtime now tracks memory usage as a metric, and a new runtime memory_limit parameter is available. The memory limit parameter applies specifically to the runtime and should be used in addition to existing memory usage configuration, such as duckdb_memory_limit. Memory usage for queries beyond the memory limit will spill to disk.

See the Memory Reference for details.

New Worker Component: Workers are new configurable compute units in the Spice runtime. They help manage compute across models and tools, handle errors, and balance load. Workers are configured in the workers section of spicepod.yaml.

Example spicepod.yaml:

workers:
  - name: round-robin
    description: |
      Distributes requests between 'foo' and 'bar' models in a round-robin fashion.
    models:
      - from: foo
      - from: bar
  - name: fallback
    description: |
      Tries 'bar' first, then 'foo', then 'baz' if earlier models fail.
    models:
      - from: foo
        order: 2
      - from: bar
        order: 1
      - from: baz
        order: 3

See the Workers Documentation for details.

Databricks Model Provider: Databricks models can now be used with from: databricks:model_name.

Example spicepod.yaml:

models:
  - from: databricks:llama-3_2_1_1b_instruct
    name: llama-instruct
    params:
      databricks_endpoint: dbc-46470731-42e5.cloud.databricks.com
      databricks_token: ${ secrets:SPICE_DATABRICKS_TOKEN }

See the Databricks model documentation.

spice chat CLI Improvements: The spice chat command now supports an optional --temperature parameter. A one-shot chat can also be sent with spice chat <message>.
More Type Support: Added support for Postgres JSON type and DuckDB Dictionary type.
Other Improvements:
- New image tags let you pick memory allocators for different use-cases: jemalloc, sysalloc, and mimalloc.
- Better error handling and logging for chat and model operations.

Contributors

Cookbook Updates

New recipes for:

Python ADBC Client with Parameterized Queries: Using Parameterized Queries from Python over ADBC.
Java JDBC Client with Parameterized Queries: Using Parameterized Queries from Java over JDBC.
Scala JDBC Client with Parameterized Queries: Using Parameterized Queries from Scala over JDBC.

The Spice Cookbook now includes 68 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.2.0, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.2.0 image:

docker pull spiceai/spiceai:1.2.0

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

What's Changed

Dependencies

DataFusion: upgraded to v45.
Apache Arrow: Upgraded to v54.3.0.

Spice is now built with Rust 1.85.0 and Rust 2024.

Changelog

- Update end_game.md (#5312) by @peasee in https://github.com/spiceai/spiceai/pull/5312
- feat: Add initial testoperator query validation (#5311) by @peasee in https://github.com/spiceai/spiceai/pull/5311
- Update Helm + Prepare for next release (#5317) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5317
- Update spicepod.schema.json (#5319) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5319
- add integration test for reading encrypted PDFs from S3 (#5308) by @kczimm in https://github.com/spiceai/spiceai/pull/5308
- Stop `load_components` during runtime shutdown (#5306) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5306
- Update openapi.json (#5321) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5321
- feat: Implement record batch data validation (#5331) by @peasee in https://github.com/spiceai/spiceai/pull/5331
- Update QA analytics for v1.1.1 (#5320) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5320
- fix: Update benchmark snapshots (#5337) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5337
- Enforce pulls with Spice v1.0.4 (#5339) by @lukekim in https://github.com/spiceai/spiceai/pull/5339
- Upgrade to DataFusion 45, Arrow 54, Rust 1.85 & Edition 2024 (#5334) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5334
- feat: Allow validating testoperator in benchmark workflow (#5342) by @peasee in https://github.com/spiceai/spiceai/pull/5342
- Upgrade `delta_kernel` to 0.9 (#5343) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5343
- deps: Update odbc-api (#5344) by @peasee in https://github.com/spiceai/spiceai/pull/5344
- Fix schema inference for Snowflake tables with large number of columns (#5348) by @ewgenius in https://github.com/spiceai/spiceai/pull/5348
- feat: Update testoperator dispatch for validation, version metric (#5349) by @peasee in https://github.com/spiceai/spiceai/pull/5349
- fix: validate_results not validate (#5352) by @peasee in https://github.com/spiceai/spiceai/pull/5352
- revert to previous pdf-extract; remove test for encrypted pdf support (#5355) by @kczimm in https://github.com/spiceai/spiceai/pull/5355
- Stablize the test `verify_similarity_search_chat_completion` (#5284) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5284
- Turn off `delta_kernel::log_segment` logging and refactor log filtering (#5367) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5367
- Upgrade to DuckDB 1.2.2 (#5375) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5375
- Update Readme - fix broken and outdated links (#5376) by @ewgenius in https://github.com/spiceai/spiceai/pull/5376
- Upgrade dependabot dependencies (#5385) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5385
- fix: Remove IMAP oauth (#5386) by @peasee in https://github.com/spiceai/spiceai/pull/5386
- Bump Helm chart to 1.1.2 (#5389) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5389
- Refactor accelerator registry as part of runtime. (#5318) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5318
- Include `vnd.spiceai.sql/nsql.v1+json` response examples (openapi docs) (#5388) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5388
- docs: Update endgame template with SpiceQA, update qa analytics (#5391) by @peasee in https://github.com/spiceai/spiceai/pull/5391
- Make graceful shutdown timeout configurable (#5358) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5358
- docs: Update release criteria with note on max columns (#5401) by @peasee in https://github.com/spiceai/spiceai/pull/5401
- Update openapi.json (#5392) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5392
- FinanceBench: update scorer instructions and switch scoring model to `gpt-4.1` (#5395) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5395
- feat: Write OTel metrics for testoperator (#5397) by @peasee in https://github.com/spiceai/spiceai/pull/5397
- Update nsql openapi title (#5403) by @ewgenius in https://github.com/spiceai/spiceai/pull/5403
- Track `ai_inferences_count` with used tools flag. Extensible runtime request context. (#5393) by @ewgenius in https://github.com/spiceai/spiceai/pull/5393
- Include newly detected view as changed view (#5408) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5408
- Track used_tools in ai_inferences_with_spice_count as number (#5409) by @ewgenius in https://github.com/spiceai/spiceai/pull/5409
- Update openapi.json (#5406) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5406
- Tweak enforce pulls with Spice (#5411) by @lukekim in https://github.com/spiceai/spiceai/pull/5411
- Allow `flightsql` and `spiceai` connectors to override flight max message size (#5407) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5407
- Retry model graded scorer once on successful, empty response (#5405) by @Jeadie in https://github.com/spiceai/spiceai/pull/5405
- use span task name in 'spice trace' tree, not span_id (#5412) by @Jeadie in https://github.com/spiceai/spiceai/pull/5412
- Rename to `track_ai_inferences_with_spice_count` in all places (#5410) by @ewgenius in https://github.com/spiceai/spiceai/pull/5410
- Update qa_analytics.csv (#5421) by @peasee in https://github.com/spiceai/spiceai/pull/5421
- Remove the filter for the `list_datasets` tool in the AI inferences metric count. (#5417) by @ewgenius in https://github.com/spiceai/spiceai/pull/5417
- fix: Testoperator uses an exact API key for benchmark metric submission (#5413) by @peasee in https://github.com/spiceai/spiceai/pull/5413
- feat: Enable testoperator metrics in workflow (#5422) by @peasee in https://github.com/spiceai/spiceai/pull/5422
- Upgrade mistral.rs (#5404) by @Jeadie in https://github.com/spiceai/spiceai/pull/5404
- Include all FinanceBench documents in benchmark tests (#5426) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5426
- Handle second Ctrl-C to force runtime termination (#5427) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5427
- Add optional `--temperature` parameter for `spice chat` CLI command (#5429) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5429
- Remove `with_runtime_status` from the `RuntimeBuilder` (#5430) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5430
- Fix spice chat error handling (#5433) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5433
- Add more test models to FinanceBench benchmark (#5431) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5431
- support 'from: databricks:model_name' (#5434) by @Jeadie in https://github.com/spiceai/spiceai/pull/5434
- Upgrade Pulls with Spice to v1.0.6 and add concurrency control (#5442) by @lukekim in https://github.com/spiceai/spiceai/pull/5442
- Upgrade DataFusion table providers (#5443) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5443
- Test spice chat in e2e_test_spice_cli (#5447) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5447
- Allow for one-shot chat request using `spice chat <message>` (#5444) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5444
- Enable parallel data sampling for NSQL (#5449) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5449
- Upgrade Go from v1.23.4 to v1.24.2 (#5462) by @lukekim in https://github.com/spiceai/spiceai/pull/5462
- Update PULL_REQUEST_TEMPLATE.md (#5465) by @lukekim in https://github.com/spiceai/spiceai/pull/5465
- Enable captured outputs by default when spiced is started by the CLI (spice run) (#5464) by @lukekim in https://github.com/spiceai/spiceai/pull/5464
- Parameterized queries via Flight SQL API (#5420) by @kczimm in https://github.com/spiceai/spiceai/pull/5420
- fix: Update benchmarks readme badge (#5466) by @peasee in https://github.com/spiceai/spiceai/pull/5466
- delay auth check for binding parameterized queries (#5475) by @kczimm in https://github.com/spiceai/spiceai/pull/5475
- Add support for `?` placeholder syntax in parameterized queries (#5463) by @kczimm in https://github.com/spiceai/spiceai/pull/5463
- enable task name override for non static span names (#5423) by @Jeadie in https://github.com/spiceai/spiceai/pull/5423
- Allow parameter queries with no parameters (#5481) by @kczimm in https://github.com/spiceai/spiceai/pull/5481
- Support unparsing UNION for distinct results (#5483) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5483
- add rust-toolchain.toml (#5485) by @kczimm in https://github.com/spiceai/spiceai/pull/5485
- Add parameterized query support to the HTTP API (#5484) by @kczimm in https://github.com/spiceai/spiceai/pull/5484
- E2E test for spice chat <message> behavior (#5451) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5451
- Renable and fix huggingface models integration tests (#5478) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5478
- Update openapi.json (#5488) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5488
- feat: Record memory usage as a metric (#5489) by @peasee in https://github.com/spiceai/spiceai/pull/5489
- fix: update dispatcher to run all benchmarks, rename metric, update spicepods, add scale factor (#5500) by @peasee in https://github.com/spiceai/spiceai/pull/5500
- Fix ILIKE filters support (#5502) by @ewgenius in https://github.com/spiceai/spiceai/pull/5502
- fix: Update test spicepod locations and names (#5505) by @peasee in https://github.com/spiceai/spiceai/pull/5505
- fix: Update benchmark snapshots (#5508) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5508
- fix: Update benchmark snapshots (#5512) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5512
- Fix Delta Lake bug for: Found unmasked nulls for non-nullable StructArray field "predicate" (#5515) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5515
- fix: working directory for duckdb e2e test spicepods (#5510) by @peasee in https://github.com/spiceai/spiceai/pull/5510
- Tweaks to README.md (#5516) by @lukekim in https://github.com/spiceai/spiceai/pull/5516
- Cache logical plans of SQL statements (#5487) by @kczimm in https://github.com/spiceai/spiceai/pull/5487
- Fix `content-type: application/json` (#5517) by @Jeadie in https://github.com/spiceai/spiceai/pull/5517
- Validate postgres results in testoperator dispatch (#5504) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5504
- fix: Update benchmark snapshots (#5511) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5511
- Fix results cache by SQL with prepared statements (#5518) by @kczimm in https://github.com/spiceai/spiceai/pull/5518
- Add initial support for views acceleration (#5509) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5509
- fix: Update benchmark snapshots (#5527) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5527
- Support switching the memory allocator Spice uses via `alloc-*` features. (#5528) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5528
- fix: Update benchmark snapshots (#5525) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5525
- Add test spicepod for tpch mysql-duckdb[file acceleration] (#5521) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5521
- Fix nightly arm build - change tag `-default` to `-models` (#5529) by @ewgenius in https://github.com/spiceai/spiceai/pull/5529
- LLM router via `worker` spicepod component (#5513) by @Jeadie in https://github.com/spiceai/spiceai/pull/5513
- Apply Spice advanced acceleration logic and params support to accelerated views (#5526) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5526
- Enable DatasetCheckpoint logic for accelerated views (#5533) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5533
- Fix public '.model' name for router workers (#5535) by @Jeadie in https://github.com/spiceai/spiceai/pull/5535
- feat: Add Runtime memory limit parameter (#5536) by @peasee in https://github.com/spiceai/spiceai/pull/5536
- For fallback worker, check first item in `chat/completion` stream. (#5537) by @Jeadie in https://github.com/spiceai/spiceai/pull/5537
- Move rate limit check to after parameterized query binding (#5540) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5540
- Update spicepod.schema.json (#5545) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5545
- Accelerate views: refresh_on_startup, ready_state, jitter params support (#5547) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5547
- Add integration test for accelerated views (#5550) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5550
- Don't install make or expect on spiceai-macos runners (#5554) by @lukekim in https://github.com/spiceai/spiceai/pull/5554
- `event_stream` crate for emitting events from tracing::Span; used in v1/chat/completions streaming. (#5474) by @Jeadie in https://github.com/spiceai/spiceai/pull/5474
- Fix typo in method (#5559) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5559
- Run test operator every day and current and previous commits (#5557) by @lukekim in https://github.com/spiceai/spiceai/pull/5557
- Add aws_allow_http parameter for delta lake connector (#5541) by @Sevenannn in https://github.com/spiceai/spiceai/pull/5541
- feat: Add branch name to metric dimensions in testoperator (#5563) by @peasee in https://github.com/spiceai/spiceai/pull/5563
- fix: Update the tpch benchmark snapshots for: ./test/spicepods/tpch/sf1/federated/odbc[databricks].yaml (#5565) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5565
- fix: Split scheduled dispatch into a separate job (#5567) by @peasee in https://github.com/spiceai/spiceai/pull/5567
- fix: Use outputs.SPICED_COMMIT (#5568) by @peasee in https://github.com/spiceai/spiceai/pull/5568
- fix: Use refs in testoperator dispatch instead of commits (#5569) by @peasee in https://github.com/spiceai/spiceai/pull/5569
- fix: actions/checkout ref does not take a full ref (#5571) by @peasee in https://github.com/spiceai/spiceai/pull/5571
- fix: Testoperator dispatch (#5572) by @peasee in https://github.com/spiceai/spiceai/pull/5572
- Respect `update-snapshots` when running all benchmarks manually (#5577) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5577
- Use FETCH_HEAD instead of ${{ inputs.ref }} to list commits in setup_spiced (#5579) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5579
- Add additional test scenarios for benchmarks (#5582) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5582
- fix: Update the tpch benchmark snapshots for: test/spicepods/tpch/sf1/accelerated/databricks[delta_lake]-duckdb[file].yaml (#5590) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5590
- fix: Update the tpch benchmark snapshots for: test/spicepods/tpch/sf1/accelerated/mysql-duckdb[file].yaml (#5591) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5591
- Fix Snowflake data connector rows ordering (#5599) by @sgrebnov in https://github.com/spiceai/spiceai/pull/5599
- fix: Update benchmark snapshots (#5595) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5595
- fix: Update the tpch benchmark snapshots for: test/spicepods/tpch/sf1/accelerated/databricks[delta_lake]-arrow.yaml (#5594) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5594
- fix: Update benchmark snapshots (#5589) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5589
- fix: Update benchmark snapshots (#5583) by @app/github-actions in https://github.com/spiceai/spiceai/pull/5583
- Downgrade DuckDB to 1.1.3 (#5607) by @phillipleblanc in https://github.com/spiceai/spiceai/pull/5607
- Add prepared statement integration tests (#5544) by @kczimm in https://github.com/spiceai/spiceai/pull/5544

Full Changelog: v1.1.2...v1.2.0

Spice v1.0-rc.5 (Jan 13, 2025)

January 13, 2025 · 9 min read

Evgenii Khramkov

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.0-rc.5 🛠️

Spice v1.0.0-rc.5 is the fifth release candidate for the first major version of Spice.ai OSS. This release focuses production readiness and critical bug fixes. In addition, a new DynamoDB data connector has been added along with automatic detection for GPU acceleration when running Spice using the CLI.

Highlights in v1.0-rc.5

Automatic GPU Acceleration Detection: Automatically detect and utilize GPU acceleration when running by CLI. Install AI components locally using the CLI command spice install ai. Currently supports NVIdia CUDA and Apple Metal (M-series).
DynamoDB Data Connector: Query AWS DynamoDB tables using SQL with the new DynamoDB Data Connector.

datasets:
  - from: dynamodb:users
    name: users
    params:
      dynamodb_aws_region: us-west-2
      dynamodb_aws_access_key_id: ${secrets:aws_access_key_id}
      dynamodb_aws_secret_access_key: ${secrets:aws_secret_access_key}
    acceleration:
      enabled: true

sql> describe users;
+----------------+-----------+-------------+
| column_name    | data_type | is_nullable |
+----------------+-----------+-------------+
| created_at     | Utf8      | YES         |
| date_of_birth  | Utf8      | YES         |
| email          | Utf8      | YES         |
| account_status | Utf8      | YES         |
| updated_at     | Utf8      | YES         |
| full_name      | Utf8      | YES         |
| ...                                      |
+----------------+-----------+-------------+

File Data Connector: Graduated to Stable.
Dremio Data Connector: Graduated to Release Candidate (RC).
Spice.ai, Spark, and Snowflake Data Connectors: Graduated to Beta.

Dependencies

No major dependency changes.

Contributors

@Jeadie
@phillipleblanc
@ewgenius
@peasee
@Sevenannn
@lukekim

What's Changed

* Update acknowledgements by @github-actions in https://github.com/spiceai/spiceai/pull/4190
* Ensure non-nullity of primary keys in `MemTable`; check validity of initial data. by @Jeadie in https://github.com/spiceai/spiceai/pull/4158
* Bump version to v1.0.0 stable by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4191
* Fix metal + models download by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4193
* Update spice.ai connector beta roadmap by @ewgenius in https://github.com/spiceai/spiceai/pull/4194
* feat: verify on zero results snapshots by @peasee in https://github.com/spiceai/spiceai/pull/4195
* Add throughput test module to `test-framework` by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4196
* Update Spice.ai TPCH snapshots by @ewgenius in https://github.com/spiceai/spiceai/pull/4202
* Replace all usage of `lazy_static!` with `LazyLock` by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4199
* Fix model + metal download by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4200
* Run Clickbench for Dremio by @Sevenannn in https://github.com/spiceai/spiceai/pull/4138
* Update openapi.json by @github-actions in https://github.com/spiceai/spiceai/pull/4205
* Fix the typo in connector stable criteria by @Sevenannn in https://github.com/spiceai/spiceai/pull/4213
* feat: Add throughput test example by @peasee in https://github.com/spiceai/spiceai/pull/4214
* feat: calculate throughput test query percentiles by @peasee in https://github.com/spiceai/spiceai/pull/4215
* feat: Add throughput test to actions by @peasee in https://github.com/spiceai/spiceai/pull/4217
* Implement DynamoDB Data Connector by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4218
* 1.0 doc updates by @lukekim in https://github.com/spiceai/spiceai/pull/4181
* Improve clarity and concison of use-cases by @lukekim in https://github.com/spiceai/spiceai/pull/4220
* Remove macOS Intel build by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4221
* fix: Test operator throughput test workflow by @peasee in https://github.com/spiceai/spiceai/pull/4222
* DynamoDB: Automatically load AWS credentials from IAM roles if access key not provided by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4226
* File connector clickbench snapshots results by @ewgenius in https://github.com/spiceai/spiceai/pull/4225
* Spice.ai Catalog Connector by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4204
* feat: Add test framework metrics collection by @peasee in https://github.com/spiceai/spiceai/pull/4227
* Add badges for build/test status on README.md by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4228
* Release Dremio to RC by @Sevenannn in https://github.com/spiceai/spiceai/pull/4224
* feat: Add more test spicepods by @peasee in https://github.com/spiceai/spiceai/pull/4229
* feat: Add load test to testoperator by @peasee in https://github.com/spiceai/spiceai/pull/4231
* Add TSV format to all `object_store`-based connectors by @Jeadie in https://github.com/spiceai/spiceai/pull/4192
* Move test-framework to dev-dependencies for Runtime by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4230
* Document limitation for correlated subqueries in TPCH for Spice.ai connector by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4235
* Changes for CUDA by @Jeadie in https://github.com/spiceai/spiceai/pull/4130
* fix: Collect batches from test framework, load test updates by @peasee in https://github.com/spiceai/spiceai/pull/4234
* Suppress opentelemetry_sdk warnings - they aren't useful by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4243
* fix: Set dataset status first, update test framework by @peasee in https://github.com/spiceai/spiceai/pull/4244
* feat: Re-enable defaults on test spicepods by @peasee in https://github.com/spiceai/spiceai/pull/4248
* Add usage for streaming local models; Fix spice chat usage bar TPS expansion by @Jeadie in https://github.com/spiceai/spiceai/pull/4232
* refactor: Use composite testoperator setup, add query overrides by @peasee in https://github.com/spiceai/spiceai/pull/4246
* Enable expand_views_at_output for DF optimizer and transform schema to expanded view types by @ewgenius in https://github.com/spiceai/spiceai/pull/4237
* Add throughput test spicepod for databricks delta mode connector by @Sevenannn in https://github.com/spiceai/spiceai/pull/4241
* Spark data connector - update and enable TPCH and TPCDS benchmarks by @ewgenius in https://github.com/spiceai/spiceai/pull/4240
* Increase the timeout minutes of load test to 10 hours by @Sevenannn in https://github.com/spiceai/spiceai/pull/4254
* Improve partition column counts error for delta table by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4247
* Add e2e test for databricks catalog connector (mode: delta_lake) by @Sevenannn in https://github.com/spiceai/spiceai/pull/4255
* Spark connector integration tests by @ewgenius in https://github.com/spiceai/spiceai/pull/4256
* Run benchmark test with the new test framework by @Sevenannn in https://github.com/spiceai/spiceai/pull/4245
* Configure databricks delta secrets to run load test by @Sevenannn in https://github.com/spiceai/spiceai/pull/4257
* Support `properties` for emitted telemetry by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4249
* feat: Add `ready_wait` test operator workflow input by @peasee in https://github.com/spiceai/spiceai/pull/4259
* Handle 'LargeStringArray' for embedding tables by @Jeadie in https://github.com/spiceai/spiceai/pull/4263
* `llms` tests for alpha/beta model criteria  by @Jeadie in https://github.com/spiceai/spiceai/pull/4261
* Configurable runner type for load and throughput tests by @ewgenius in https://github.com/spiceai/spiceai/pull/4262
* Handle NULL partition columns for Delta Lake tables by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4264
* Add integration test for Snowflake by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4266
* Add Snowflake TPCH queries by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4268
* Handle `LargeStringArray` in `v1/search`. by @Jeadie in https://github.com/spiceai/spiceai/pull/4265
* Fix `build_cuda` in Update spiced_docker.yml by @Jeadie in https://github.com/spiceai/spiceai/pull/4269
* Run Snowflake benchmark in GitHub Actions by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4270
* Allow Snowflake query override for CI tests by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4271
* Don't run GPU builds for trunk by @Jeadie in https://github.com/spiceai/spiceai/pull/4272
* Fix InvalidTypeAction not working by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4273
* Add xAI key to llm integration tests by @Jeadie in https://github.com/spiceai/spiceai/pull/4274
* Update openai snapshots by @Jeadie in https://github.com/spiceai/spiceai/pull/4275
* Fix federation bug for correlated subqueries by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4276
* Update end_game.md by @ewgenius in https://github.com/spiceai/spiceai/pull/4278
* Promote Snowflake to Beta by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4277
* Set version to 1.0.0-rc.5 by @ewgenius in https://github.com/spiceai/spiceai/pull/4283
* Update cargo.lock by @ewgenius in https://github.com/spiceai/spiceai/pull/4285
* Update spice.ai data connector snapshots by @ewgenius in https://github.com/spiceai/spiceai/pull/4281
* Promote the Spice.ai Data Connector to Beta by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4282
* Revert change to  `integration_models__models__search__openai_chunking_response.snap` by @Jeadie in https://github.com/spiceai/spiceai/pull/4279
* Allow for a subset of build artifacts to be published to minio by @Jeadie in https://github.com/spiceai/spiceai/pull/4280
* Promote File Data Connector to Stable by @ewgenius in https://github.com/spiceai/spiceai/pull/4286
* Add Iceberg to Supported Catalogs by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4287
* Update openapi.json by @github-actions in https://github.com/spiceai/spiceai/pull/4289
* Fix Spark benchmark credentials, add back overrides by @ewgenius in https://github.com/spiceai/spiceai/pull/4295
* Promote Spark Data Connector to Beta by @ewgenius in https://github.com/spiceai/spiceai/pull/4296
* Add Dremio throughput test spicepod by @Sevenannn in https://github.com/spiceai/spiceai/pull/4233
* Add error message for invalid databricks mode parameter by @Sevenannn in https://github.com/spiceai/spiceai/pull/4299
* Fix pre-release check to look for `build` string by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4300
* Promote databricks catalog connector (mode: delta_lake) to beta by @Sevenannn in https://github.com/spiceai/spiceai/pull/4301
* Properly delegate `load_table` to Rest Catalog by @phillipleblanc in https://github.com/spiceai/spiceai/pull/4303
* Update acknowledgements by @github-actions in https://github.com/spiceai/spiceai/pull/4302
* docs: Update ROADMAP.md by @peasee in https://github.com/spiceai/spiceai/pull/4306
* v1.0.0-rc.5 Release Notes by @ewgenius in https://github.com/spiceai/spiceai/pull/4298

**Full Changelog**: https://github.com/spiceai/spiceai/compare/v1.0.0-rc.4...v1.0.0-rc.5

Resources

Community

Spice.ai started with the vision to make AI easy for developers. We are building Spice.ai in the open and with the community. Reach out on Slack or by email to get involved.

Twitter: @spice_ai
Slack: spiceai.org/slack
Telegram: Spice AI Discussion
Reddit: https://www.reddit.com/r/spiceai
Email: hey@spice.ai

What's New in v2.0.0-rc.2​

Distributed Cayenne Query and Write Improvements​

MERGE INTO for Spice Cayenne​

PARTITION BY Support for Cayenne​

Catalog Connector Enhancements​

JSON Ingestion Improvements​

DataFusion v52.4.0 Upgrade​

Dependency Upgrades​

Other Improvements​

Contributors​

Breaking Changes​

Upgrading​

What's Changed​

Changelog​

What's New in v1.11.0-rc.1​

Distributed Query with mTLS​

SMB and NFS Data Connectors​

Prepared Statements​

Spice Cayenne Accelerator Enhancements​

Google LLM Support​

Spice Java SDK v0.5.0​

OpenTelemetry Improvements​

Developer Experience Improvements​

Additional Improvements & Bug Fixes​

Contributors​

Breaking Changes​

OTel Ingestion Port Change​

Distributed Query Cluster Mode Requires mTLS​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.5.0​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Dependencies​

Changelog​

DataFusion v45 Highlights​

What's New in v1.2.0​

Contributors​

Cookbook Updates​

Upgrading​

What's Changed​

Dependencies​

Changelog​

Highlights in v1.0-rc.5​

Dependencies​

Contributors​

What's Changed​

Resources​

Community​

What's New in v2.0.0-rc.2

Distributed Cayenne Query and Write Improvements

MERGE INTO for Spice Cayenne

`PARTITION BY` Support for Cayenne

Catalog Connector Enhancements

JSON Ingestion Improvements

DataFusion v52.4.0 Upgrade

Dependency Upgrades

Other Improvements

Contributors

Breaking Changes

Upgrading

What's Changed

Changelog

What's New in v1.11.0-rc.1

Distributed Query with mTLS

SMB and NFS Data Connectors

Prepared Statements

Spice Cayenne Accelerator Enhancements

Google LLM Support

Spice Java SDK v0.5.0

OpenTelemetry Improvements

Developer Experience Improvements

Additional Improvements & Bug Fixes

Contributors

Breaking Changes

OTel Ingestion Port Change

Distributed Query Cluster Mode Requires mTLS

Cookbook Updates

Upgrading

What's Changed

Changelog

What's New in v1.5.0

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Dependencies

Changelog

DataFusion v45 Highlights

What's New in v1.2.0

Contributors

Cookbook Updates

Upgrading

What's Changed

Dependencies

Changelog

Highlights in v1.0-rc.5

Dependencies

Contributors

What's Changed

Resources

Community