http://starrocks.io logo
Join Slack
Powered by
# announcements
  • b

    Beryl Chen

    07/25/2024, 1:30 PM
    Just a friendly reminder that today’s session, Rockset Acquired by OpenAI: What’s Next for Its Users? is happening at 10 AM PT | 1 PM ET. <!channel> Agenda: • What the Rockset acquisition means for its users. • Immediate steps users should take now to ensure continuity in their operations. • The pros and cons of multiple open-source and commercial alternatives for each Rockset use case. https://celerdata.wistia.com/live/events/w0k2mcmpi0 (Can’t join us live? That’s perfectly okay! Please sign up, and we will make sure you get a copy of the recording.)
    👍 5
  • b

    Beryl Chen

    07/26/2024, 5:05 PM
    You’ve likely seen him on the website, social media, or even a recent StarRocks webinar. But now, let’s make it official—meet Rocky, our new mascot! rocky heart Discover more about Rocky and all his otterly adorable details! rocky nice Get a behind-the-scenes look at how Rocky came to life, straight from Wenlong, the creative brain behind our new mascot.
    rocky nice 6
    🎉 3
    rocky confused 3
    rocky heart 4
    💚 7
    h
    • 2
    • 1
  • g

    GitHub

    07/30/2024, 3:20 AM
    Release - Release notes 3.1.14 New release published by jaogoy Release date: July 29, 2024 Improvements • Stream Load now supports using
    \t
    and
    \n
    as row and column delimiters. Users do not need to convert them to their hexadecimal ASCII codes. #47302 Bug Fixes Fixed the following issues: • Frequent INSERT and UPDATE operations on Primary Key tables may cause write and query delays in the database. #47838 • When a Primary Key table encounters data persistence failures, the persistent index may fail to capture the error, leading to data loss and reporting the error "Insert found duplicate key". #48045 • Materialized views may report insufficient permissions when refreshed. #47561 • Materialized view reports the error "For input string" when refreshed. #46131 • During materialized view refresh, the lock is held excessively long, causing the Leader FE to be restarted by the deadlock detection script. #48256 • Queries against views with the IN clause in its definition may return inaccurate results. #47484 • Global Runtime Filter causes incorrect results. #48496 • MySQL protocol
    COM_CHANGE_USER
    does not support
    conn_attr
    . #47796 Behavior Changes • When users create a non-partitioned table without specifying the bucket number, the minimum bucket number the system sets for the table is
    16
    (instead of
    2
    based on the formula
    2*BE or CN count
    ). If users want to set a smaller bucket number when creating a small table, they must set it explicitly. #47005 StarRocks/starrocks
  • b

    Beryl Chen

    08/02/2024, 9:52 PM
    <!channel> Pinterest Engineering has published a comprehensive article on their experience launching their Analytics app with StarRocks. A big thank you to the Pinterest team for sharing their journey. Interested in the challenges they faced and how StarRocks helped them overcome these obstacles? Read their detailed story here: https://medium.com/pinterest-engineering/delivering-faster-analytics-at-pinterest-a639cdfad374
    👍 9
    clapclap 22
    rocky nice 2
  • g

    GitHub

    08/08/2024, 8:14 AM
    Release - 3.3.2 New release published by wangsimo0 Release date: August 8, 2024 New Features • Supports renaming columns within StarRocks internal tables. #47851 • Supports reading Iceberg views. Currently, only Iceberg views created through StarRocks are supported. #46273 • [Experimental] Supports adding and removing fields of STRUCT-type data. #46452 • Supports specifying the compression level for ZSTD compression format during table creation. #46839 • Added the following FE dynamic parameters to limit table boundaries. #47896 Including: •
    auto_partition_max_creation_number_per_load
    •
    max_partition_number_per_table
    •
    max_bucket_number_per_partition
    •
    max_column_number_per_table
    • Supports runtime optimization of table data distribution, ensuring optimization tasks do not conflict with DML operations on the table. #43747 • Added an observability interface for the global hit rate of Data Cache. #48450 • Added the SQL function array_repeat. #47862 Improvements • Optimized the error messages for Routine Load failures due to Kafka authentication failures. #46136 #47649 • Stream Load supports using
    \t
    and
    \n
    as row and column delimiters. Users do not need to convert them to their hexadecimal ASCII codes. #47302 • Optimized the asynchronous statistics collection method for write operators, addressing the issue of increased latency when there are many import tasks. #48162 • Added the following BE dynamic parameters to control resource hard limits during loading, reducing the impact on BE stability when writing a large number of tablets. #48495 Including: •
    load_process_max_memory_hard_limit_ratio
    •
    enable_new_load_on_memory_limit_exceeded
    • Added consistency checks for Column IDs within the same table to prevent Compaction errors. #48498 • Supports persisting PIPE metadata to prevent metadata loss due to FE restarts. #48852 Bug Fixes • The process could not end when creating a dictionary from an FE Follower. #47802 • Inconsistent information returned by the SHOW PARTITIONS command in shared-data clusters and shared-nothing clusters. #48647 • Data errors caused by incorrect type handling when loading data from JSON fields to
    ARRAY<BOOLEAN>
    columns. #48387 • The
    query_id
    column in
    information_schema.task_runs
    cannot be queried. #48876 • During Backup, multiple requests for the same operation are submitted to different Brokers, causing request errors. #48856 • Downgrading to versions earlier than v3.1.11 or v3.2.4 causes Primary Key table index decompression failures, leading to query errors. #48659 Downgrade Notes If you have used the renaming column feature, you must rename the columns to their original names before downgrading your cluster to an earlier version. You can check the audit log of your cluster after upgrading to identify any
    ALTER TABLE RENAME COLUMN
    operations and the original names of the columns. StarRocks/starrocks
    cat dance 1
    🚀 5
  • b

    Beryl Chen

    08/15/2024, 1:30 PM
    Just a friendly reminder that today’s session, “StarRocks Virtual Meetup: Version 3.3.x and What’s Next,” is happening at 10 AM PT | 1 PM ET. <!channel> Harrison (Heng) Zhao and Sida Shen will walk you through the new features in 3.3.x, share upcoming plans, answer your questions, and update you on what’s happening in the StarRocks community. We hope you can join us! https://celerdata.wistia.com/live/events/qy17vi9l5z?utm_campaign=vm&amp;utm_source=social
    dogdance 1
    ✅ 1
    👍 8
    v
    s
    a
    • 4
    • 4
  • b

    Beryl Chen

    08/19/2024, 10:54 PM
    <!channel> Here are a few resources you might find useful: If you missed our ‘StarRocks Virtual Meetup: Version 3.3.x and What’s Next’ webinar last week led by @Heng Zhao and @Sida Shen or want to revisit it, the recording is available here:

    https://youtu.be/qs4RQ37h_dI?si=P67jXhGdl1uL46BI▾

    Check out this video where @Simo Wang breaks down the challenges of job planning in Iceberg and shows how StarRocks effectively addresses them, backed by real-world examples. Simo also conducts a demo comparing job planning between StarRocks 3.2 and 3.3, highlighting the performance boost:

    https://youtu.be/bKB7fyE4dQ0?si=UPeiGZx6hZV94N49▾

    The data file organization format has been redesigned in StarRocks’s cloud-native version to better align with object storage. Delve into @Jeff Ding ’s latest article, where he uses the SSB lineorder table as an example to illustrate how data files are organized on object storage: https://medium.com/starrocks-engineering/introduction-47d1eea48b41
    👍 7
    🎉 1
    o
    k
    • 3
    • 2
  • g

    GitHub

    08/23/2024, 6:13 AM
    Release - 3.2.10 New release published by yingtingdong Release date: August 23, 2024 Improvements • Files() will automatically convert
    BYTE_ARRAY
    data with a
    logical_type
    of
    JSON
    in Parquet files to the JSON type in StarRocks. #49385 • Optimized error messages for Files() when Access Key ID and Secret Access Key are missing. #49090 •
    information_schema.columns
    supports the
    GENERATION_EXPRESSION
    field. #49734 Bug Fixes Fixed the following issues: • Downgrading a v3.3 shared-data cluster to v3.2 after setting the Primary Key table property
    "persistent_index_type" = "CLOUD_NATIVE"
    causes a crash. #48149 • Exporting data to CSV files using SELECT INTO OUTFILE may cause data inconsistency. #48052 • Queries encounter failures during concurrent query execution. #48180 • Queries would hang due to a timeout in the Plan phase without exiting. #48405 • After disabling index compression for Primary Key tables in older versions and then upgrading to v3.2.9, accessing
    page_off
    information causes an array out-of-bounds crash. #48230 • BE crash caused by concurrent execution of ADD/DROP COLUMN operations. #49355 • Queries against negative
    TINYINT
    values in ORC format files return
    None
    on the aarch64 architecture. #49517 • If the disk write operation fails, failures of
    l0
    snapshots for Primary Key Persistent Index may cause data loss. #48045 • Partial Update in Column mode for Primary Key tables fails under scenarios with large-volume data updates. #49054 • BE crash caused by Fast Schema Evolution when downgrading a v3.3.0 shared-data cluster to v3.2.9. #42737 •
    partition_linve_nubmer
    does not take effect. #49213 • The conflict between index persistence and compaction in Primary Key tables could cause clone failures. #49341 • Modifications of
    partition_line_number
    using ALTER TABLE do not take effect. #49437 • Rewrite of CTE distinct grouping sets generates an invalid plan. #48765 • RPC failures polluted the thread pool. #49619 • authentication failure issues when loading files from AWS S3 via PIPE. #49837 Behavior Changes • Added a check for the
    meta
    directory in the FE startup script. If the directory does not exist, it will be automatically created. #48940 • Added a memory limit parameter
    load_process_max_memory_hard_limit_ratio
    for data loading. If memory usage exceeds the limit, subsequent loading tasks will fail. #48495 StarRocks/starrocks
    🙌 6
  • g

    GitHub

    09/04/2024, 9:04 AM
    Release - 3.1.15 New release published by jaogoy 3.1.15 Release date: September 4, 2024 Bug Fixes Fixed the following issues: • During query rewrite with asynchronous materialized views,
    count(*)
    on certain tables returns NULL. #49288 •
    partition_linve_nubmer
    does not take effect. #49213 • FE throws a tablet exception: BE disk offline, and cannot migrate tablets. #47833 StarRocks/starrocks
  • g

    GitHub

    09/05/2024, 5:55 AM
    Release - 3.3.3 New release published by wangsimo0 3.3.3 Release date: September 5, 2024 New Features • Supports user-level variables. #48477 • Supports Delta Lake Catalog metadata cache with manual and periodic refresh strategies. #46526 #49069 • Supports loading JSON types from Parquet files. #49385 • JDBC SQL Server Catalog supports queries with LIMIT. #48248 • Shared-data clusters support Partial Updates with INSERT INTO. #49336 Improvements • Optimized error messages for loading: • When memory limits are reached during loading, the IP of the corresponding BE node is returned for easier troubleshooting. #49335 • Detailed messages are provided when CSV data is loaded to target table columns that are not long enough. #49713 • Specific node information is provided when Kerberos authentication fails in Broker Load. #46085 • Optimized the partitioning mechanism during data loading to reduce memory usage in the initial stage. #47976 • Optimized memory usage for shared-nothing clusters by limiting metadata memory usage to avoid issues when there are too many Tablets or Segment files. #49170 • Optimized the performance of queries using
    max(partition_column)
    . #49391 • Partition pruning is used to optimize query performance when the partition column is a generated column (a column that is calculated based on a native column in the table), and the query predicate filter condition includes the native column. #48692 • Supports masking authentication information for Files() and PIPE. #47629 • Introduced a new statement
    show proc '/global_current_queries'
    to view queries running on all FE nodes.
    show proc '/current_queries'
    only shows queries running on the current FE node. #49826 Bug Fixes Fixed the following issues: • The source cluster's BE nodes were mistakenly added to the current cluster when exporting data to the destination cluster via StarRocks external tables. #49323 • TINYINT data type returned NULL when StarRocks reads ORC files using
    select * from files
    from clusters deployed on aarch64 machines. #49517 • Stream Load fails when loading JSON files containing large Integer types. #49927 • Incorrect schema is returned due to improper handling of invisible characters when users load CSV files with Files(). #49718 • An issue with temporary partition replacement in tables with multiple partition columns. #49764 Behavior Changes • Introduced a new parameter
    object_storage_rename_file_request_timeout_ms
    to better accommodate backup scenarios with cloud object storage. This parameter will be used as the backup timeout, with a default value of 30 seconds. #49706 •
    to_json
    ,
    CAST(AS MAP)
    , and
    STRUCT AS JSON
    will return NULL instead of throwing an error by default when the conversion fails. You can allow errors by setting the system variable
    sql_mode
    to
    ALLOW_THROW_EXCEPTION
    . #50157 StarRocks/starrocks
    🙌 4
  • g

    GitHub

    09/09/2024, 8:28 AM
    Release - 3.2.11 New release published by yingtingdong Release date: September 9, 2024 Improvements • Supports masking authentication information for Files() and PIPE. #47629 • Support automatic inference for the STRUCT type when reading Parquet files through Files(). #50481 Bug Fixes Fixed the following issues: • An error is returned for equi-join queries because they failed to be rewritten by the global dictionary. #50690 • The error "version has been compacted" caused by an infinite loop on the FE side during Tablet Clone. #50561 • Incorrect scheduling for unhealthy replica repairs after distributing data based on labels. #50331 • An error in the statistics collection log: "Unknown column '%s' in '%s." #50785 • Incorrect timezone usage when reading complex types like TIMESTAMP from Parquet files via Files(). #50448 Behavior Changes • When downgrading StarRocks from v3.3.x to v3.2.11, the system will ignore it if there is incompatible metadata. #49636 StarRocks/starrocks
    🙌🏼 1
    🙌 1
  • b

    Beryl Chen

    09/09/2024, 8:53 PM
    <!channel> 🎂 Happy 3rd Anniversary, StarRocks Community! We’ve come a long way in 3 years, thanks to YOU – our users, contributors, and supporters. Your contributions, feedback, and engagement have been the heart and soul of StarRocks’ growth. Thank you for joining us on this amazing ride! Let’s continue to break new ground and celebrate many more milestones together. 🎉
    🎉 23
    clapclap 7
    👍 10
    starrocks 5
    g
    s
    +3
    • 6
    • 8
  • b

    Beryl Chen

    09/12/2024, 1:30 PM
    🔔 Just a friendly reminder that today’s session, *“*Query Engine Must-Haves for the Best Apache Superset Experience” is happening at 10 AM PT | 1 PM ET. <!channel> https://celerdata.wistia.com/live/events/zhxc0m4nxu Agenda: • Fast and flexible ad-hoc queries: Run complex SQL queries on the fly without extensive pre-computation for interactive data analysis. • On-demand query acceleration: Enable your underlying engine to add pre-computations on demand without manual SQL rewriting. • Support for open formats: Integrate with open formats to simplify your data pipeline while improving data governance. Get your questions answered and see these critical features in action with a demo that showcases these capabilities using Preset (powered by Apache Superset), CelerData (powered by StarRocks), and Apache Iceberg.
  • g

    GitHub

    07/02/2025, 2:19 AM
    Release - v3.5.1 New release published by yingtingdong Release Date: July 1, 2025 ### New Features • [Experimental] Starting from v3.5.1, StarRocks introduces a high-performance data transfer channel based on the Apache Arrow Flight SQL protocol, comprehensively optimizing the data import channel and significantly improving transfer efficiency. This solution establishes a fully columnar data transfer pipeline from the StarRocks columnar execution engine to the client, eliminating the frequent row-column conversions and serialization overhead typically seen in traditional JDBC and ODBC interfaces, and achieving true zero-copy, low-latency, and high-throughput data transfer capabilities. #57956 • Java Scalar UDFs (user-defined functions) now support ARRAY and MAP types as input parameters. #55356 • Cross-node data cache sharing: Enables nodes to share cached external table data of data lakes across compute nodes via the network. If a local cache miss occurs, the system first attempts to fetch data from the caches of other nodes within the same cluster. Only if all caches miss will it re-fetch data from remote storage. This feature effectively reduces performance jitter caused by cache invalidation during elastic scaling and ensures stable query performance. A new FE configuration parameter
    enable_trace_historical_node
    controls this behavior (Default:
    false
    ). #57083 • Storage Volume adds native support for Google Cloud Storage (GCS): You can now use GCS as a backend storage volume and manage and access GCS resources through the native SDK. #58815 ### Improvements • Optimized error messages when creating Hive external tables fails. #60076 • Optimized
    count(1)
    query performance using the
    file_record_count
    in Iceberg metadata. #60022 • Refined the Compaction scheduling logic to avoid delayed scheduling when all subtasks succeed. #59998 • Added
    JAVA_OPTS="--add-opens=java.base/java.util=ALL-UNNAMED"
    to BE and CN after upgrading to JDK 17. #59947 • Supports modifying the
    kafka_broker_list
    property via the ALTER ROUTINE LOAD command when Kafka Broker endpoints change. #59787 • Supports reducing build dependencies of the Docker base image through parameters. #59772 • Supports accessing Azure using Managed Identity authentication. #59657 • Improved error messages when querying external data via
    Files()
    function with duplicate path column names. #59597 • Optimized LIMIT pushdown logic. #59265 ### Bug Fixes Fixed the following issues: • Partition pruning issue when queries include Max and Min aggregations and empty partitions. #60162 • Incorrect query results when rewriting queries with materialized views due to missing NULL partitions. #60087 • Refresh errors on Iceberg external tables when using partition expressions based on
    str2date
    . #60089 • Incorrect partition range when creating temporary partitions using the START END syntax. #60014 • Incorrect display of Routine Load metrics on non-leader FE nodes. #59985 • BE/CN crashes when executing queries containing
    COUNT(*)
    window functions. #60003 • Stream Load failures when the target table name contains Chinese characters. #59722 • Overall loading failures to triple-replica tables when loading to a secondary replica fails. #59762 • Missing parameters in SHOW CREATE VIEW output. #59714 ### Behavior Changes • Some FE metrics include the
    is_leader
    label. #59883 StarRocks/starrocks
    👍 5
    🎉 9
    🪣 1
  • m

    Mehdi Sidi Boumedine

    07/02/2025, 1:03 PM
    Hi Beryl, I haven't been able to attend the live event, is there any recording available?
    b
    • 2
    • 1
  • g

    GitHub

    07/04/2025, 8:58 AM
    Release - 3.3.16 New release published by wangsimo0 ## 3.3.16 Release Date: July 4, 2025 ### Improvements • Optimized error logs when creating Hive tables with duplicate names. #60076 • Added the FE parameter
    slow_lock_print_stack
    to prevent process stalls in large clusters when printing thread stacks. #59967 • Reduced unnecessary locks during tablet scheduling. #59744 ### Bug Fixes Fixed the following issues: • SplitOR fails to prune scan columns. #60223 • Incorrect query plan for null-aware left anti joins. #60119 • Incorrect query results when rewriting queries with materialized views due to missing NULL partitions. #60087 • Partition pruning errors when tables contain empty partitions. #60162 • Refresh errors on Iceberg external tables when using partition expressions based on
    str2date
    . #60089 • Unexpected behavior caused by materialized view schema changes. #60079 • Issues related to low-cardinality global dictionaries in UNION operators. #60075 • Incorrect partition ranges for temporary partitions created using the START END syntax. #60014 • Lock issues with SUBMIT TASK. #60026 • Partial updates fail on Primary Key tables under certain conditions. #60052 • Crashes caused by BE failing to create directories due to a lack of permissions to access storage paths. #60028 • Cache failures due to cache key duplication in concurrent scenarios. #60053 • Hive table metadata background refresh failure in Unified Catalog. #55215 • Query failures caused by incorrect return types of CASE WHEN. #59972 • Query failures when Delta Lake tables UNION themselves. #60030 • Partition creation failure when writing to multiple tables within the same transaction. #59954 • Queries could return empty results instead of errors when tablet versions were updated during execution. #53060 • Queries against modified columns in a table return null after upgrading to v3.4. #59941 • Authentication information is printed in logs. #59907 • Metadata refresh failures for external tables in Hive Catalog. #54596 • CACHE SELECT failures for tables after schema changes. #59812 • Broker Load could not recover after FE Leader shifts. #59732 • Stream Load failures when the target table name contains Chinese characters. #59722 • Incorrect query results in external tables due to search key hash collisions (affecting Iceberg/Delta/Paimon). #59781 StarRocks/starrocks
    👍 1
    🎉 1
  • b

    Beryl Chen

    07/09/2025, 11:01 PM
    Hi <!channel>, Just flagging this in case it’s helpful—we’ve compiled best practices around key topics like partitioning, clustering, bucketing, and primary key tables in StarRocks. These were put together by our engineering team to address things that come up often. danceml 📄 Check them out if you haven’t already: https://docs.starrocks.io/docs/category/best-practices/
    👍 17
    🚀 3
    🎉 1
    👏 1
    🔥 1
    gratitude thank you 10
    🙌 6
    m
    p
    a
    • 4
    • 5
  • g

    GitHub

    07/11/2025, 5:54 AM
    Release - 3.4.5 New release published by jaogoy Release Date: July 10, 2025 ### Improvements • Enhanced observability of loading job execution: Unified the runtime information of loading tasks into the
    information_schema.loads
    view. Users can view the execution details of all INSERT, Broker Load, Stream Load, and Routine Load subtasks in this view. Additional fields have been added to help users better understand the status of loading tasks and the association with parent jobs (PIPES, Routine Load Jobs). • Support modifying
    kafka_broker_list
    via the
    ALTER ROUTINE LOAD
    statement. ### Bug Fixes The following issues have been fixed: • Under high-frequency loading scenarios, Compaction could be delayed. #59998 • Querying Iceberg external tables via Unified Catalog would throw an error:
    not support getting unified metadata table factory
    . #59412 • When using
    DESC FILES()
    to view CSV files in remote storage, incorrect results were returned because the system mistakenly inferred
    xinf
    as the FLOAT type. #59574 •
    INSERT INTO
    could cause BE to crash when encountering empty partitions. #59553 • When StarRocks reads Equality Delete files in Iceberg, it could still access deleted data if the data had already been removed from the Iceberg table. #59709 • Query failures caused by renaming columns. #59178 ### Behavior Changes • The default value of the BE configuration item
    skip_pk_preload
    has been changed from
    false
    to
    true
    . As a result, the system will skip preloading Primary Key Indexes for Primary Key tables to reduce the likelihood of
    Reached Timeout
    errors. This change may increase query latency for operations that require loading Primary Key Indexes. StarRocks/starrocks
    🙌 3
  • r

    Ronit Kapoor

    07/18/2025, 5:43 PM
    Hey <!channel>! I'm Ron, the new Developer Advocate at CelerData. I'm thrilled to be part of the team and to support and grow the StarRocks community. Next Thursday, I’ll be speaking at the Apache Iceberg Community Meetup, hosted at Snowflake’s Atlanta office. I’ll be sharing how you can meet your SLAs and low latency goals using materialized views. If you're in the area, it’s a great chance to connect with others in the Iceberg community and with CelerData (that’s me!). Come join us for an evening of learning, insights, and great conversation. Register here on Luma! ->>>> https://lu.ma/vdbjh2fc?tk=fZUnv1 Hope to see you there! And in the meantime, if you have any questions or just want to connect, feel free to reach out 😄 https://www.linkedin.com/in/ron-kapoor-swe/
    rocky heart 5
    🎉 20
    🙌 3
    👍 10
    party 7
    y
    • 2
    • 1
  • g

    GitHub

    07/21/2025, 2:11 AM
    Release - 3.5.2 New release published by yingtingdong Release Date: July 18, 2025 ### Improvements • Collected NDV (number of distinct values) statistics for ARRAY columns to improve query plan accuracy. #60623 • Disabled replica balancing for Colocate tables and tablet scheduling in Shared-data clusters to reduce unnecessary log output. #60737 • Optimized Catalog access workflow: FE now delays accessing external data sources asynchronously at startup to prevent hanging due to external service unavailability. #60614 • Added session variable
    enable_predicate_expr_reuse
    to control predicate pushdown. #60603 • Supports a retry mechanism when fetching Kafka partition information fails. #60513 • Removed the restriction requiring exact mapping of partition columns between materialized views and base tables. #60565 • Supports building Runtime In-Filters to enhance aggregation performance by filtering data during aggregation. #59288 ### Bug Fixes Fixed the following issues: • COUNT DISTINCT queries crash due to low-cardinality optimization for multiple columns. #60664 • Incorrect matching of global UDFs when multiple functions share the same name. #60550 • Null pointer exception (NPE) issue during Stream Load import. #60755 • Null pointer exception (NPE) issue when starting FE during a recovery from a cluster snapshot. #60604 • BE crash caused by column mode mismatch when processing short-circuit queries with out-of-order values. #60466 • Session variables set via PROPERTIES in SUBMIT TASK statements did not take effect. #60584 • Incorrect results for
    SELECT min/max
    queries under specific conditions. #60601 • Incorrect bucket pruning when the left side of a predicate is a function, leading to incorrect query results. #60467 • Crash for queries against a non-existent
    query_id
    via Arrow Flight SQL. #60497 ### Behavior Changes • The default value of
    lake_compaction_allow_partial_success
    is set to
    true
    . Compaction operations can now be marked as successful even if partially completed, preventing blockage of subsequent compaction tasks. #60643 StarRocks/starrocks
  • b

    Beryl Chen

    07/22/2025, 4:00 PM
    Hi <!channel> ! 👋 The StarRocks Global Summit 2025 site is live - https://summit.starrocks.io/2025 danceml Grab your free pass and join us online on September 10 for a full day of engineer-to-engineer sessions led by StarRocks PMCs and key members of the user community. Spare a few hours, skip the guesswork, and learn from teams who’ve solved some of the toughest data challenges using StarRocks. We’re sharing hard-won lessons, real use cases, and tips you can take straight into production—or your next proof of concept! 🔗 Save your spot now: https://summit.starrocks.io/2025 🎤 Got something to share? There’s still time to *submit a talk*—CFP is open a bit longer!
    🎉 7
    👀 3
    👍 5
  • b

    Beryl Chen

    07/24/2025, 1:45 PM
    Friendly reminder: Today’s session, "Introducing StarRocks 3.5," is happening at 10 AM PT | 1 PM ET! Save your seat here*:* https://celerdata.wistia.com/live/events/vwke0ga4l8 Here’s a sneak peek at what we’ll cover: • Cluster management & security: Snapshot & restore for better disaster recovery, LDAP/JWT/OAuth2 authentication, and more • Table & ingestion: Spilling for stable massive batch ingestion, partition merge, TTL • Open format/Lakehouse integration: Low-cardinality optimization with dictionary encoding • Materialized views: TTL support to boost performance and save storage And we’ll open the floor for all your StarRocks questions at the end!
    🎉 6
  • g

    GitHub

    07/31/2025, 9:38 AM
    Release - 3.3.17 New release published by wangsimo0 ## v3.3.17 Release Date: July 30, 2025 ### Bug Fixes The following issues have been fixed: • Upgraded HttpClient5 to 5.4.3. #61298 • Incorrect
    cpu_core_used_permille
    limit in resource groups. #61177 • Conflict between ALTER jobs and partition creation tasks. #61167 • NPE caused by missing
    globalStateMgr
    in
    ConnectContext
    . #60880 • Partition creation failed when partition names matched case-insensitively but had different values. #60909 • Lock competition caused by synchronous access to partition statistics. #61041 • ANALYZE tasks stuck in
    pending
    state after FE restart. #61113 • Issue with JIT (Just-In-Time) compilation in BE. #61060 • Leader address issue in Starmgr. #61016 • CVE vulnerabilities in Broker. #60908 • Actual number of JDBC connections exceeded
    jdbc_connection_pool_size
    limit. #61004 • CVE-2022-41404 vulnerability. #59689 • CVEs related to Parquet and HttpClient5. #58750 • Partition not removed from
    _partition_map
    when physical partition ID was empty. #60842 • Missing version check in shared-data clusters. #59422 • Transaction log missing when publishing logs in batches in shared-data clusters. #60949 • Concurrent publishing of the same transaction when Batch Publish is enabled in shared-data clusters. #57574 • Statistics overwrite issue caused by lack of semi-synchronous mode. #60897 • Inaccurate
    maxInstantTime
    used for filtering Hudi files when retrieving latest merged file slices. #60927 • TaskRun state incompatible with earlier versions. #60438 • CVE-2025-52999 vulnerability. #60795 • Vulnerability caused by
    log4j-1.2.17-cloudera6
    in Broker. #59579 • BE crash when loading OOM partitions. #60778 • Base Compaction tasks blocking other compaction tasks. #60711 • Inefficient handling of error string truncation. #60878 • Materialized view rewrite failed in multi-FE environments. #60841 • INSERT OVERWRITE failed on manually created partitions. #60750 • Issue caused by using random distribution in aggregate keys. #60702 • Crash caused by low cardinality rewrite in
    multi_distinct_count
    . #60664 • Issue with Pivot resolving fields. #60748 • Upgraded
    hudi-common
    to 1.0.2. #59501 • BE crash when CLONE and DROP TABLE run concurrently. #61359 StarRocks/starrocks
    🚀 1
    🙌 2
  • b

    Beryl Chen

    08/05/2025, 7:10 PM
    Hi <!channel>, We recently hosted a webinar diving into the new features in StarRocks 3.5 with @Sida Shen—and had a great discussion with tons of thoughtful questions from the community! If you missed it (or just want to revisit the highlights), Sida also wrote two blog posts so you can catch up anytime: 📝 Blog: Introducing StarRocks 3.5 https://www.starrocks.io/blog/introducing-starrocks-3-5 💬 Q&A Recap: Security, Iceberg, and more https://www.starrocks.io/blog/35-webinar-qa-recap-starrocks-security-iceberg-support-and-more 🎬 Watch the full recording:

    https://www.youtube.com/watch?v=J0WJyXbWghU▾

    🙌 11
    👍 2
  • g

    GitHub

    08/07/2025, 7:58 AM
    Release - 3.4.6 New release published by jaogoy Release Date: August 7, 2025 ### Improvements • When exporting data to Parquet files using
    INSERT INTO FILES
    , you can now specify the Parquet version via the </StarRocks/starrocks/blob/sql-reference/sql-functions/table-functions/files.md#parquetversion|`parquet.version`> property to improve compatibility with other tools when reading the exported files. #60843 ### Bug Fixes The following issues have been fixed: • Loading jobs failed due to overly coarse lock granularity in
    TableMetricsManager
    . #58911 • Case sensitivity issue in column names when loading Parquet data via
    FILES()
    . #61059 • Cache did not take effect after upgrading a shared-data cluster from v3.3 to v3.4 or later. #60973 • A division-by-zero error occurred when the partition ID was null, causing a BE crash. #60842 • Broker Load jobs failed during BE scaling. #60224 ### Behavior Changes • The
    keyword
    column in the
    information_schema.keywords
    view has been renamed to
    word
    to align with the MySQL definition. #60863 StarRocks/starrocks
    👍 3
  • g

    GitHub

    08/11/2025, 7:30 AM
    Release - 3.5.3 New release published by yingtingdong Release Date: August 11, 2025 ### Feature Enhancements • Lake Compaction adds Segment write time statistics. #60891 • Avoid bRPC communication in local PassThrough exchange scenarios. #60538 • Disable inline mode for Data Cache writes to avoid performance degradation. #60530 • Iceberg metadata scan supports shared file I/O. #61012 • Support termination of all PENDING ANALYZE tasks. #61118 • Force reuse when there are too many CTE nodes to avoid excessive optimization time. #60983 • Added
    BALANCE
    type to cluster balance results. #61081 • Optimized materialized view rewrite for external tables. #61037 • Default value of system variable
    enable_materialized_view_agg_pushdown_rewrite
    is changed to
    true
    , enabling aggregation pushdown for materialized view queries by default. #60976 • Optimized partition statistics lock competition. #61041 ### Bug Fixes The following issues have been fixed: • Inconsistent Chunk column size after column pruning. #61271 • Synchronous execution of partition statistics loading may cause deadlocks. #61300 • Crash when
    array_map
    processes constant array columns. #61309 • Setting an auto-increment column to NULL results in the system mistakenly rejecting valid data within the same Chunk. #61255 • The actual number of JDBC connections may exceed the
    jdbc_connection_pool_size
    limit. #61038 • FQDN mode did not use IP addresses as cache map keys. #61203 • Array column cloning error during array comparison. #61036 • Deploying serialized thread pool blockage led to query performance degradation. #61150 • OK hbResponse not synchronized after heartbeat retry counter reset. #61249 • Incorrect result for the
    hour_from_unixtime
    function. #61206 • Conflicts between ALTER TABLE jobs and partition creation. #60890 • Cache does not take effect after upgrading from v3.3 to v3.4 or later. #60973 • Vector index metric
    hit_count
    is not set. #61102 • Stream Load transactions fail to find the coordinator node. #60154 • BE crashes when loading OOM partitions. #60778 • INSERT OVERWRITE failed on manually created partitions. #60750 • Partition creation failed when partition names matched case-insensitively but had different values. #60909 • The system does not support PostgreSQL UUID type. #61021 • Case sensitivity issue with column names when loading Parquet data via
    FILES()
    . #61059 StarRocks/starrocks
    👍 3
  • b

    Beryl Chen

    08/14/2025, 1:45 PM
    <!channel> Friendly reminder: Today’s session, _"_Why Customer-Facing Analytics Breaks—and How to Build It Right_,"_ is happening at 10 AM PT | 1 PM ET! Save your seat here: https://celerdata.wistia.com/live/events/uqt1a3umn1 Join us as we explore real-world strategies for delivering analytics directly to your customers—whether through dashboards, APIs, or AI agents. We’ll break down the common pitfalls of BI and OLAP systems and share examples of companies that overcame bottlenecks by rebuilding their architectures with StarRocks and Apache Iceberg. (No stress if you’re busy—register to get the recording afterward!)
    👍 8
    🙌 4
    • 1
    • 1
  • b

    Beryl Chen

    08/15/2025, 6:40 PM
    Good news for our Japanese community 🇯🇵 — StarRocks docs are available in Japanese 👉🏻 https://docs.starrocks.io/ja/docs/introduction/StarRocks_intro/ 日本語ドキュメントをご覧いただき、気づいたことや直したほうがいい点があればぜひ教えてください 🙌
    👍 8
    🥰 4
    🇯🇵 2
  • r

    Ronit Kapoor

    08/19/2025, 11:41 PM
    Hey <!channel>! Ron here. You might have seen me at a meetup or chatted with me on Slack, but I’m excited to officially introduce myself as CelerData’s new Developer Advocate! I joined the team almost two months ago and have been ramping up behind the scenes. To that end, I recently wrote a blog introducing myself to the community and wanted to share it here on Slack! https://medium.com/starrocks-engineering/hello-from-your-new-dev-advocate-9e27cec4ffe3 👋 I can’t wait to connect with you all, share what I’m working on, and hear your ideas!
    🙌 10
    dogdance 4
    👍 13
    bufo hello 7
    yay 6
    p
    s
    • 3
    • 2
  • g

    GitHub

    08/22/2025, 11:48 AM
    Release - 3.5.4 New release published by yingtingdong Release Date: August 22, 2025 ### Improvements • Added logs to clarify the reason that tablets cannot be repaired. #61959 • Optimized DROP PARTITION information in logs. #61787 • Assigned a large but configurable row count to tables with unknown stats for statistical estimation. #61332 • Added balance statistic according to label location. #61905 • Added colocate group balance statistics to improve cluster monitoring. #61736 • Skipped the Publish waiting phase when the number of healthy replicas exceeds the default replica count. #61820 • Included the tablet information collection time in the tablet report. #61643 • Supports writing Starlet files with tags. #61605 • Supports viewing cluster balance statistics via SHOW PROC. #61578 • Bumped librdkafka to 2.11.0 to support Kafka 4.0 and removed deprecated configurations. #61698 • Added
    prepared_timeout
    configuration to Stream Load Transaction Interface. #61539 • Upgraded StarOS to v3.5‑rc3. #61685 ### Bug Fixes The following issues have been fixed: • Incorrect Dict version of random distribution tables. #61933 • Incorrect query context in context conditions. #61929 • Publish failures caused by synchronous Publish for shadow tablets during ALTER operations. #61887 • CVE‑2025‑55163 issue. #62041 • Memory leak in real-time data ingestion from Apache Kafka. #61698 • Incorrect count of rebuild files in the lake persistent index. #61859 • Statistics collection on generated expression columns causes cross-database query errors. #61829 • Query Cache misaligns in shared-nothing clusters, causing inconsistent results. #61783 • High memory usage in CatalogRecycleBin due to retaining deleted partition information.#61582 • SQL Server JDBC connections fail when the timeout exceeds 65,535 milliseconds. #61719 • Security Integration fails to encrypt passwords, exposing sensitive information. #60666 •
    MIN()
    and
    MAX()
    functions on Iceberg partition columns return NULL unexpectedly. #61858 • Other predicates of Join containing non‑push‑down subfields were incorrectly rewritten. #61868 • QueryContext cancellation can lead to a use‑after‑free situation. #61897 • CBO’s table pruning overlooks other predicates. #61881 • Partial Updates in
    COLUMN_UPSERT_MODE
    may overwrite auto-increment columns with zero. #61341 • JDBC TIME type conversion uses an incorrect timezone offset that leads to wrong time values. #61783 •
    max_filter_ratio
    was not being serialized in Routine Load jobs. #61755 • Precision loss in the
    now(precision)
    function in Stream Load. #61721 • Cancelling a query may result in a “query id not found” error. #61667 • LDAP authentication may miss PartialResultException, causing incomplete query results. #60667 • Paimon Timestamp timezone conversion issue when the query condition contains DATETIME. #60473 StarRocks/starrocks
    🎉 2