Program! Check out the free webinar or register for educations. You can always look back via recorded webinars.

scroll

Data warehouse

Optimization in Snowflake

The data warehouse in Snowflake is the core of the data & BI environment for many of our customers. They come in different sizes and configurations, each with an impact on performance, credit usage and overall efficiency. The text below explains general concepts and things to consider when setting up a data warehouse in Snowflake.

Types and sizes of data warehouses

There are two primary types of warehouses: Standard and Snowpark-optimized. The types range from X-Small to 6X-Large and offer different levels of computing power. The size of the data warehouse in turn determines the credit consumption and the execution possibilities of orders.

Impact on credit usage in the data warehouse

Snowflake operates on a credit system, distinguishing between storage and compute units. A data warehouse’s credit usage increases with its size and duration of activity. While larger data warehouses consume more credits per hour, Snowflake’s per-second billing ensures that you only pay for the resources you actually use.

Credit consumption based on data warehouse size and term:

  • X-Small: Minimal use for short term
  • X-Large: Moderate consumption, suitable for many scenarios
  • 6X-Large: High computational efficiency, ideal for complex operations

Performance Considerations

Interestingly, the size of a data warehouse does not always guarantee faster performance for all tasks. For example, when loading data, the number and size of files is more important than the size of the data warehouse. Using a larger data warehouse may not improve performance unless you are processing a significant amount of files simultaneously.

Managing concurrent jobs

The number of concurrent jobs a data warehouse processes depends on several factors. Snowflake provides options for managing job processing and concurrency, allowing users to manage the queue and resource allocation based on job complexity.

Automation for efficiency

Snowflake warehouses provide auto suspend and resume job capabilities. These capabilities help optimize credit utilization by automatically suspending inactive warehouses and resuming them when jobs are submitted, ensuring efficient resource utilization.

Setting Standards

Snowflake enables you to set up default warehouses for different users and purposes, automatically assigning warehouses to predictable tasks.

So how do I choose the right type of data warehouse?

Selecting the right warehouse size is all about finding a balance between compute capacity, credit consumption, and the nature of the jobs. Understanding workloads and the impact of warehouse size on performance is essential for optimizing resource utilization. Effectively utilizing Snowflake warehouses requires some Snowflake expertise in terms of workloads, job complexity, and the tradeoff between compute capacity and credit consumption. Fine-tuning warehouse configurations provides the perfect balance between performance and cost.

Contact

Get informed about Snowflake

Are you already working with Snowflake? Or are you planning to start working with Snowflake in the near future? Definitely make an appointment to look at the possibilities together!

Contact Snowflake
Name
This form is protected by reCAPTCHA, the privacy policy and with terms of service from Google apply.

Did you know that your browser is outdated?

To get the best possible user experience of our website, we recommend that you upgrade your browser to a newer version or a different browser. Click on the upgrade button to go to the download page.

Upgrade your browser here
Proceed at your own risk