A modern digital illustration depicting an optimized ETL workflow and Redshift cluster efficiency for a retail analytics case. The design features glowing data pipelines, digital dashboards, and abstract cloud icons, symbolizing efficient data flow and technology. Subtle retail elements, such as a shopping cart icon blended with data visuals, convey the integration of retail and tech. The image uses a minimalist style with gradient hues of blue, green, and white, creating a futuristic and innovative appearance.

Improving Weekly ETL Dependencies and Redshift Cluster Runtime for a Global Retailer

Categories

Sign up for our newsletter

We care about the protection of your data. Read our Privacy Policy.

Information

The Challenge

Technical Implementation

Run a query to identify all automated daily/weekly jobs by run time, final status, who created the job, who updated the job last.

Reach out to all data job owners and have them review if the job is still needed to run every day/week.

Identify employees that are no longer in the org and reach out to the team to identify if the scheduled jobs are still needed.

De-schedule all unneeded jobs.

Review all active queries for improvement opportunities.

Check if Metric jobs can be linked to the same job and remove duplicates.

Reach out to Job owners with improvements and review best practices to avoid further cluster slowdown.

Immediately, 20% of all daily and weekly running jobs were de-scheduled, either due to employees leaving the organization or the jobs no longer being needed.

A focus on the highest runtime jobs revealed several created by SQL beginners. Obvious syntax issues were addressed, and additional questions, such as those related to reporting time frames, were clarified with the creators. These improvements reduced the average runtime for these jobs from 60 minutes to an average of 5 minutes.

For weekly business reviews, several repeating queries were consolidated into one main query, and the rest were de-scheduled. A single job can be tied to several metric jobs, the benefit the query only has to be run once and the results can be shared across metric jobs.

For other queries, Junior Business Analysts were guided on best practices to improve queries, like temp tables, review of joins and creation of static table, etc.

Results

Subscribe to our newsletter

Stay informed with the latest insights, industry trends, and expert tips delivered straight to your inbox. Sign up for our newsletter today and never miss an update!

We care about the protection of your data. Read our Privacy Policy.

Keep reading

Dig deeper into data development by browsing our blogs…

Get in Touch

Let us leverage your data so that you can make smarter decisions. Talk to our team of data experts today or fill in this form and we’ll be in touch.