Securiti leads GigaOm's DSPM Vendor Evaluation with top ratings across technical capabilities & business value.

View

Snowflake Migration Best Practices

Published December 10, 2021
Author

Omer Imran Malik

Senior Data Privacy Consultant at Securiti

FIP, CIPT, CIPM, CIPP/US

Listen to the content

Organizations are increasingly adopting data-intensive applications and are choosing to migrate their legacy systems to the Snowflake data cloud, and this is expected to continue. Snowflake revolutionized data processing by enabling customers to process queries at lightning speed, with virtually unlimited workloads running concurrently. It also allows customers to quickly scale up or scale down data processing power according to their needs. Also, the ‘pay-as-you-go’ model enables snowflake customers to optimize their budgetary dollars.

We understand the benefits of migrating to Snowflake. Still, experts recommend some common best practices that can help ensure organizations migrate their data safely and get the most out of Snowflake’s features.

Snowflake Migration Best Practice #1

Ensure Your Technology Stack has the Following Features

  1. End-to-end encrypted connections:
    Data Security teams should secure all connections between on-premises data sources and the Snowflake data cloud with end-to-end encryption. This is important to prevent data leakage and misuse during the migration process.
  2. Dynamic Sensitive Data Masking:
    Ensure that your technology stack has dynamic data masking features to ensure that sensitive data is masked while in transit. This serves as an additional layer of security for sensitive data from unauthorized access.
  3. Data Cleansing:
    Select a user-friendly solution that can cleanse data efficiently, and ensure it is valid and complete before migrating the Data to Snowflake. Effective data cleansing means high data quality.
  4. Data Catalogs:
    Ensure there is a solution that can maintain automated data catalogs of all activities performed during the migration process. The solution should have a continuous, scalable, and auditable data flow for analytics.

Snowflake Migration Best Practice #2

Plan for the following Key Requirements before starting the Snowflake Data Migration Process

  1. Determine Data Storage Requirements:
    Estimate the amount of data/storage and time it may take to migrate. If the storage is more than 50 TB and time is short, consider using physical storage devices to transfer large amounts of data.
  2. Determine your Network’s Speed:
    Determine the bandwidth and connectivity available between your on-premises server to Snowflake (e.g., Direct connect, Region/location of the source and target, etc.). This will determine how much time the actual data migration will take.
  3. Determine Role-based Data Access needs:
    Discuss data access needs to understand who will be using this data, the access frequency, and how fast they want to access.
  4. Set Achievable Timelines:
    All of the factors above contribute to setting achievable timelines for migration. For example, there might be fixed deadlines to offload data from the on-premises database. Tight deadlines complicate the Snowflake data migration process as unforeseen problems (e.g. network breakdowns, equipment malfunctions, etc.) might impact the project’s timelines. It is advisable to keep a buffer when you are planning timelines.
  5. Use the new ELT approach to data migration:
    ELT refers to “Extract, Load, Transform,” and is a modern variation on the older process of “Extract, Transform, and Load (ETL)”. ETL runs transformations before the data is loaded to the data cloud, resulting in a more complex, lengthy, and expensive migration process.On the other hand, ELT transforms data after it is loaded to the data cloud. This means that organizations can transform their raw data at any time, when and as necessary, streamlining the data loading process and saving resources. ELT is beneficial for cloud-native data warehouses like Snowflake because data transformation happens within the target destination itself.

Snowflake Migration Best Practice #3

Plan and Manage Costs Effectively

The pay-as-you-go model is a major reason why companies deploy Snowflake. The model reduces infrastructure costs (because most of the data is migrated to the cloud) and allows companies to re-allocate capital efficiently.

To forecast costs accurately, it is crucial to determine exactly how many resources your company will be using in a given month. Planning teams should forecast costs before initiating the Snowflake data migration process.

The following questions will help you forecast accurately:

  • Which roles should have access to Snowflake, what privileges they have, and why they need them?
    • Your data governance policies can help answer the access and privileges part of this question. To understand why users need access and other privileges, you need to dive deep into their roles and responsibilities. Ensure that access is granted only to users who absolutely need it and understand how the per-query pricing model works.
  • What are the typical data workflows, data usage scenarios, and storage/compute requirements?
    • Snowflake invoices its customers only for what storage and computing power they use. For instance, Snowflake storage costs can begin at a flat rate of $23/TB/month. Compute costs start from $0.00056 per second, per credit, for the On-Demand Standard Edition. So, it is crucial to determine this part to control costs.
  • Which data must be moved to Snowflake, and which data should remain on-premises?
    • Efficiently balancing data storage between on-premises and Snowflake will help optimize your cost structure even more.

Secure Snowflake Data Migration with Securiti

Securiti has designed a customized solution that integrates natively with Snowflake and simplifies Data Governance, privacy, and security with automation.

Data Governance for Snowflake

Securiti incorporates all of the Data Governance features in Snowflake and simplifies policy enforcement with automation. Once Data Governance policies are set up, the solution can continuously monitor data access and usage configurations, with automatic alerts that flag any misconfigurations.

The solution also incorporates:

  • Dynamic Data masking based on roles and policies to restrict access & usage of sensitive data from unauthorized personnel.
  • Table, column, and even row-level access policy enforcement.
  • User access history audit to detect any non-compliance with governance policies.

Learn more about Securit’s Data Governance features for Snowflake

Data Privacy for Snowflake

Securiti specializes in providing cutting-edge, A.I-powered data privacy solutions that automate:

  • Data Mapping and Classification of personal data.
  • Quick and accurate DSR fulfillment.
    • Using a conversational interface (Auti) you can extract any individual’s personal data within minutes.
  • Comprehensive Privacy Risk Assessments that enable a proactive approach to risk mitigation.
  • Data Breach Management Notifications that meet strict regulatory requirements and notify all impacted parties as quickly as possible.
  • A Workflow Orchestration feature that uses a simple drag-and-drop design and helps automate various privacy, governance, and security functions within Snowflake.

Data Security for Snowflake

Securiti’s solution also incorporates all of Snowflake’s native data security features, including:

  • Network Security:
    • Site access is controlled through IP allow and block lists, managed through network policies.
  • Account/user authentication:
    • MFA (multi-factor authentication) for users' increased security for account access.
    • Automated security scanning of any misconfigurations. Snowflake Security Administrators can decide to remediate any misconfigurations automatically or receive notifications.
  • Compliance with Data Regulations like PCI-DSS, HIPAA, and more.
    • Map security policies to specific standard controls and regulatory compliance.
    • Generate one-click reports to demonstrate compliance coverage to regulators and auditors for various data privacy and security regulations.

Learn more about Securit’s Data Security features for Snowflake

Data Governance

Data governance is crucial to effective and compliant data management in Snowflake. Governance teams must formulate and enforce policies at a granular level using a technology solution. The technology solution should have continuous monitoring capabilities that can automatically report any policy violations to data governance teams.

Join Our Newsletter

Get all the latest information, law updates and more delivered to your inbox


Share


More Stories that May Interest You

Videos

View More

Mitigating OWASP Top 10 for LLM Applications 2025

Generative AI (GenAI) has transformed how enterprises operate, scale, and grow. There’s an AI application for every purpose, from increasing employee productivity to streamlining...

View More

DSPM vs. CSPM – What’s the Difference?

While the cloud has offered the world immense growth opportunities, it has also introduced unprecedented challenges and risks. Solutions like Cloud Security Posture Management...

View More

Top 6 DSPM Use Cases

With the advent of Generative AI (GenAI), data has become more dynamic. New data is generated faster than ever, transmitted to various systems, applications,...

View More

Colorado Privacy Act (CPA)

What is the Colorado Privacy Act? The CPA is a comprehensive privacy law signed on July 7, 2021. It established new standards for personal...

View More

Securiti for Copilot in SaaS

Accelerate Copilot Adoption Securely & Confidently Organizations are eager to adopt Microsoft 365 Copilot for increased productivity and efficiency. However, security concerns like data...

View More

Top 10 Considerations for Safely Using Unstructured Data with GenAI

A staggering 90% of an organization's data is unstructured. This data is rapidly being used to fuel GenAI applications like chatbots and AI search....

View More

Gencore AI: Building Safe, Enterprise-grade AI Systems in Minutes

As enterprises adopt generative AI, data and AI teams face numerous hurdles: securely connecting unstructured and structured data sources, maintaining proper controls and governance,...

View More

Navigating CPRA: Key Insights for Businesses

What is CPRA? The California Privacy Rights Act (CPRA) is California's state legislation aimed at protecting residents' digital privacy. It became effective on January...

View More

Navigating the Shift: Transitioning to PCI DSS v4.0

What is PCI DSS? PCI DSS (Payment Card Industry Data Security Standard) is a set of security standards to ensure safe processing, storage, and...

View More

Securing Data+AI : Playbook for Trust, Risk, and Security Management (TRiSM)

AI's growing security risks have 48% of global CISOs alarmed. Join this keynote to learn about a practical playbook for enabling AI Trust, Risk,...

Spotlight Talks

Spotlight 14:21

AI Governance Is Much More than Technology Risk Mitigation

AI Governance Is Much More than Technology Risk Mitigation
Watch Now View
Spotlight 12:!3

You Can’t Build Pipelines, Warehouses, or AI Platforms Without Business Knowledge

Watch Now View
Spotlight 47:42

Cybersecurity – Where Leaders are Buying, Building, and Partnering

Rehan Jalil
Watch Now View
Spotlight 27:29

Building Safe AI with Databricks and Gencore

Rehan Jalil
Watch Now View
Spotlight 46:02

Building Safe Enterprise AI: A Practical Roadmap

Watch Now View
Spotlight 13:32

Ensuring Solid Governance Is Like Squeezing Jello

Watch Now View
Spotlight 40:46

Securing Embedded AI: Accelerate SaaS AI Copilot Adoption Safely

Watch Now View
Spotlight 10:05

Unstructured Data: Analytics Goldmine or a Governance Minefield?

Viral Kamdar
Watch Now View
Spotlight 21:30

Companies Cannot Grow If CISOs Don’t Allow Experimentation

Watch Now View
Spotlight 2:48

Unlocking Gen AI For Enterprise With Rehan Jalil

Rehan Jalil
Watch Now View

Latest

View More

From Trial to Trusted: Securely Scaling Microsoft Copilot in the Enterprise

AI copilots and agents embedded in SaaS are rapidly reshaping how enterprises work. Business leaders and IT teams see them as a gateway to...

The ROI of Safe Enterprise AI View More

The ROI of Safe Enterprise AI: A Business Leader’s Guide

The fundamental truth of today’s competitive landscape is that businesses harnessing data through AI will outperform those that don’t. Especially with 90% of enterprise...

Data Security Governance View More

Data Security Governance: Key Principles and Best Practices for Protection

Learn about Data Security Governance, its importance in protecting sensitive data, ensuring compliance, and managing risks. Best practices for securing data.

AI TRiSM View More

What is AI TRiSM and Why It’s Essential in the Era of GenAI

The launch of ChatGPT in late 2022 was a watershed moment for AI, introducing the world to the possibilities of GenAI. After OpenAI made...

Managing Privacy Risks in Large Language Models (LLMs) View More

Managing Privacy Risks in Large Language Models (LLMs)

Download the whitepaper to learn how to manage privacy risks in large language models (LLMs). Gain comprehensive insights to avoid violations.

View More

Top 10 Privacy Milestones That Defined 2024

Discover the top 10 privacy milestones that defined 2024. Learn how privacy evolved in 2024, including key legislations enacted, data breaches, and AI milestones.

Comparison of RoPA Field Requirements Across Jurisdictions View More

Comparison of RoPA Field Requirements Across Jurisdictions

Download the infographic to compare Records of Processing Activities (RoPA) field requirements across jurisdictions. Learn its importance, penalties, and how to navigate RoPA.

Navigating Kenya’s Data Protection Act View More

Navigating Kenya’s Data Protection Act: What Organizations Need To Know

Download the infographic to discover key details about navigating Kenya’s Data Protection Act and simplify your compliance journey.

Gencore AI and Amazon Bedrock View More

Building Enterprise-Grade AI with Gencore AI and Amazon Bedrock

Learn how to build secure enterprise AI copilots with Amazon Bedrock models, protect AI interactions with LLM Firewalls, and apply OWASP Top 10 LLM...

DSPM Vendor Due Diligence View More

DSPM Vendor Due Diligence

DSPM’s Buyer Guide ebook is designed to help CISOs and their teams ask the right questions and consider the right capabilities when looking for...

What's
New