Securiti launches Gencore AI, a holistic solution to build Safe Enterprise AI with proprietary data - easily

View

Assembly Bill 2013: Generative Artificial Intelligence: Training Data Transparency

Author

Sadaf Ayub Choudary

Associate Data Privacy Analyst at Securiti

CIPP/US

Listen to the content

California Assembly Bill 2013 (AB 2013) on Generative Artificial Intelligence: Training Data Transparency was signed into law on September 28, 2024, after the State Assembly and the State Senate approved it.

The law introduces transparency requirements for generative AI (GenAI) system developers. It mandates that developers publicly disclose information about the data used to train and test their GenAI models. GenAI systems and services used for purposes related to national security, military, or defense are exempt from such requirements.

The law addresses growing regulatory and public concerns around model bias, privacy, and other ethical accountability factors. To that end, it serves as a vital first step in a direction that would require developers to be more transparent about their backend development processes. This law helps Californians better understand how AI systems work while promoting responsible innovation.

Read on to learn more about the law in greater detail.

Who Does the Law Apply To?

The law applies to developers of generative artificial intelligence (AI) systems or services or entities that substantially modify such systems. The term "developer" includes any person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an AI system or service for use by members of the public. Members of the public exclude:

  • Affiliate- entities that, directly or indirectly, through one or more intermediaries, controls, is controlled by, or is under common control with, another entity. This means the requirement to post public documentation under AB 2013 only applies when AI systems are made available outside an organization's internal or affiliated network.
  • Members of a hospital's medical staff.

The phrase “substantially modifies it”  means creating a new version, new release, or other update to a generative artificial intelligence system or service that materially changes its functionality or performance, including the results of retraining or fine-tuning.

What Does It Regulate?

The law regulates “generative artificial intelligence,” defined as AI that can generate derived synthetic content, such as text, images, video, and audio, that emulates the structure and characteristics of the artificial intelligence’s training data.”  The regulation applies to systems or services released on or after January 1, 2022.

Obligations on Developers

Developers are required to post specific documentation about the training data on their public websites by January 1, 2026 (or prior to substantial modifications). The documentation must include:

  • Sources or owners of the datasets.
  • A description of how the datasets align with the intended purpose of the AI system.
  • Number and types of data points in the datasets.
  • Whether the datasets contain copyrighted, trademarked, patented, or public domain information.
  • Whether the developer purchased or licensed the datasets.
  • Whether the datasets include ‘personal information’ or ‘aggregate consumer information’.
  • Whether the developer cleaned, processed, or modified the datasets and the intended purpose of those efforts in relation to the AI system or service;
  • The time period of data collection and whether data collection is ongoing.
  • The time period during which the data in the datasets was collected, including a notice if the data collection is ongoing.
  • Information about synthetic data generation, if used.

Exemptions

Certain AI systems or services are exempt from the training data transparency requirements:

  • AI systems or services solely used for security and integrity purposes.
  • AI systems or services used for the operation of aircraft in the national airspace.
  • AI systems or services developed for national security, military, or defense purposes, only available to federal entities.

Key Takeaway

Maintaining a data provenance record is crucial for compliance with Assembly Bill 2013, which mandates transparency regarding the datasets used to train generative AI systems. By accurately tracking datasets' origin, ownership, modifications, and usage, businesses can meet the law’s requirements to disclose how data supports AI functionality, whether it contains personal or sensitive information, and if any synthetic data is used.

Join Our Newsletter

Get all the latest information, law updates and more delivered to your inbox


Share


More Stories that May Interest You

Videos

View More

Mitigating OWASP Top 10 for LLM Applications 2025

Generative AI (GenAI) has transformed how enterprises operate, scale, and grow. There’s an AI application for every purpose, from increasing employee productivity to streamlining...

View More

DSPM vs. CSPM – What’s the Difference?

While the cloud has offered the world immense growth opportunities, it has also introduced unprecedented challenges and risks. Solutions like Cloud Security Posture Management...

View More

Top 6 DSPM Use Cases

With the advent of Generative AI (GenAI), data has become more dynamic. New data is generated faster than ever, transmitted to various systems, applications,...

View More

Colorado Privacy Act (CPA)

What is the Colorado Privacy Act? The CPA is a comprehensive privacy law signed on July 7, 2021. It established new standards for personal...

View More

Securiti for Copilot in SaaS

Accelerate Copilot Adoption Securely & Confidently Organizations are eager to adopt Microsoft 365 Copilot for increased productivity and efficiency. However, security concerns like data...

View More

Top 10 Considerations for Safely Using Unstructured Data with GenAI

A staggering 90% of an organization's data is unstructured. This data is rapidly being used to fuel GenAI applications like chatbots and AI search....

View More

Gencore AI: Building Safe, Enterprise-grade AI Systems in Minutes

As enterprises adopt generative AI, data and AI teams face numerous hurdles: securely connecting unstructured and structured data sources, maintaining proper controls and governance,...

View More

Navigating CPRA: Key Insights for Businesses

What is CPRA? The California Privacy Rights Act (CPRA) is California's state legislation aimed at protecting residents' digital privacy. It became effective on January...

View More

Navigating the Shift: Transitioning to PCI DSS v4.0

What is PCI DSS? PCI DSS (Payment Card Industry Data Security Standard) is a set of security standards to ensure safe processing, storage, and...

View More

Securing Data+AI : Playbook for Trust, Risk, and Security Management (TRiSM)

AI's growing security risks have 48% of global CISOs alarmed. Join this keynote to learn about a practical playbook for enabling AI Trust, Risk,...

Spotlight Talks

Spotlight 47:42

Cybersecurity – Where Leaders are Buying, Building, and Partnering

Rehan Jalil
Watch Now View
Spotlight 46:02

Building Safe Enterprise AI: A Practical Roadmap

Watch Now View
Spotlight 13:32

Ensuring Solid Governance Is Like Squeezing Jello

Watch Now View
Spotlight 40:46

Securing Embedded AI: Accelerate SaaS AI Copilot Adoption Safely

Watch Now View
Spotlight 10:05

Unstructured Data: Analytics Goldmine or a Governance Minefield?

Viral Kamdar
Watch Now View
Spotlight 21:30

Companies Cannot Grow If CISOs Don’t Allow Experimentation

Watch Now View
Spotlight 2:48

Unlocking Gen AI For Enterprise With Rehan Jalil

Rehan Jalil
Watch Now View
Spotlight 13:35

The Better Organized We’re from the Beginning, the Easier it is to Use Data

Watch Now View
Spotlight 13:11

Securing GenAI: From SaaS Copilots to Enterprise Applications

Rehan Jalil
Watch Now View
Spotlight 47:02

Navigating Emerging Technologies: AI for Security/Security for AI

Rehan Jalil
Watch Now View

Latest

View More

Accelerating Safe Enterprise AI with Gencore Sync & Databricks

We are delighted to announce new capabilities in Gencore AI to support Databricks' Mosaic AI and Delta Tables! This support enables organizations to selectively...

View More

Building Safe, Enterprise-grade AI with Securiti’s Gencore AI and NVIDIA NIM

Businesses are rapidly adopting generative AI (GenAI) to boost efficiency, productivity, innovation, customer service, and growth. However, IT & AI executives—particularly in highly regulated...

Key Differences from DLP & CNAPP View More

Why DSPM is Critical: Key Differences from DLP & CNAPP

Learn about the critical differences between DSPM vs DLP vs CNAPP and why a unified, data-centric approach is an optimal solution for robust data...

DSPM Trends View More

DSPM in 2025: Key Trends Transforming Data Security

DSPM trends in 2025 provides a quick glance at the challenges, risks, and best practices that can help security leaders evolve their data security...

The Future of Privacy View More

The Future of Privacy: Top Emerging Privacy Trends in 2025

Download the whitepaper to gain insights into the top emerging privacy trends in 2025. Analyze trends and embed necessary measures to stay ahead.

View More

Personalization vs. Privacy: Data Privacy Challenges in Retail

Download the whitepaper to learn about the regulatory landscape and enforcement actions in the retail industry, data privacy challenges, practical recommendations, and how Securiti...

Nigeria's DPA View More

Navigating Nigeria’s DPA: A Step-by-Step Compliance Roadmap

Download the infographic to learn how Nigeria's Data Protection Act (DPA) mapping impacts your organization and compliance strategy.

Decoding Data Retention Requirements Across US State Privacy Laws View More

Decoding Data Retention Requirements Across US State Privacy Laws

Download the infographic to explore data retention requirements across US state privacy laws. Understand key retention requirements and noncompliance penalties.

Gencore AI and Amazon Bedrock View More

Building Enterprise-Grade AI with Gencore AI and Amazon Bedrock

Learn how to build secure enterprise AI copilots with Amazon Bedrock models, protect AI interactions with LLM Firewalls, and apply OWASP Top 10 LLM...

DSPM Vendor Due Diligence View More

DSPM Vendor Due Diligence

DSPM’s Buyer Guide ebook is designed to help CISOs and their teams ask the right questions and consider the right capabilities when looking for...

What's
New