Requires anyone that creates or alters a training dataset for a generative AI system to submit a notice to the Register of Copyrights detailing copyrighted works used in the dataset 30 days before the model being made available to consumers or 30 days after the Act takes effect. Imposes a $5,000 penalty for non-compliance and instructs the Register to create a public database of filed notices.
Requires anyone who creates or alters a training dataset used for a generative AI system to submit a notice to the Register of Copyrights with a summary of any copyrighted works used in the dataset and the URL for the dataset.
Requires the notice must be submitted 30 days before the AI system is made available to consumers or within 30 days of the Act taking effect for systems already available.
Imposes a civil penalty of at least $5,000 for non-compliance.
Requires the Register to establish and maintain a public database of filed notices.
Key facts
🏛️ This document was proposed and/or enacted by the United States Congress but is now defunct.
For authoritative text and metadata, visit the official source.
🎯 This document primarily applies to the private sector, rather than the government.
📜 This document's name is Generative AI Copyright Disclosure Act of 2024.
AGORA also tracks this document under the name GenAI Copyright Disclosure Act.
Themes AI risks, applications, governance strategies, and other themes addressed in AGORA documents.
This is an unofficial copy. The document has been
archived and reformatted in plaintext for AGORA. Footnotes, tables, and
similar material may be omitted. For the official text, visit the original source.
H. R. 7913
To require a notice be submitted to the Register of Copyrights with respect to copyrighted works used in building generative AI systems, and for other purposes.
IN THE HOUSE OF REPRESENTATIVES
April 9, 2024
Mr. Schiff introduced the following bill; which was referred to the Committee on the Judiciary
A BILL
To require a notice be submitted to the Register of Copyrights with respect to copyrighted works used in building generative AI systems, and for other purposes.
Be it enacted by the Senate and House of Representatives of the United States of America in Congress assembled,
Requires submission of notice to Register of Copyrights for copyrighted works in generative AI systems.
Requires submission of notice to Register of Copyrights for copyrighted works in generative AI systems.
SECTION 1. SHORT TITLE.
This Act may be cited as the “Generative AI Copyright Disclosure Act of 2024”.
Cites this Act as the “Generative AI Copyright Disclosure Act of 2024”.
Cites this Act as the “Generative AI Copyright Disclosure Act of 2024”.
SEC. 2. NOTICE TO BE SUBMITTED TO THE REGISTER OF COPYRIGHTS WITH RESPECT TO COPYRIGHTED WORKS USED IN BUILDING GENERATIVE AI SYSTEMS.
(a) Notice.—
(1) REQUIREMENT.—A person who creates a training dataset, or alters a training dataset (including by making an update to, refining, or retraining the dataset) in a significant manner, that is used in building a generative AI system shall submit to the Register a notice that contains—
(A) a sufficiently detailed summary of any copyrighted works used—
(i) in the training dataset (in the case that the person creates the dataset); or
(ii) to alter the training dataset (in the case that the person alters the training data in a significant manner); and
(B) the URL for such dataset (in the case of a training dataset that is publicly available on the internet at the time the notice is submitted).
(2) TIME FOR FILING NOTICE.—The notice required by paragraph (1) shall be submitted—
(A) not later than 30 days before the generative AI system with respect to which the training dataset is used is made available to consumers, in the case that the generative AI system is first made available to consumers after the date on which this Act takes effect; and
(B) not later than 30 days after the date on which this Act takes effect, in the case that the generative AI system with respect to which the training dataset was used was made available to consumers before the effective date of this Act.
Requires submitting a notice detailing copyrighted works used in AI training datasets to the Register of Copyrights 30 days before the AI system is made public or within 30 days of the Act going in effect for existing systems.
Requires submitting a notice detailing copyrighted works used in AI training datasets to the Register of Copyrights 30 days before the AI system is made public or within 30 days of the Act going in effect for existing systems.
(b) Civil Penalty.—
(1) ASSESSMENT.—Any person described under paragraph (1) of subsection (a) that fails to comply with a requirement under such subsection shall be assessed a civil penalty in an amount not less than $5,000.
(2) REGULATIONS.— Not later than 180 days after the date on which this Act takes effect, the Register shall issue regulations to implement the requirement under paragraph (1).
(c) Database.—The Register shall establish and maintain a publicly available online database that contains each notice filed under subsection (a)(1).
Mandates a minimum $5,000 civil penalty for non-compliance and instructs the Register to maintain a public database of filed notices.
Mandates a minimum $5,000 civil penalty for non-compliance and instructs the Register to maintain a public database of filed notices.
(d) Definitions.—In this section:
(1) ARTIFICIAL INTELLIGENCE.—The term “Artificial Intelligence” means an automated system designed to perform a task typically associated with human intelligence or cognitive function.
(2) COPYRIGHTED WORK.—The term “copyrighted work” means a work protected in the United States under a law relating to copyrights.
(3) GENERATIVE AI MODEL.—The term “generative AI model” means a combination of computer code and numerical values designed to use Artificial Intelligence to generate outputs in the form of expressive material such as text, images, audio, or video.
(4) GENERATIVE AI SYSTEM.—The term “generative AI system” means a software product or service that—
(A) substantially incorporates one or more generative AI models; and
(B) is designed for use by consumers.
(5) REGISTER.—The term “Register” means the Register of Copyrights.
(6) TRAINING DATASET.—The term “training dataset” means a collection of individual units of material (including a combination of text, images, audio, or other categories of expressive material, as well as annotations describing the material) used to train a generative AI model.
(e) Effective Date.—This Act shall take effect on the date that is 180 days after the date of the enactment of this Act.
Defines artificial intelligence, copyrighted work, generative AI model, generative AI system, and training dataset; sets the Act's effective date to be 180 days after enactment.
Defines artificial intelligence, copyrighted work, generative AI model, generative AI system, and training dataset; sets the Act's effective date to be 180 days after enactment.