Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PEP 763: Limiting deletions on PyPI #4080

Merged
merged 36 commits into from
Oct 28, 2024
Merged
Show file tree
Hide file tree
Changes from 35 commits
Commits
Show all changes
36 commits
Select commit Hold shift + click to select a range
6176ded
drafting deletion PEP
woodruffw Oct 8, 2024
7db6359
draft: add Alexis as author
woodruffw Oct 16, 2024
ba5490a
Update download count
DarkaMaul Oct 17, 2024
27ab348
Update peps/pep-9999.rst
woodruffw Oct 17, 2024
15db84b
Update peps/pep-9999.rst
woodruffw Oct 17, 2024
4190787
Update peps/pep-9999.rst
woodruffw Oct 17, 2024
54af5b7
cite Catch-22
woodruffw Oct 17, 2024
a817cbb
use a more Python-specific stat
woodruffw Oct 17, 2024
1b1b646
Apply suggestions from code review
woodruffw Oct 17, 2024
ebb5456
codecov example
woodruffw Oct 17, 2024
600fa37
Merge branch 'ww/deletion-pep' of github.com:trail-of-forks/peps into…
DarkaMaul Oct 17, 2024
df34ddf
add pre-release exception
woodruffw Oct 17, 2024
e994629
fix grammar
woodruffw Oct 17, 2024
f2e3f21
Revert collapsing changes
DarkaMaul Oct 17, 2024
9f3a2d2
small tweaks
woodruffw Oct 24, 2024
0f72405
add implementation deets
woodruffw Oct 24, 2024
5bed2ce
Update pep-9999.rst
woodruffw Oct 24, 2024
689bfa5
PEP: rename to PEP 763
woodruffw Oct 24, 2024
ecccdc0
CODEOWNERS: record PEP 763
woodruffw Oct 24, 2024
f7a9e74
PEP 763: fix backticks
woodruffw Oct 24, 2024
2272ebb
Apply suggestions from code review
woodruffw Oct 25, 2024
afbc32d
Merge branch 'main' into ww/deletion-pep
woodruffw Oct 25, 2024
1ebea39
Apply suggestions from code review
woodruffw Oct 25, 2024
46e6b23
PEP 763: fix blockquote
woodruffw Oct 25, 2024
5e28051
CODEOWNERS: reorder
woodruffw Oct 28, 2024
79115ed
PEP 763: cleanup, remove licensing bits
woodruffw Oct 28, 2024
a85925c
PEP 763: use release consistently
woodruffw Oct 28, 2024
aad73dd
PEP 763: subsection
woodruffw Oct 28, 2024
d451f3f
Apply suggestions from code review
woodruffw Oct 28, 2024
962c6f4
Update peps/pep-0763.rst
woodruffw Oct 28, 2024
449a9d9
PEP 763: remove redundant justification
woodruffw Oct 28, 2024
d7204a7
PEP 763: positive security
woodruffw Oct 28, 2024
ed4733a
PEP 763: add banner
woodruffw Oct 28, 2024
250c9c3
PEP 763: elaborate on security
woodruffw Oct 28, 2024
a800aaa
Merge branch 'main' into ww/deletion-pep
woodruffw Oct 28, 2024
1f3dbcc
Update peps/pep-0763.rst
woodruffw Oct 28, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .github/CODEOWNERS
Original file line number Diff line number Diff line change
Expand Up @@ -640,6 +640,7 @@ peps/pep-0759.rst @warsaw
peps/pep-0760.rst @pablogsal @brettcannon
peps/pep-0761.rst @sethmlarson @hugovk
peps/pep-0762.rst @pablogsal @ambv @lysnikolaou @emilyemorehouse
peps/pep-0763.rst @dstufft
# ...
peps/pep-0777.rst @warsaw
# ...
Expand Down
269 changes: 269 additions & 0 deletions peps/pep-0763.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,269 @@
PEP: 763
Title: Limiting deletions on PyPI
Author: William Woodruff <[email protected]>,
Alexis Challande <[email protected]>
Sponsor: Donald Stufft <[email protected]>
PEP-Delegate: Donald Stufft <[email protected]>
Status: Draft
Type: Standards Track
Topic: Packaging
Created: 07-Oct-2024
woodruffw marked this conversation as resolved.
Show resolved Hide resolved
Post-History: `09-Jul-2022 <https://discuss.python.org/t/17227>`__,
`01-Oct-2024 <https://discuss.python.org/t/66351>`__

Abstract
========

We propose limiting when users can delete files, releases, and projects
from PyPI. A project, release, or file may only be deleted within 72 hours
of when it is uploaded to the index. From this point, users may only use
the "yank" mechanism specified by :pep:`592`.

An exception to this restriction is made for releases and files that are
marked with :ref:`pre-release specifiers <packaging:version-specifiers>`,
which will remain deletable at any time.
The PyPI administrators will retain the ability to delete files, releases,
and projects at any time, for example for moderation or security purposes.

Rationale and Motivation
========================

As observed in :pep:`592`, user-level deletion of projects on PyPI
enables a catch-22 situation of dependency breakage:

Whenever a project detects that a particular release on PyPI might be
broken, they oftentimes will want to prevent further users from
inadvertently using that version. However, the obvious solution of
deleting the existing file from a repository will break users who have
pinned to a specific version of the project.

This leaves projects in a catch-22 situation where new projects may be pulling
down this known broken version, but if they do anything to prevent that they’ll
break projects that are already using it.
Comment on lines +40 to +42
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This leaves projects in a catch-22 situation where new projects may be pulling
down this known broken version, but if they do anything to prevent that they’ll
break projects that are already using it.
This leaves a project author in a catch-22 situation where other projects may be pulling
down this known broken version, but if the project author does anything to prevent that, it
breaks projects where the dependency has been pinned for use.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as https://github.com/python/peps/pull/4080/files#r1817037637 -- I like this rephrasing but I don't know whether it's kosher to rewrite a direct quote 😅


On a technical level, the problem of deletion is mitigated by
"yanking," also specified in :pep:`592`. However, deletions continue to be
allowed on PyPI, and have caused multiple notable disruptions to the Python
ecosystem over the interceding years:

* July 2022: `atomicwrites <https://pypi.org/project/atomicwrites/>`_
was `deleted by its maintainer <https://github.com/untitaker/python-atomicwrites/issues/61>`_
in an attempt to remove the project's "critical" designation, without the
maintainer realizing that project deletion would also delete all previously
uploaded releases.

The project was subsequently restored with the maintainer's consent,
but at the cost of manual administrator action and extensive downstream
breakage to projects like `pytest <https://github.com/pytest-dev/pytest/issues/10114>`_.
As of October 2024, atomicwrites is archived but still has
around `4.5 million monthly downloads from PyPI <https://pypistats.org/packages/atomicwrites>`_.

* April 2023: `codecov <https://pypi.org/project/codecov/>`_ was deleted by
its maintainers after a long deprecation period. This caused extensive
breakage for many of Codecov's CI/CD users, who were unaware of the
deprecation period due to limited observability of deprecation warnings
within CI/CD logs.

The project was
`subsequently re-created <https://about.codecov.io/blog/message-regarding-the-pypi-package/>`_
by its maintainers, with a new release published to compensate for the deleted releases
(which were not restored), meaning that any pinned installations remained
broken. As of October 2024, this single release remains the only release on
PyPI and has around
`1.5 million monthly downloads <https://pypistats.org/packages/codecov>`_.

* June 2023: `python-sonarqube-api <https://pypi.org/project/python-sonarqube-api/>`_
deleted all released releases prior to 2.0.2.

The project's maintainer subsequently
`deleted conversations <https://discuss.python.org/t/stop-allowing-deleting-things-from-pypi/17227/114>`_
and force-pushed over the tag history for ``python-sonarqube-api``'s source
repository, impeding efforts by users to compare changes between
releases.

* June 2024: `PySimpleGUI <https://pypi.org/project/PySimpleGUI/>`_ changed
licenses and deleted
`nearly all previous releases <https://discuss.python.org/t/48790/27>`_.
This resulted in widespread disruption for users, who (prior to the
relicensing) were downloading PySimpleGUI approximately 25,000 times a day.

In addition to their disruptive effect on downstreams, deletions
also have detrimental effects on PyPI's sustainability as well as the overall
security of the ecosystem:

* Deletions increase support workload for PyPI's administrators and
moderators, as users mistakenly file support requests believing that PyPI
is broken, or that the administrators themselves have removed the
project.

* Deletions impair external (meaning end-user) incident response and analysis,
making it difficult to distinguish "good faith" maintainer behavior from
a malicious actor attempting the cover their tracks.

The Python ecosystem is continuing to grow,
meaning that future deletions of projects can be reasonably assumed to
be *just, as if not more,* disruptive than the deletions sampled above.

Given the above, this PEP concludes that deletions now present a greater risk
and detriment to the Python ecosystem than a benefit.

Specification
=============

There are three different types of deletable objects:

1. **Files**, which are individual project distributions (such as source
distributions or wheels).

Example: ``requests-2.32.3-py3-none-any.whl``.

2. **Releases**, which contain one or more files that share the same version
number.

Example: `requests v2.32.3 <https://pypi.org/project/requests/2.32.3/>`_.

3. **Projects**, which contain one or more releases.

Example: `requests <https://pypi.org/project/requests>`_.

Deletion eligibility rules
--------------------------

This PEP proposes the following *deletion eligibility rules*:
woodruffw marked this conversation as resolved.
Show resolved Hide resolved

* A **file** is deletable if and only if it was uploaded to
PyPI less than 72 hours from the current time, **or** if it
has a :ref:`pre-release specifier <packaging:version-specifiers>`.
* A **release** is deletable if and only if all of its
contained files are deletable.
* A **project** is deletable if and only if all of its releases are deletable.

These rules allow new projects to be
deleted entirely, and allow old projects to delete new files or releases,
but do not allow old projects to delete old files or releases.

Implementation
==============

This PEP's implementation primarily concerns aspects of PyPI that are not
standardized or subject to standardization, such as the web interface and
signed-in user operations. As a result, this section describes its
implementation in behavioral terms.

Changes
-------

* Per the eligibility rules above, PyPI will reject web interface requests
(using an appropriate HTTP response code of its choosing) for
file, release, or project deletion if the respective object is not
eligible for deletion.
* PyPI will amend its web interface to indicate a file/release/project's
deletion ineligibility, e.g. by styling the relevant UI elements as "inactive"
and making relevant bottoms/forms unclickable.

Security Implications
=====================

This PEP does not identify negative security implications associated with the
proposed approach.

This PEP identifies one minor positive security implication: by restricting
user-controlled deletions, this PEP makes it more difficult for a malicious
actor to cover their tracks by deleting malware from the index. This is
particularly useful for external (i.e. non-PyPI administrator) triage and
incident response, where the defending party needs easy access to malware
samples to develop indicators of compromise.

How To Teach This
=================

This PEP suggests at least two pieces of public-facing material to help
the larger Python packaging community (and its downstream consumers)
understand its changes:

* An announcement post on the `PyPI blog <https://blog.pypi.org>`_ explaining
the nature of the PEP, its motivations, and its behavioral implications for
PyPI.
* An announcement banner on PyPI itself, linking to the above.
* Updates to the `PyPI user documentation <https://docs.pypi.org/>`_ explaining
the difference between deletion and yanking and the limited conditions under
which the former can still be initiated by package owners.

Rejected Ideas
==============

Conditioning deletion on dependency relationships
-------------------------------------------------

An alternative to time-based deletion windows is deletion eligibility based on
downstream dependents. For example, a release could be considered deletable
if and only if it has fewer than ``N`` downstream dependents on PyPI,
where ``N`` could be as low as 1.

This idea is appealing since it directly links deletion eligibility to
disruptiveness. `npm <https://www.npmjs.com/>`_ uses it and
conditions project removal on the absence of any downstream dependencies
known to the index.

Despite its appeal, this PEP identifies several disadvantages and technical
limitations that make dependency-conditioned deletion not appropriate
for PyPI:

1. *PyPI is not aware of dependency relationships.* In Python packaging,
both project builds *and* metadata generation are frequently dynamic
operations, involving arbitrary project-specified code. This is typified
by source distributions containing ``setup.py`` scripts, where the execution
of ``setup.py`` is responsible for computing the set of dependencies
encoded in the project's metadata.

This is in marked contrast to ecosystems like npm and Rust's
`crates <https://crates.io/>`_, where project *builds* can be dynamic but
the project's metadata itself is static.

As a result of this, `PyPI doesn't know your project's dependencies
<https://dustingram.com/articles/2018/03/05/why-pypi-doesnt-know-dependencies/>`_,
and is architecturally incapable of knowing them without either running
arbitrary code (a significant security risk) or performing a long-tail
deprecation of ``setup.py``-based builds in favor of :pep:`517` and
:pep:`621`-style static metadata.

2. *Results in an unintuitive permissions model.* Dependency-conditioned
deletion results in a "reversed" power relationship, where anybody
who introduces a dependency on a project can prevent that project from
being deleted.

This is reasonable on face value, but can be abused to produce unexpected
and undesirable (in the context of enabling some deletions) outcomes.
A notable example of this is npm's
`everything package <https://www.npmjs.com/package/everything>`_, which
depends on every public package on npm (as of 30 Dec 2023) and thereby
prevents their deletion.


Conditioning deletion on download count
---------------------------------------

Another alternative to time-based deletion windows is to delete based on the
number of downloads. For example, a release could be considered deletable if
and only if it has fewer than ``N`` downloads during the last period.

While presenting advantages by tying a project deletion possibility to its
usage, this PEP identifies several limitations to this approach:

1. *Ecosystem diversity.* The Python ecosystem includes projects with widely
varying usage patterns. A fixed download threshold would not adequately account
for niche but critical projects with naturally low download counts.

2. *Time sensitivity.* Download counts do not necessarily reflect a project's
current status or importance. A previously popular project might have low
recent downloads but still be crucial for maintaining older systems.

3. *Technical complexity.* Accessing the download count of a project within
PyPI is not straightforward, and there is limited possibility to gather a
project's download statistics from mirrors or other distributions systems.

Copyright
=========

This document is placed in the public domain or under the CC0-1.0-Universal
license, whichever is more permissive.