
Storage protection feature does not integrate well with StatefulSet PVC recreation #74374

Closed
msau42 opened this issue Feb 21, 2019 · 58 comments
Labels
help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. kind/bug Categorizes issue or PR as related to a bug. lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/storage Categorizes an issue or PR as relevant to SIG Storage.

Comments

@msau42
Member
msau42 commented Feb 21, 2019

What happened:
The storage protection feature prevents PVC deletion until all Pods referencing it are deleted. This doesn't work well with StatefulSets when you want to delete the PVC and have the StatefulSet controller recreate it. The StatefulSet controller only creates missing PVCs when creating a new Pod. So you could have a sequence like this:

  1. Try to delete the PVC. Deletion is pending due to the storage protection finalizer.
  2. Delete the Pod.
  3. The StatefulSet controller notices that the pod is deleted and creates the replacement pod. But the PVC already exists, so a new one is not created.
  4. The storage protection controller removes the finalizer and the PVC is deleted.
  5. The new pod stays Pending: first because the PVC deletion is pending, and then because the PVC has been deleted.

Nothing tries to recreate the PVC, and the Pod is stuck in Pending forever.

What you expected to happen:
We could potentially add logic in the StatefulSet controller to check if the PVC is terminating here. However, the controller uses an informer, so the race may still exist.

Another option is to make the StatefulSet controller actively reconcile PVC creation. This would probably require more substantial changes to the controller.
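
As an illustration only (not the actual controller code), here is a minimal Go sketch of the terminating-PVC check from the first option. The helper and type names are hypothetical; the claim is read through an informer-backed lister the way the StatefulSet controller reads PVCs, which is why this narrows the race but does not remove it:

```go
package sketch

import (
	"fmt"

	apierrors "k8s.io/apimachinery/pkg/api/errors"
	corelisters "k8s.io/client-go/listers/core/v1"
)

// pvcState and checkClaim are hypothetical names introduced for this sketch.
type pvcState int

const (
	pvcReady       pvcState = iota // exists and is not being deleted
	pvcTerminating                 // exists but has a deletion timestamp set
	pvcMissing                     // not found; safe to create a replacement
)

// checkClaim classifies a StatefulSet pod's PVC. The real controller reads
// PVCs through an informer-backed lister, so this result can be stale.
func checkClaim(lister corelisters.PersistentVolumeClaimLister, ns, name string) (pvcState, error) {
	claim, err := lister.PersistentVolumeClaims(ns).Get(name)
	if apierrors.IsNotFound(err) {
		return pvcMissing, nil
	}
	if err != nil {
		return 0, fmt.Errorf("reading PVC %s/%s: %w", ns, name, err)
	}
	if claim.DeletionTimestamp != nil {
		// The storage-protection finalizer has not been removed yet.
		// Creating a replacement with the same name would fail with
		// AlreadyExists, so the caller should requeue and retry later.
		return pvcTerminating, nil
	}
	return pvcReady, nil
}
```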

@kubernetes/sig-storage-bugs
@kubernetes/sig-apps-bugs

@msau42 msau42 added the kind/bug Categorizes issue or PR as related to a bug. label Feb 21, 2019
@k8s-ci-robot k8s-ci-robot added sig/storage Categorizes an issue or PR as relevant to SIG Storage. sig/apps Categorizes an issue or PR as relevant to SIG Apps. labels Feb 21, 2019
@msau42 msau42 changed the title Storage protection feature does not integrate well with StatefulSet pod recreation Storage protection feature does not integrate well with StatefulSet PVC recreation Feb 21, 2019
@msau42
Member Author
msau42 commented Feb 21, 2019

A workaround in the meantime is to delete the StatefulSet pod a second time if it's stuck in Pending due to a missing PVC.
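
For completeness, a minimal client-go sketch of that workaround (the namespace "default" and pod name "web-0" are placeholders); it does nothing more than the equivalent `kubectl delete pod`:

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Load ~/.kube/config; namespace and pod name below are placeholders.
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	cs, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}
	// Deleting the stuck pod once more lets the StatefulSet controller
	// notice the missing PVC and recreate it along with the new pod.
	if err := cs.CoreV1().Pods("default").Delete(context.TODO(), "web-0", metav1.DeleteOptions{}); err != nil {
		panic(err)
	}
	fmt.Println("deleted pod default/web-0")
}
```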

@kow3ns
Member
kow3ns commented Feb 25, 2019

/assign

@kow3ns
Member
kow3ns commented Feb 25, 2019

/assign msau42

@cscetbon
cscetbon commented Mar 10, 2019

Hey @msau42, thank you for opening the issue and for the temporary workaround.

Any news on this bug?

@KevinTHU
KevinTHU commented Jun 5, 2019

I think this needs a real fix, not just a workaround; deleting the pod a second time is not a graceful operation.

@msau42
Member Author
msau42 commented Jun 5, 2019

I would be happy to help guide someone to come up with a possible design and implementation for a fix.

/help

@k8s-ci-robot
Contributor

@msau42:
This request has been marked as needing help from a contributor.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

In response to this:

I would be happy to help guide someone to come up with a possible design and implementation for a fix.

/help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Jun 5, 2019
@zhangxiaoyu-zidif
Contributor

When recreating the pod, check whether the PVC is marked for deletion. Maybe we could force-delete the PVC immediately and create a new PVC before creating the Pod.

@mucahitkurt
Contributor

Hey @msau42,

I would like to work on this issue. As far as I understand, the StatefulSet controller checks the different pod states and takes action according to those states when syncing the StatefulSet.

Maybe I can add handling for the pod Pending state here and then check whether the pod's PVC exists; if not, create the missing PVC.

wdyt?

@KevinTHU

The hardest part is that checking the status of the PVC (in the pod-creation path) and deleting the PVC (the pvc-protection finalizer) happen in different goroutines, so there will be a concurrency problem.

@cwdsuzhou
Member

@msau42 @kow3ns I have sent PR #79164.
I have run it in my cluster and it works.

@mucahitkurt
Contributor

Hey @msau42,

I would like to work on this issue. As far as I understand, the StatefulSet controller checks the different pod states and takes action according to those states when syncing the StatefulSet.

Maybe I can add handling for the pod Pending state here and then check whether the pod's PVC exists; if not, create the missing PVC.

wdyt?

An alternative solution might look like https://github.com/mucahitkurt/kubernetes/tree/fix/enable-statefulset-pvc-recreation-for-pending-pod

@cscetbon

I think you mean mucahitkurt@1ab4b97

@cscetbon
cscetbon commented Jun 19, 2019

@mucahitkurt if I understand your code correctly, when a Pod is in the PodPending state you try to recreate the PVC. What if a Pod is just restarted by k8s on the same host? Would the recreation just fail? What if the previous PVC is still terminating (same name in our case)?

@cwdsuzhou
Member

@mucahitkurt we may need to know whether the pod stays in Pending or is only Pending transiently.

  1. If it stays in Pending, we need to try to recreate the PVC (though there may be many other reasons for a pod to be Pending).
  2. If it is Pending only transiently, we should do nothing.

@mucahitkurt
Contributor

@mucahitkurt if I understand your code correctly, when a Pod is in the PodPending state you try to recreate the PVC. What if a Pod is just restarted by k8s on the same host? Would the recreation just fail? What if the previous PVC is still terminating (same name in our case)?

@cscetbon I'll test these scenarios on my local cluster, but CreatePersistentVolumeClaims does not directly create the PVC: it first checks whether the pod's PVC is missing, and only if the PVC is not found does it try to create it. So if a StatefulSet pod is restarting and no PVC is missing, this solution won't do anything.

If the previous PVC is still terminating, I expect the PVC lister to return it, so CreatePersistentVolumeClaims won't try to recreate it.

@cwdsuzhou CreatePersistentVolumeClaims only tries to create the PVC if it's missing. If the pod is Pending for a reason other than a missing PVC, the cost of this solution is querying the same PVCs again and again (spc.pvcLister.PersistentVolumeClaims(claim.Namespace).Get(claim.Name)). So if a pod is Pending and no PVC is missing, the solution won't do anything that has a side effect.
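
A rough Go paraphrase of that create-if-missing behaviour (hypothetical helper names, not the actual kubernetes/kubernetes source): a claim is only created when the lister reports NotFound, which is why re-running it is harmless, and also why an existing-but-terminating claim is silently skipped:

```go
package sketch

import (
	"context"
	"fmt"

	v1 "k8s.io/api/core/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	corelisters "k8s.io/client-go/listers/core/v1"
)

// ensureClaims is a hypothetical helper. claims stands in for the set of
// PVCs a StatefulSet pod expects. A claim is created only when the lister
// says it does not exist, so calling this repeatedly has no extra effect.
func ensureClaims(cs kubernetes.Interface, lister corelisters.PersistentVolumeClaimLister, claims []v1.PersistentVolumeClaim) error {
	for i := range claims {
		claim := &claims[i]
		_, err := lister.PersistentVolumeClaims(claim.Namespace).Get(claim.Name)
		switch {
		case apierrors.IsNotFound(err):
			// Missing: create it.
			if _, err := cs.CoreV1().PersistentVolumeClaims(claim.Namespace).Create(context.TODO(), claim, metav1.CreateOptions{}); err != nil {
				return fmt.Errorf("creating PVC %s/%s: %w", claim.Namespace, claim.Name, err)
			}
		case err != nil:
			return fmt.Errorf("reading PVC %s/%s: %w", claim.Namespace, claim.Name, err)
		default:
			// The claim already exists, even if it is terminating; nothing
			// is done here, which is exactly the gap this issue describes.
		}
	}
	return nil
}
```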

@msau42
Member Author
msau42 commented Jun 21, 2019

I wonder if we actually need to check pod state? Regardless of pod state, can we just check if any of its pvcs are missing?

@cscetbon

@msau42 what if we check that it exists but is in a deleting state?

@mucahitkurt
Contributor
mucahitkurt commented Jun 21, 2019

I wonder if we actually need to check pod state? Regardless of pod state, can we just check if any of its pvcs are missing?

@msau42 ,

Most probably there won't be any side effect from checking for missing PVCs with the CreatePersistentVolumeClaims method regardless of pod state, because as far as I understand the method works in an idempotent way.

Checking the existence of the pod's PVCs at every sync operation could have some performance impact, though.

I don't know of other scenarios where we would need to check the existence of the pod's PVCs at every StatefulSet sync. Do you have any scenario in mind?

@janetkuo
Member
janetkuo commented Jun 22, 2019

We could potentially add logic in the StatefulSet controller to check if the PVC is terminating. However, the controller uses an informer, so the race may still exist.

An alternative is to get the PVC directly from the API server instead of the cache to avoid the race. The logic would be straightforward and the code change is small.

We'll only do this when creating a Pod with an existing PVC. Therefore, the chance of hitting this case would be pretty low, and the impact on the API server would be small too.
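
A minimal sketch of that live-read idea (a hypothetical helper, not an actual patch): when the cache says the PVC already exists, confirm against the API server and look at DeletionTimestamp before deciding to reuse it:

```go
package sketch

import (
	"context"
	"fmt"

	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// claimUsable is a hypothetical helper: it returns true only if the PVC
// exists on the API server and is not being deleted. A false result means
// the controller should either wait (terminating) or create a new claim
// (missing) before proceeding with the pod.
func claimUsable(cs kubernetes.Interface, ns, name string) (bool, error) {
	claim, err := cs.CoreV1().PersistentVolumeClaims(ns).Get(context.TODO(), name, metav1.GetOptions{})
	if apierrors.IsNotFound(err) {
		return false, nil // genuinely missing: safe to recreate
	}
	if err != nil {
		return false, fmt.Errorf("live read of PVC %s/%s: %w", ns, name, err)
	}
	// A set DeletionTimestamp means the storage-protection finalizer will
	// remove the claim once the old pod is gone; reusing it now would
	// reproduce the race described in this issue.
	return claim.DeletionTimestamp == nil, nil
}
```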

@mucahitkurt
Contributor

We could potentially add logic in the StatefulSet controller to check if the PVC is terminating. However, the controller uses an informer, so the race may still exist.

An alternative is to get the PVC directly from the API server instead of the cache to avoid the race. The logic would be straightforward and the code change is small.

We'll only do this when creating a Pod with an existing PVC. Therefore, the chance of hitting this case would be pretty low, and the impact on the API server would be small too.

@janetkuo
Do you have any concerns about my alternative solution, which I think has no race condition?

@andyxning
Member

/sub

@santhoshs123

Is this issue fixed in k8s version 1.20? I used to hit it 100% of the time in 1.19. In 1.20 the pod remains Pending until the PVC gets recreated, without any "PVC not found" events; once the PVC is recreated, the pod goes to Running.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 20, 2021
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 20, 2021
@ThisIsQasim

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jul 21, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 19, 2021
@nikola-n6c

Is the deletion order of Pod -> PVC -> PV still prone to this bug/race condition? I still see this behavior in 1.20.5.

@ymmt2005
Contributor

Yes; the PVC should be marked for deletion before the Pod.

@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Nov 28, 2021
@jingxu97
Contributor

@cofyc Is this issue completely resolved, or does it still need some fixes after your change?

@cofyc
Member
cofyc commented Nov 30, 2021

/remove-lifecycle rotten
Not resolved completely, but the impact is minor (I guess).

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Nov 30, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 28, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 30, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue or PR with /reopen
  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

@k8s-ci-robot
Contributor

@k8s-triage-robot: Closing this issue.

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue or PR with /reopen
  • Mark this issue or PR as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
