google cloud platform - GKE pod with Filestore RWX volume takes 30 minutes to start: Error syncing pod, skipping

I have a GKE pod mounted with an RWX Filestore volume. Below are my StorageClass, PV, and PVC configs.

GKE Version - 1.30.9-gke.1127000

Every pod that uses this multishare volume takes about 30 minutes to start, and in the kubelet events I see the following error:

Error syncing pod, skipping" err="unmounted volumes=[filestore-rwx-volume], unattached volumes=[], failed to process volumes=[]: context deadline exceeded"

I have verified connectivity from both the node and the pod to the Filestore instance on port 2049, and it works fine. The nodes are also healthy.

allowVolumeExpansion: true
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  labels:
    addonmanager.kubernetes.io/mode: EnsureExists
    k8s-app: gcp-filestore-csi-driver
  name: rwx-sc
parameters:
  instance-storageclass-label: rwx
  multishare: "true"
  network: prvpc
  tier: enterprise
provisioner: filestore.csi.storage.gke.io
reclaimPolicy: Retain
volumeBindingMode: WaitForFirstConsumer
---
apiVersion: v1
kind: PersistentVolume
metadata:
  annotations:
    pv.kubernetes.io/provisioned-by: filestore.csi.storage.gke.io
    volume.kubernetes.io/provisioner-deletion-secret-name: ""
    volume.kubernetes.io/provisioner-deletion-secret-namespace: ""
  creationTimestamp: "2025-01-26T17:07:36Z"
  finalizers:
  - kubernetes.io/pv-protection
  name: pv-pr
spec:
  accessModes:
  - ReadWriteMany
  capacity:
    storage: 60Gi
  claimRef:
    apiVersion: v1
    kind: PersistentVolumeClaim
    name: commerce-prodlive-assets-pvc
    namespace: commerce
    resourceVersion: "3125440"
    uid: 333cea1a-b49160c4d6e8
  csi:
    driver: filestore.csi.storage.gke.io
    volumeAttributes:
      ip: 10.xx.xx.x
      max-share-size: "1099511627776"
      storage.kubernetes.io/csiProvisionerIdentity: 123312-63xxx19-filestore.csi.storage.gke.io
      supportLockRelease: "true"
    volumeHandle: modeMultishare/enterprise-multishare-rwx-/test-k8s/europe-west1/fs-id/pv-pr
  persistentVolumeReclaimPolicy: Retain
  storageClassName: enterprise-multishare-rwx-custom
  volumeMode: Filesystem

We have no idea why this Filestore volume mount takes so long; if I attach a different (non-Filestore) volume, the pod starts fine.

Below is the log output from one filestore-node pod; it is the same for all three.

kubectl logs -f filestore-node-dsdsd -n kube-system
Defaulted container "csi-driver-registrar" out of: csi-driver-registrar, gcp-filestore-driver, nfs-services, filestorecsi-metrics-collector
I0309 17:00:06.274180       1 main.go:135] Version: v2.9.4-gke.27-0-gf3945690
I0309 17:00:06.274296       1 main.go:136] Running node-driver-registrar in mode=
I0309 17:00:06.274304       1 main.go:157] Attempting to open a gRPC connection with: "/csi/csi.sock"
I0309 17:00:06.274893       1 connection.go:214] Connecting to unix:///csi/csi.sock
I0309 17:00:11.037725       1 main.go:164] Calling CSI driver to discover driver name
I0309 17:00:11.037755       1 connection.go:243] GRPC call: /csi.v1.Identity/GetPluginInfo
I0309 17:00:11.037762       1 connection.go:244] GRPC request: {}
I0309 17:00:11.041351       1 connection.go:250] GRPC response: {"name":"filestore.csi.storage.gke.io","vendor_version":"v1.6.17-gke.15"}
I0309 17:00:11.041365       1 connection.go:251] GRPC error: <nil>
I0309 17:00:11.041374       1 main.go:173] CSI driver name: "filestore.csi.storage.gke.io"
I0309 17:00:11.041408       1 node_register.go:55] Starting Registration Server at: /registration/filestore.csi.storage.gke.io-reg.sock
I0309 17:00:11.072876       1 node_register.go:64] Registration Server started at: /registration/filestore.csi.storage.gke.io-reg.sock
I0309 17:00:11.072997       1 node_register.go:88] Skipping HTTP server because endpoint is set to: ""
I0309 17:00:11.636522       1 main.go:90] Received GetInfo call: &InfoRequest{}
I0309 17:00:11.780566       1 main.go:101] Received NotifyRegistrationStatus call: &RegistrationStatus{PluginRegistered:true,Error:,}

asked Mar 10 at 13:55 by saurabh umathe; edited Mar 11 at 19:14
  • Please try the following first steps to provide more details, as this is a very specific issue. 1. During the startup delay, check the status/logs of the Filestore DaemonSet. Use this filter for logs: resource.type="k8s_container" resource.labels.location="LOC" resource.labels.cluster_name="NAME" labels.k8s-pod/k8s-app="gcp-filestore-csi-driver" severity>="ERROR". 2. During startup, check the status/errors of the PV and PVC via kubectl describe. 3. Try mounting your Filestore instance on a node manually to determine whether it's a Filestore issue or a CSI driver issue. 4. What is the GKE version? – mikalai, Mar 11 at 17:24
  • Hi @mikalai, I have attached the filestore pod log output; also, no events are logged on the PV and PVC. GKE version: 1.30.9-gke.1127000 – saurabh umathe, Mar 11 at 19:15
  • Could you please provide the other requested details so that I can see the whole picture? – mikalai, Mar 14 at 14:08

1 Answer


The delay you're seeing during mounting is expected behavior when mounting a Filestore volume in GKE: the pod.spec.securityContext.fsGroup setting causes the kubelet to run chown and chmod on every file in the volumes mounted for a given pod. As the Kubernetes documentation states:

By default, Kubernetes recursively changes ownership and permissions for the contents of each volume to match the fsGroup specified in a Pod's securityContext when that volume is mounted.

Checking and changing ownership and permissions is time-consuming, especially for large volumes with many files, and it slows pod startup. To resolve this, set fsGroupChangePolicy: OnRootMismatch in the pod's securityContext. With this policy, Kubernetes skips the recursive walk whenever the ownership and permissions of the volume root already match the fsGroup.
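As an illustrative sketch (the pod name, image, and fsGroup value are placeholders; the claim name is taken from the PV's claimRef above), the policy goes alongside fsGroup in the pod's securityContext:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: app-with-filestore        # placeholder name
  namespace: commerce
spec:
  securityContext:
    fsGroup: 2000                 # example group ID
    # Only chown/chmod the volume recursively if the volume root's
    # ownership/permissions don't already match fsGroup; otherwise
    # skip the expensive recursive walk at mount time.
    fsGroupChangePolicy: "OnRootMismatch"
  containers:
  - name: app
    image: nginx                  # placeholder image
    volumeMounts:
    - name: filestore-rwx-volume
      mountPath: /data
  volumes:
  - name: filestore-rwx-volume
    persistentVolumeClaim:
      claimName: commerce-prodlive-assets-pvc
```

After the first successful mount applies the ownership change, subsequent pod starts should skip the recursive chown/chmod entirely, which is where the 30-minute delay comes from on large volumes.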
