Bonjour à tous,
Depuis un certain temps j’ai le bureau KDE sous Debian 11 qui freeze: la barre de tâche ne répond plus, impossible de changer d’activité, d’accéder au menu K ou de rebooter. Souris et clavier fonctionnent sur le reste de l’écran courant de l’activité en cours, les applications des fenêtre visibles fonctionnent.
Ca se passe généralement après un accès à un site via firefox mais je ne peux reproduire systématiquement le bug.
C’est la première fois, depuis que je suis sous Linux, en 2003, que je suis obligé de passer par un terminal pour rebooter. Le système refonctionne alors correctement pour quelque temps.
J’ai relevé ces messages dans syslog avant le dernier freeze:
Mar 23 09:27:43 kmcs kernel: [420375.912061] nouveau 0000:01:00.0: timeout
Mar 23 09:27:43 kmcs kernel: [420375.912091] WARNING: CPU: 3 PID: 204980 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:32:04 kmcs kernel: [424236.501831] nouveau 0000:01:00.0: timeout
Mar 23 10:32:04 kmcs kernel: [424236.501861] WARNING: CPU: 2 PID: 209103 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:33:02 kmcs org.freedesktop.Notifications[209117]: org.kde.knotifications: WaitForName: Service was not registered within timeout
Mar 23 10:33:02 kmcs dbus-daemon[209084]: [session uid=1000 pid=209084] Activated service 'org.freedesktop.Notifications' failed: Process org.freedesktop.Notifications exited with status 1
...
Mar 23 10:40:03 kmcs dbus-daemon[209084]: [session uid=1000 pid=209084] Activated service 'org.freedesktop.Notifications' failed: Process org.freedesktop.Notifications exited with status 1
...
Mar 23 10:46:18 kmcs kernel: [425090.254402] nouveau 0000:01:00.0: timeout
Mar 23 10:46:18 kmcs kernel: [425090.254429] WARNING: CPU: 4 PID: 208999 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouv
...
Mar 23 10:49:34 kmcs kernel: [425286.124705] nouveau 0000:01:00.0: timeout
Mar 23 10:49:34 kmcs kernel: [425286.124733] WARNING: CPU: 7 PID: 209000 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:50:03 kmcs kernel: [425315.740820] nouveau 0000:01:00.0: timeout
Mar 23 10:50:03 kmcs kernel: [425315.740849] WARNING: CPU: 2 PID: 207371 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:52:04 kmcs kernel: [425436.037433] nouveau 0000:01:00.0: timeout
Mar 23 10:52:04 kmcs kernel: [425436.037462] WARNING: CPU: 1 PID: 201200 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:52:47 kmcs kernel: [425479.103782] nouveau 0000:01:00.0: timeout
Mar 23 10:52:47 kmcs kernel: [425479.103810] WARNING: CPU: 1 PID: 201200 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:54:12 kmcs kernel: [425564.170050] nouveau 0000:01:00.0: timeout
Mar 23 10:54:12 kmcs kernel: [425564.170077] WARNING: CPU: 7 PID: 209000 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
...
Mar 23 10:54:38 kmcs kernel: [425590.229643] nouveau 0000:01:00.0: timeout
Mar 23 10:54:38 kmcs kernel: [425590.229671] WARNING: CPU: 3 PID: 208557 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
Mar 23 10:54:38 kmcs kernel: [425590.229727] Modules linked in: sd_mod sg uas usb_storage uinput rfcomm cmac algif_hash algif_skcipher af_alg intel_rapl_msr intel_rapl_common snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc snd_sof_intel_hda_common snd_hda_codec_hdmi snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek bnep snd_sof_intel_hda snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match snd_soc_acpi ledtrig_audio snd_hda_intel x86_pkg_temp_thermal snd_intel_dspcfg intel_powerclamp soundwire_intel coretemp soundwire_generic_allocation btusb btrtl btbcm btintel mei_hdcp snd_soc_core snd_compress kvm_intel bluetooth iwlmvm soundwire_cadence snd_hda_codec jitterentropy_rng drbg kvm snd_hda_core nls_ascii irqbypass uvcvideo nls_cp437 mac80211 snd_hwdep videobuf2_vmalloc ghash_clmulni_intel vfat libarc4 aes_generic videobuf2_memops soundwire_bus rapl videobuf2_v4l2 intel_cstate fat aesni_intel joydev videobuf2_common intel_uncore snd_pcm crypto_simd iwlwifi iTCO_wdt snd_timer asus_wmi cryptd
Mar 23 10:54:38 kmcs kernel: [425590.229755] serio_raw videodev intel_pmc_bxt pcspkr glue_helper sparse_keymap iTCO_vendor_support snd ansi_cprng wmi_bmof ecdh_generic watchdog mc soundcore cfg80211 ecc libaes mei_me hid_multitouch mei rfkill intel_pch_thermal tpm_crb tpm_tis tpm_tis_core evdev tpm rng_core intel_pmc_core acpi_pad acpi_tad ac parport_pc ppdev lp parport fuse configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic usbhid hid_generic i915 nouveau ttm nvme i2c_algo_bit xhci_pci nvme_core drm_kms_helper r8169 xhci_hcd ahci libahci t10_pi realtek crc_t10dif mdio_devres libata crct10dif_generic cec crc32_pclmul libphy crct10dif_pclmul usbcore intel_lpss_pci crct10dif_common drm i2c_i801 intel_lpss crc32c_intel scsi_mod i2c_hid idma64 i2c_smbus usb_common hid battery mxm_wmi wmi video button
Mar 23 10:54:38 kmcs kernel: [425590.229790] CPU: 3 PID: 208557 Comm: kworker/3:2 Tainted: G W 5.10.0-10-rt-amd64 #1 Debian 5.10.84-1
...
Mar 23 10:56:19 kmcs kernel: [ 4.082255] nouveau 0000:01:00.0: timeout
Mar 23 10:56:19 kmcs kernel: [ 4.082264] WARNING: CPU: 7 PID: 200 at drivers/gpu/drm/nouveau/nvkm/subdev/pmu/base.c:107 nvkm_pmu_reset+0x151/0x170 [nouveau]
Mar 23 10:56:19 kmcs kernel: [ 4.082325] Modules linked in: usbhid hid_generic i915 nouveau(+) nvme ahci ttm libahci i2c_algo_bit xhci_pci nvme_core mxm_wmi r8169 t10_pi drm_kms_helper xhci_hcd crc_t10dif realtek crct10dif_generic libata mdio_devres cec crc32_pclmul crct10dif_pclmul intel_lpss_pci crct10dif_common libphy crc32c_intel usbcore i2c_i801 drm intel_lpss i2c_hid scsi_mod i2c_smbus idma64 usb_common hid battery video wmi button
Mar 23 10:56:19 kmcs kernel: [ 4.082337] CPU: 7 PID: 200 Comm: systemd-udevd Not tainted 5.10.0-10-rt-amd64 #1 Debian 5.10.84-1
Mar 23 10:56:19 kmcs kernel: [ 4.082338] Hardware name: SLIMBOOK PROX14-10/PROX14-10, BIOS N.1.05 02/21/2020
Mar 23 10:56:19 kmcs kernel: [ 4.082339] BUG: using smp_processor_id() in preemptible [00000000] code: systemd-udevd/200
Mar 23 10:56:19 kmcs kernel: [ 4.082340] caller is print_stop_info+0x1b/0x40
On voit qu’après le message de dbus-daemon à 10:40 les timeout processeurs s’intensifient. Je n’ai pas rèeussi à trouver d’autres éléments plus précis.
Quelqu’un peut-il m’indiquer une méthode pour débugger ça ?
J’ai sauvegardé tous les logs du moment.