Need help interpreting a kern.log file (recurring system crash)

I have a server (physical machine, Debian 9.8) that just crashed after I ran a plain htop command. The error shown in the terminal was:

NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]

I had to force a reboot because it was completely unresponsive (no ping, no ssh, totally dead).

It had an uptime of about twenty days when I rebooted it.

This is not the first time it has happened (no specific trigger: this time it was htop, but it can happen with anything).

The hardware is fairly recent, practically new (I can give more details if needed).

Looking through the logs, specifically the kern.log.1 file, I find dozens of lines related to this incident. The problem: I can't make sense of any of it!
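To make the log less overwhelming, here is a rough sketch of how the repeated entries could be condensed. It only assumes the line formats visible in the excerpt below (the "soft lockup" message and the "[<address>] ? function+0x..." call-trace frames); the names `summarize` and `SAMPLE` are mine, not from any tool, and the sample lines are copied from the log.

```python
import re
from collections import Counter

# Matches e.g. "soft lockup - CPU#5 stuck for 22s! [htop:13364]"
LOCKUP_RE = re.compile(
    r"soft lockup - CPU#(\d+) stuck for (\d+)s! \[([^:\]]+):(\d+)\]"
)
# Matches call-trace frames such as "[<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20"
FRAME_RE = re.compile(r"\[<[0-9a-f]+>\] \? ([\w.]+)\+0x")


def summarize(lines):
    """Return (lockups, frames): lockup events and a count of traced functions."""
    lockups = []        # tuples of (cpu, seconds, command, pid)
    frames = Counter()  # function name -> number of appearances in call traces
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            cpu, secs, comm, pid = m.groups()
            lockups.append((int(cpu), int(secs), comm, int(pid)))
            continue
        m = FRAME_RE.search(line)
        if m:
            frames[m.group(1)] += 1
    return lockups, frames


# A few lines copied from the log, used only as a quick demonstration.
SAMPLE = [
    "Mar  4 19:53:26 home-server kernel: [1317256.350378] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]",
    "Mar  4 19:53:26 home-server kernel: [1317256.369183]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20",
    "Mar  4 19:53:26 home-server kernel: [1317256.369774]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0",
    "Mar  4 19:53:54 home-server kernel: [1317284.388263]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20",
]

lockups, frames = summarize(SAMPLE)
for cpu, secs, comm, pid in lockups:
    print(f"CPU {cpu} stuck for {secs}s in {comm} (pid {pid})")
for name, count in frames.most_common():
    print(f"{count:4d}  {name}")
```

On the real file, the same function could be fed the lines of kern.log.1 (for example from `open("/var/log/kern.log.1")`, assuming that is where the file lives) to see which functions come up in every trace.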

Thanks in advance to anyone who can help me decipher this log and identify the cause (hardware? software?).

At the moment I can't keep this machine running for more than about twenty days without it crashing. I'm starting to doubt Debian's legendary stability :)

Here is the full log (in two messages, because I hit the length limit):

Mar  4 19:53:01 home-server kernel: [1317232.024737] INFO: rcu_sched self-detected stall on CPU
Mar  4 19:53:01 home-server kernel: [1317232.025426] 	5-...: (5249 ticks this GP) idle=a67/140000000000001/0 softirq=94313417/94313420 fqs=2624 
Mar  4 19:53:01 home-server kernel: [1317232.026480] 	 (t=5250 jiffies g=73084763 c=73084762 q=303821)
Mar  4 19:53:01 home-server kernel: [1317232.027158] Task dump for CPU 5:
Mar  4 19:53:01 home-server kernel: [1317232.027852] htop            R  running task        0 13364  13287 0x0000000c
Mar  4 19:53:01 home-server kernel: [1317232.028720]  ffffffff99518ec0 ffffffff988a7dcb 0000000000000005 ffffffff99518ec0
Mar  4 19:53:01 home-server kernel: [1317232.029476]  ffffffff9898112b ffff9a9b1fb596c0 ffffffff9944fd00 0000000000000000
Mar  4 19:53:01 home-server kernel: [1317232.030217]  ffffffff99518ec0 00000000ffffffff ffffffff988e36fa 0000000000d52129
Mar  4 19:53:01 home-server kernel: [1317232.030930] Call Trace:
Mar  4 19:53:01 home-server kernel: [1317232.031889]  <IRQ> 
Mar  4 19:53:01 home-server kernel: [1317232.031898]  [<ffffffff988a7dcb>] ? sched_show_task+0xcb/0x130
Mar  4 19:53:01 home-server kernel: [1317232.032600]  [<ffffffff9898112b>] ? rcu_dump_cpu_stacks+0x92/0xb2
Mar  4 19:53:01 home-server kernel: [1317232.033238]  [<ffffffff988e36fa>] ? rcu_check_callbacks+0x75a/0x8b0
Mar  4 19:53:01 home-server kernel: [1317232.034103]  [<ffffffff988f1dc8>] ? update_wall_time+0x498/0x7b0
Mar  4 19:53:01 home-server kernel: [1317232.034878]  [<ffffffff988f9c30>] ? tick_sched_do_timer+0x30/0x30
Mar  4 19:53:01 home-server kernel: [1317232.035541]  [<ffffffff988ea2d8>] ? update_process_times+0x28/0x50
Mar  4 19:53:01 home-server kernel: [1317232.036116]  [<ffffffff988f9630>] ? tick_sched_handle.isra.12+0x20/0x50
Mar  4 19:53:01 home-server kernel: [1317232.036940]  [<ffffffff988f9c68>] ? tick_sched_timer+0x38/0x70
Mar  4 19:53:01 home-server kernel: [1317232.037566]  [<ffffffff988eadae>] ? __hrtimer_run_queues+0xde/0x250
Mar  4 19:53:01 home-server kernel: [1317232.038192]  [<ffffffff988eb48c>] ? hrtimer_interrupt+0x9c/0x1a0
Mar  4 19:53:01 home-server kernel: [1317232.038722]  [<ffffffff98e1c507>] ? smp_apic_timer_interrupt+0x47/0x60
Mar  4 19:53:01 home-server kernel: [1317232.039416]  [<ffffffff98e1ada6>] ? apic_timer_interrupt+0x96/0xa0
Mar  4 19:53:01 home-server kernel: [1317232.040111]  <EOI> 
Mar  4 19:53:01 home-server kernel: [1317232.040117]  [<ffffffff988c53fe>] ? native_queued_spin_lock_slowpath+0x16e/0x190
Mar  4 19:53:01 home-server kernel: [1317232.040674]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:53:01 home-server kernel: [1317232.041234]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:53:01 home-server kernel: [1317232.041803]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:53:01 home-server kernel: [1317232.042668]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:53:01 home-server kernel: [1317232.043178]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:53:01 home-server kernel: [1317232.043754]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:53:01 home-server kernel: [1317232.044264]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:53:01 home-server kernel: [1317232.044994]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:53:01 home-server kernel: [1317232.045607]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:53:01 home-server kernel: [1317232.046151]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:53:26 home-server kernel: [1317256.350378] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:53:26 home-server kernel: [1317256.350888] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:53:26 home-server kernel: [1317256.354428]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:53:26 home-server kernel: [1317256.357642] CPU: 5 PID: 13364 Comm: htop Tainted: G      D         4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:53:26 home-server kernel: [1317256.358345] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:53:26 home-server kernel: [1317256.358949] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:53:26 home-server kernel: [1317256.359596] RIP: 0010:[<ffffffff988c5400>]  [<ffffffff988c5400>] native_queued_spin_lock_slowpath+0x170/0x190
Mar  4 19:53:26 home-server kernel: [1317256.360211] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:53:26 home-server kernel: [1317256.360933] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:53:26 home-server kernel: [1317256.361517] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:53:26 home-server kernel: [1317256.362130] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:53:26 home-server kernel: [1317256.362727] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:53:26 home-server kernel: [1317256.363433] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:53:26 home-server kernel: [1317256.364073] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:53:26 home-server kernel: [1317256.364695] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:53:26 home-server kernel: [1317256.365275] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:53:26 home-server kernel: [1317256.365955] Stack:
Mar  4 19:53:26 home-server kernel: [1317256.366598]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:53:26 home-server kernel: [1317256.367198]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:53:26 home-server kernel: [1317256.367839]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:53:26 home-server kernel: [1317256.368470] Call Trace:
Mar  4 19:53:26 home-server kernel: [1317256.369183]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:53:26 home-server kernel: [1317256.369774]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:53:26 home-server kernel: [1317256.370422]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:53:26 home-server kernel: [1317256.371049]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:53:26 home-server kernel: [1317256.371792]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:53:26 home-server kernel: [1317256.372396]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:53:26 home-server kernel: [1317256.373027]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:53:26 home-server kernel: [1317256.373610]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:53:26 home-server kernel: [1317256.374318]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:53:26 home-server kernel: [1317256.374945]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:53:26 home-server kernel: [1317256.375578] Code: d0 66 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 <8b> 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 
Mar  4 19:53:54 home-server kernel: [1317284.352266] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:53:54 home-server kernel: [1317284.353539] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:53:54 home-server kernel: [1317284.360718]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:53:54 home-server kernel: [1317284.367190] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:53:54 home-server kernel: [1317284.368346] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:53:54 home-server kernel: [1317284.369440] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:53:54 home-server kernel: [1317284.370977] RIP: 0010:[<ffffffff988c5402>]  [<ffffffff988c5402>] native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:53:54 home-server kernel: [1317284.372091] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:53:54 home-server kernel: [1317284.373119] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:53:54 home-server kernel: [1317284.374582] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:53:54 home-server kernel: [1317284.375617] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:53:54 home-server kernel: [1317284.376610] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:53:54 home-server kernel: [1317284.378080] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:53:54 home-server kernel: [1317284.379089] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:53:54 home-server kernel: [1317284.380048] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:53:54 home-server kernel: [1317284.381457] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:53:54 home-server kernel: [1317284.382507] Stack:
Mar  4 19:53:54 home-server kernel: [1317284.383501]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:53:54 home-server kernel: [1317284.384801]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:53:54 home-server kernel: [1317284.385840]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:53:54 home-server kernel: [1317284.386906] Call Trace:
Mar  4 19:53:54 home-server kernel: [1317284.388263]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:53:54 home-server kernel: [1317284.389245]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:53:54 home-server kernel: [1317284.390320]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:53:54 home-server kernel: [1317284.391691]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:53:54 home-server kernel: [1317284.392740]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:53:54 home-server kernel: [1317284.393768]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:53:54 home-server kernel: [1317284.395009]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:53:54 home-server kernel: [1317284.396015]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:53:54 home-server kernel: [1317284.397045]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:53:54 home-server kernel: [1317284.398421]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:53:54 home-server kernel: [1317284.399378] Code: 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 8b 07 <84> c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 09 4d 
Mar  4 19:54:04 home-server kernel: [1317295.040985] INFO: rcu_sched self-detected stall on CPU
Mar  4 19:54:04 home-server kernel: [1317295.041655] 	5-...: (20980 ticks this GP) idle=a67/140000000000001/0 softirq=94313417/94313420 fqs=10482 
Mar  4 19:54:04 home-server kernel: [1317295.042367] 	 (t=21003 jiffies g=73084763 c=73084762 q=1174569)
Mar  4 19:54:04 home-server kernel: [1317295.043079] Task dump for CPU 5:
Mar  4 19:54:04 home-server kernel: [1317295.043759] htop            R  running task        0 13364  13287 0x0000000c
Mar  4 19:54:04 home-server kernel: [1317295.044422]  ffffffff99518ec0 ffffffff988a7dcb 0000000000000005 ffffffff99518ec0
Mar  4 19:54:04 home-server kernel: [1317295.045088]  ffffffff9898112b ffff9a9b1fb596c0 ffffffff9944fd00 0000000000000000
Mar  4 19:54:04 home-server kernel: [1317295.045793]  ffffffff99518ec0 00000000ffffffff ffffffff988e36fa 0000000000d52129
Mar  4 19:54:04 home-server kernel: [1317295.046509] Call Trace:
Mar  4 19:54:04 home-server kernel: [1317295.047126]  <IRQ> 
Mar  4 19:54:04 home-server kernel: [1317295.047135]  [<ffffffff988a7dcb>] ? sched_show_task+0xcb/0x130
Mar  4 19:54:04 home-server kernel: [1317295.047767]  [<ffffffff9898112b>] ? rcu_dump_cpu_stacks+0x92/0xb2
Mar  4 19:54:04 home-server kernel: [1317295.048447]  [<ffffffff988e36fa>] ? rcu_check_callbacks+0x75a/0x8b0
Mar  4 19:54:04 home-server kernel: [1317295.049055]  [<ffffffff988f1dc8>] ? update_wall_time+0x498/0x7b0
Mar  4 19:54:04 home-server kernel: [1317295.049638]  [<ffffffff988f9c30>] ? tick_sched_do_timer+0x30/0x30
Mar  4 19:54:04 home-server kernel: [1317295.050231]  [<ffffffff988ea2d8>] ? update_process_times+0x28/0x50
Mar  4 19:54:04 home-server kernel: [1317295.050772]  [<ffffffff988f9630>] ? tick_sched_handle.isra.12+0x20/0x50
Mar  4 19:54:04 home-server kernel: [1317295.051381]  [<ffffffff988f9c68>] ? tick_sched_timer+0x38/0x70
Mar  4 19:54:04 home-server kernel: [1317295.051909]  [<ffffffff988eadae>] ? __hrtimer_run_queues+0xde/0x250
Mar  4 19:54:04 home-server kernel: [1317295.052428]  [<ffffffff988eb48c>] ? hrtimer_interrupt+0x9c/0x1a0
Mar  4 19:54:04 home-server kernel: [1317295.052942]  [<ffffffff98e1c507>] ? smp_apic_timer_interrupt+0x47/0x60
Mar  4 19:54:04 home-server kernel: [1317295.053450]  [<ffffffff98e1ada6>] ? apic_timer_interrupt+0x96/0xa0
Mar  4 19:54:04 home-server kernel: [1317295.054080]  <EOI> 
Mar  4 19:54:04 home-server kernel: [1317295.054096]  [<ffffffff988c5402>] ? native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:54:04 home-server kernel: [1317295.054601]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:54:04 home-server kernel: [1317295.055104]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:54:04 home-server kernel: [1317295.055612]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:54:04 home-server kernel: [1317295.056114]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:54:04 home-server kernel: [1317295.056666]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:54:04 home-server kernel: [1317295.057161]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:54:04 home-server kernel: [1317295.057646]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:54:04 home-server kernel: [1317295.058149]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:54:04 home-server kernel: [1317295.058613]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:54:04 home-server kernel: [1317295.059116]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:54:30 home-server kernel: [1317320.354691] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [htop:13364]
Mar  4 19:54:30 home-server kernel: [1317320.355171] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:54:30 home-server kernel: [1317320.358908]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:54:30 home-server kernel: [1317320.362506] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:54:30 home-server kernel: [1317320.363367] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:54:30 home-server kernel: [1317320.364063] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:54:30 home-server kernel: [1317320.364646] RIP: 0010:[<ffffffff988c5400>]  [<ffffffff988c5400>] native_queued_spin_lock_slowpath+0x170/0x190
Mar  4 19:54:30 home-server kernel: [1317320.365531] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:54:30 home-server kernel: [1317320.366252] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:54:30 home-server kernel: [1317320.366957] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:54:30 home-server kernel: [1317320.367539] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:54:30 home-server kernel: [1317320.368468] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:54:30 home-server kernel: [1317320.369149] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:54:30 home-server kernel: [1317320.369807] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:54:30 home-server kernel: [1317320.370394] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:54:30 home-server kernel: [1317320.371333] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:54:30 home-server kernel: [1317320.371994] Stack:
Mar  4 19:54:30 home-server kernel: [1317320.372666]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:54:30 home-server kernel: [1317320.373257]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:54:30 home-server kernel: [1317320.374278]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:54:30 home-server kernel: [1317320.374894] Call Trace:
Mar  4 19:54:30 home-server kernel: [1317320.375604]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:54:30 home-server kernel: [1317320.376212]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:54:30 home-server kernel: [1317320.377247]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:54:30 home-server kernel: [1317320.377844]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:54:30 home-server kernel: [1317320.378515]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:54:30 home-server kernel: [1317320.379227]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:54:30 home-server kernel: [1317320.380199]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:54:30 home-server kernel: [1317320.380801]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:54:30 home-server kernel: [1317320.381456]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:54:30 home-server kernel: [1317320.382149]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:54:30 home-server kernel: [1317320.382986] Code: d0 66 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 <8b> 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 
Mar  4 19:54:53 home-server kernel: [1317344.195198] kvm [17784]: vcpu0, guest rIP: 0xffffffff99c5a674 disabled perfctr wrmsr: 0xc2 data 0xffff
Mar  4 19:54:58 home-server kernel: [1317348.356576] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [htop:13364]
Mar  4 19:54:58 home-server kernel: [1317348.357615] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:54:58 home-server kernel: [1317348.362693]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:54:58 home-server kernel: [1317348.367445] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:54:58 home-server kernel: [1317348.368260] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:54:58 home-server kernel: [1317348.368993] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:54:58 home-server kernel: [1317348.370208] RIP: 0010:[<ffffffff988c5400>]  [<ffffffff988c5400>] native_queued_spin_lock_slowpath+0x170/0x190
Mar  4 19:54:58 home-server kernel: [1317348.370949] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:54:58 home-server kernel: [1317348.371725] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:54:58 home-server kernel: [1317348.372404] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0

Rest of the log:

Mar  4 19:54:58 home-server kernel: [1317348.373455] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:54:58 home-server kernel: [1317348.374144] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:54:58 home-server kernel: [1317348.374828] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:54:58 home-server kernel: [1317348.375704] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:54:58 home-server kernel: [1317348.376575] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:54:58 home-server kernel: [1317348.377390] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:54:58 home-server kernel: [1317348.378126] Stack:
Mar  4 19:54:58 home-server kernel: [1317348.379132]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:54:58 home-server kernel: [1317348.379830]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:54:58 home-server kernel: [1317348.380562]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:54:58 home-server kernel: [1317348.381290] Call Trace:
Mar  4 19:54:58 home-server kernel: [1317348.382317]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:54:58 home-server kernel: [1317348.382978]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:54:58 home-server kernel: [1317348.383691]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:54:58 home-server kernel: [1317348.384484]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:54:58 home-server kernel: [1317348.385506]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:54:58 home-server kernel: [1317348.386141]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:54:58 home-server kernel: [1317348.386889]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:54:58 home-server kernel: [1317348.387620]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:54:58 home-server kernel: [1317348.388440]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:54:58 home-server kernel: [1317348.389141]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:54:58 home-server kernel: [1317348.389758] Code: d0 66 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 <8b> 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 
Mar  4 19:55:07 home-server kernel: [1317358.057229] INFO: rcu_sched self-detected stall on CPU
Mar  4 19:55:07 home-server kernel: [1317358.057955] 	5-...: (36715 ticks this GP) idle=a67/140000000000001/0 softirq=94313417/94313420 fqs=18334 
Mar  4 19:55:07 home-server kernel: [1317358.058648] 	 (t=36756 jiffies g=73084763 c=73084762 q=2034283)
Mar  4 19:55:07 home-server kernel: [1317358.059324] Task dump for CPU 5:
Mar  4 19:55:07 home-server kernel: [1317358.059982] htop            R  running task        0 13364  13287 0x0000000c
Mar  4 19:55:07 home-server kernel: [1317358.060643]  ffffffff99518ec0 ffffffff988a7dcb 0000000000000005 ffffffff99518ec0
Mar  4 19:55:07 home-server kernel: [1317358.061305]  ffffffff9898112b ffff9a9b1fb596c0 ffffffff9944fd00 0000000000000000
Mar  4 19:55:07 home-server kernel: [1317358.061953]  ffffffff99518ec0 00000000ffffffff ffffffff988e36fa 0000000000d52129
Mar  4 19:55:07 home-server kernel: [1317358.062587] Call Trace:
Mar  4 19:55:07 home-server kernel: [1317358.063206]  <IRQ> 
Mar  4 19:55:07 home-server kernel: [1317358.063215]  [<ffffffff988a7dcb>] ? sched_show_task+0xcb/0x130
Mar  4 19:55:07 home-server kernel: [1317358.063832]  [<ffffffff9898112b>] ? rcu_dump_cpu_stacks+0x92/0xb2
Mar  4 19:55:07 home-server kernel: [1317358.064445]  [<ffffffff988e36fa>] ? rcu_check_callbacks+0x75a/0x8b0
Mar  4 19:55:07 home-server kernel: [1317358.065041]  [<ffffffff988f1dc8>] ? update_wall_time+0x498/0x7b0
Mar  4 19:55:07 home-server kernel: [1317358.065644]  [<ffffffff988f9c30>] ? tick_sched_do_timer+0x30/0x30
Mar  4 19:55:07 home-server kernel: [1317358.066217]  [<ffffffff988ea2d8>] ? update_process_times+0x28/0x50
Mar  4 19:55:07 home-server kernel: [1317358.066774]  [<ffffffff988f9630>] ? tick_sched_handle.isra.12+0x20/0x50
Mar  4 19:55:07 home-server kernel: [1317358.067318]  [<ffffffff988f9c68>] ? tick_sched_timer+0x38/0x70
Mar  4 19:55:07 home-server kernel: [1317358.067845]  [<ffffffff988eadae>] ? __hrtimer_run_queues+0xde/0x250
Mar  4 19:55:07 home-server kernel: [1317358.068388]  [<ffffffff988eb48c>] ? hrtimer_interrupt+0x9c/0x1a0
Mar  4 19:55:07 home-server kernel: [1317358.068904]  [<ffffffff98e1c507>] ? smp_apic_timer_interrupt+0x47/0x60
Mar  4 19:55:07 home-server kernel: [1317358.069414]  [<ffffffff98e1ada6>] ? apic_timer_interrupt+0x96/0xa0
Mar  4 19:55:07 home-server kernel: [1317358.069914]  <EOI> 
Mar  4 19:55:07 home-server kernel: [1317358.069921]  [<ffffffff988c5402>] ? native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:55:07 home-server kernel: [1317358.070421]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:55:07 home-server kernel: [1317358.070945]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:55:07 home-server kernel: [1317358.071448]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:55:07 home-server kernel: [1317358.071948]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:55:07 home-server kernel: [1317358.072442]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:55:07 home-server kernel: [1317358.072933]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:55:07 home-server kernel: [1317358.073436]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:55:07 home-server kernel: [1317358.073920]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:55:07 home-server kernel: [1317358.074397]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:55:07 home-server kernel: [1317358.074865]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:55:34 home-server kernel: [1317384.359000] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:55:34 home-server kernel: [1317384.359544] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:55:34 home-server kernel: [1317384.362766]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:55:34 home-server kernel: [1317384.365836] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:55:34 home-server kernel: [1317384.366453] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:55:34 home-server kernel: [1317384.367095] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:55:34 home-server kernel: [1317384.367733] RIP: 0010:[<ffffffff988c5402>]  [<ffffffff988c5402>] native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:55:34 home-server kernel: [1317384.368330] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:55:34 home-server kernel: [1317384.368958] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:55:34 home-server kernel: [1317384.369538] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:55:34 home-server kernel: [1317384.370198] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:55:34 home-server kernel: [1317384.370777] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:55:34 home-server kernel: [1317384.371417] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:55:34 home-server kernel: [1317384.371991] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:55:34 home-server kernel: [1317384.372650] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:55:34 home-server kernel: [1317384.373242] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:55:34 home-server kernel: [1317384.373829] Stack:
Mar  4 19:55:34 home-server kernel: [1317384.374407]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:55:34 home-server kernel: [1317384.375067]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:55:34 home-server kernel: [1317384.375688]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:55:34 home-server kernel: [1317384.376283] Call Trace:
Mar  4 19:55:34 home-server kernel: [1317384.376895]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:55:34 home-server kernel: [1317384.377516]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:55:34 home-server kernel: [1317384.378165]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:55:34 home-server kernel: [1317384.378764]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:55:34 home-server kernel: [1317384.379439]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:55:34 home-server kernel: [1317384.380056]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:55:34 home-server kernel: [1317384.380710]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:55:34 home-server kernel: [1317384.381292]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:55:34 home-server kernel: [1317384.381877]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:55:34 home-server kernel: [1317384.382445]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:55:34 home-server kernel: [1317384.383107] Code: 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 8b 07 <84> c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 09 4d 
Mar  4 19:56:02 home-server kernel: [1317412.360884] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:56:02 home-server kernel: [1317412.361803] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:56:02 home-server kernel: [1317412.366654]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:56:02 home-server kernel: [1317412.370753] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:56:02 home-server kernel: [1317412.371438] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:56:02 home-server kernel: [1317412.372418] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:56:02 home-server kernel: [1317412.373131] RIP: 0010:[<ffffffff988c5402>]  [<ffffffff988c5402>] native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:56:02 home-server kernel: [1317412.373840] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:56:02 home-server kernel: [1317412.374811] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:56:02 home-server kernel: [1317412.375519] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:56:02 home-server kernel: [1317412.376236] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:56:02 home-server kernel: [1317412.376882] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:56:02 home-server kernel: [1317412.377823] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:56:02 home-server kernel: [1317412.378477] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:56:02 home-server kernel: [1317412.379157] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:56:02 home-server kernel: [1317412.380037] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:56:02 home-server kernel: [1317412.380773] Stack:
Mar  4 19:56:02 home-server kernel: [1317412.381518]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:56:02 home-server kernel: [1317412.382163]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:56:02 home-server kernel: [1317412.383211]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:56:02 home-server kernel: [1317412.383865] Call Trace:
Mar  4 19:56:02 home-server kernel: [1317412.384540]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:56:02 home-server kernel: [1317412.385370]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:56:02 home-server kernel: [1317412.386288]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:56:02 home-server kernel: [1317412.387000]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:56:02 home-server kernel: [1317412.387615]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:56:02 home-server kernel: [1317412.388528]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:56:02 home-server kernel: [1317412.389195]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:56:02 home-server kernel: [1317412.389920]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:56:02 home-server kernel: [1317412.390537]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:56:02 home-server kernel: [1317412.391577]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:56:02 home-server kernel: [1317412.392185] Code: 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 f3 90 8b 07 <84> c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90 4c 8b 09 4d 
Mar  4 19:56:10 home-server kernel: [1317421.073469] INFO: rcu_sched self-detected stall on CPU
Mar  4 19:56:10 home-server kernel: [1317421.074207] 	5-...: (52451 ticks this GP) idle=a67/140000000000001/0 softirq=94313417/94313420 fqs=26166 
Mar  4 19:56:10 home-server kernel: [1317421.075051] 	 (t=52509 jiffies g=73084763 c=73084762 q=2851583)
Mar  4 19:56:10 home-server kernel: [1317421.076215] Task dump for CPU 5:
Mar  4 19:56:10 home-server kernel: [1317421.076946] htop            R  running task        0 13364  13287 0x0000000c
Mar  4 19:56:10 home-server kernel: [1317421.077623]  ffffffff99518ec0 ffffffff988a7dcb 0000000000000005 ffffffff99518ec0
Mar  4 19:56:10 home-server kernel: [1317421.078771]  ffffffff9898112b ffff9a9b1fb596c0 ffffffff9944fd00 0000000000000000
Mar  4 19:56:10 home-server kernel: [1317421.079617]  ffffffff99518ec0 00000000ffffffff ffffffff988e36fa 0000000000d52129
Mar  4 19:56:10 home-server kernel: [1317421.080352] Call Trace:
Mar  4 19:56:10 home-server kernel: [1317421.080937]  <IRQ> 
Mar  4 19:56:10 home-server kernel: [1317421.080949]  [<ffffffff988a7dcb>] ? sched_show_task+0xcb/0x130
Mar  4 19:56:10 home-server kernel: [1317421.082110]  [<ffffffff9898112b>] ? rcu_dump_cpu_stacks+0x92/0xb2
Mar  4 19:56:10 home-server kernel: [1317421.082709]  [<ffffffff988e36fa>] ? rcu_check_callbacks+0x75a/0x8b0
Mar  4 19:56:10 home-server kernel: [1317421.083362]  [<ffffffff988f1dc8>] ? update_wall_time+0x498/0x7b0
Mar  4 19:56:10 home-server kernel: [1317421.083940]  [<ffffffff988f9c30>] ? tick_sched_do_timer+0x30/0x30
Mar  4 19:56:10 home-server kernel: [1317421.085143]  [<ffffffff988ea2d8>] ? update_process_times+0x28/0x50
Mar  4 19:56:10 home-server kernel: [1317421.085714]  [<ffffffff988f9630>] ? tick_sched_handle.isra.12+0x20/0x50
Mar  4 19:56:10 home-server kernel: [1317421.086344]  [<ffffffff988f9c68>] ? tick_sched_timer+0x38/0x70
Mar  4 19:56:10 home-server kernel: [1317421.086846]  [<ffffffff988eadae>] ? __hrtimer_run_queues+0xde/0x250
Mar  4 19:56:10 home-server kernel: [1317421.087608]  [<ffffffff988eb48c>] ? hrtimer_interrupt+0x9c/0x1a0
Mar  4 19:56:10 home-server kernel: [1317421.088549]  [<ffffffff98e1c507>] ? smp_apic_timer_interrupt+0x47/0x60
Mar  4 19:56:10 home-server kernel: [1317421.089082]  [<ffffffff98e1ada6>] ? apic_timer_interrupt+0x96/0xa0
Mar  4 19:56:10 home-server kernel: [1317421.089670]  <EOI> 
Mar  4 19:56:10 home-server kernel: [1317421.089677]  [<ffffffff988c5402>] ? native_queued_spin_lock_slowpath+0x172/0x190
Mar  4 19:56:10 home-server kernel: [1317421.090153]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:56:10 home-server kernel: [1317421.091028]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:56:10 home-server kernel: [1317421.091823]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:56:10 home-server kernel: [1317421.092360]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:56:10 home-server kernel: [1317421.092948]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:56:10 home-server kernel: [1317421.093413]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:56:10 home-server kernel: [1317421.094287]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:56:10 home-server kernel: [1317421.094930]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:56:10 home-server kernel: [1317421.095404]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:56:10 home-server kernel: [1317421.095988]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:56:38 home-server kernel: [1317448.363304] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:56:38 home-server kernel: [1317448.363787] Modules linked in: uas usb_storage vhost_net vhost fuse btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs libcrc32c macvtap macvlan xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp tun bridge stp llc ebtable_filter ebtables ip6_tables iptable_filter intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel eeepc_wmi kvm irqbypass asus_wmi sparse_keymap snd_hda_codec_realtek snd_hda_codec_hdmi snd_hda_codec_generic iTCO_wdt snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep iTCO_vendor_support i915 ppdev intel_cstate evdev rfkill joydev snd_pcm intel_uncore snd_timer intel_rapl_perf mei_me drm_kms_helper snd soundcore
Mar  4 19:56:38 home-server kernel: [1317448.366799]  serio_raw pcspkr lpc_ich shpchp parport_pc parport drm mei mfd_core sg wmi video i2c_algo_bit button ip_tables x_tables autofs4 ext4 crc16 jbd2 crc32c_generic fscrypto ecb mbcache algif_skcipher af_alg dm_crypt dm_mod hid_logitech_hidpp hid_logitech_dj usbhid hid sd_mod crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci aesni_intel libahci aes_x86_64 lrw gf128mul glue_helper ablk_helper libata cryptd psmouse xhci_pci scsi_mod ehci_pci xhci_hcd ehci_hcd i2c_i801 e1000e i2c_smbus r8169 mii ptp usbcore pps_core usb_common fan thermal
Mar  4 19:56:38 home-server kernel: [1317448.369668] CPU: 5 PID: 13364 Comm: htop Tainted: G      D      L  4.9.0-8-amd64 #1 Debian 4.9.130-2
Mar  4 19:56:38 home-server kernel: [1317448.370258] Hardware name: ASUS All Series/CS-B, BIOS 1203 12/10/2015
Mar  4 19:56:38 home-server kernel: [1317448.370842] task: ffff9a96352840c0 task.stack: ffffbad1c3b40000
Mar  4 19:56:38 home-server kernel: [1317448.371444] RIP: 0010:[<ffffffff988c53fe>]  [<ffffffff988c53fe>] native_queued_spin_lock_slowpath+0x16e/0x190
Mar  4 19:56:38 home-server kernel: [1317448.372024] RSP: 0018:ffffbad1c3b43c68  EFLAGS: 00000202
Mar  4 19:56:38 home-server kernel: [1317448.372592] RAX: 0000000000000101 RBX: ffff9a975684e078 RCX: 0000000000000001
Mar  4 19:56:38 home-server kernel: [1317448.373160] RDX: 0000000000000101 RSI: 0000000000000001 RDI: ffff9a9b0173e7a0
Mar  4 19:56:38 home-server kernel: [1317448.373726] RBP: ffffbad1c3b43c90 R08: 0000000000000101 R09: 0000000000000001
Mar  4 19:56:38 home-server kernel: [1317448.374292] R10: 000000005eeef6c0 R11: ffff9a975eeef6c0 R12: ffff9a9b0173e080
Mar  4 19:56:38 home-server kernel: [1317448.374855] R13: ffff9a9b0173e7a0 R14: 0000000000000004 R15: ffff9a975eeef6c0
Mar  4 19:56:38 home-server kernel: [1317448.375447] FS:  00007fa52e615380(0000) GS:ffff9a9b1fb40000(0000) knlGS:0000000000000000
Mar  4 19:56:38 home-server kernel: [1317448.376016] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 19:56:38 home-server kernel: [1317448.376583] CR2: 0000555a29a76000 CR3: 000000019b87a000 CR4: 0000000000162670
Mar  4 19:56:38 home-server kernel: [1317448.377149] Stack:
Mar  4 19:56:38 home-server kernel: [1317448.377706]  ffffffff98e18ead ffffffff98a7e82b ffffffff9902be80 ffff9a975eeef6c0
Mar  4 19:56:38 home-server kernel: [1317448.378283]  ffff9a9b0173e080 ffffffff9902be80 ffffffff98a7e96b ffffffff9902be80
Mar  4 19:56:38 home-server kernel: [1317448.378873]  ffffffff9902c600 0000000000000004 ffffffff98a7ea7b ffff9a9aed546838
Mar  4 19:56:38 home-server kernel: [1317448.379485] Call Trace:
Mar  4 19:56:38 home-server kernel: [1317448.380055]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20
Mar  4 19:56:38 home-server kernel: [1317448.380637]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0
Mar  4 19:56:38 home-server kernel: [1317448.381217]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0
Mar  4 19:56:38 home-server kernel: [1317448.381799]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0
Mar  4 19:56:38 home-server kernel: [1317448.382374]  [<ffffffff98a1aada>] ? path_openat+0x115a/0x14f0
Mar  4 19:56:38 home-server kernel: [1317448.382949]  [<ffffffff98a1c131>] ? do_filp_open+0x91/0x100
Mar  4 19:56:38 home-server kernel: [1317448.383555]  [<ffffffff98a06f7a>] ? __check_object_size+0xfa/0x1d8
Mar  4 19:56:38 home-server kernel: [1317448.384121]  [<ffffffff98a097ae>] ? do_sys_open+0x12e/0x210
Mar  4 19:56:38 home-server kernel: [1317448.384678]  [<ffffffff98803b7d>] ? do_syscall_64+0x8d/0xf0
Mar  4 19:56:38 home-server kernel: [1317448.385233]  [<ffffffff98e18f8e>] ? entry_SYSCALL_64_after_swapgs+0x58/0xc6
Mar  4 19:56:38 home-server kernel: [1317448.385796] Code: c2 89 d0 66 31 c0 41 39 c0 74 e6 4d 85 c9 c6 07 01 74 2d 41 c7 41 08 01 00 00 00 e9 53 ff ff ff 83 fa 01 74 17 8b 07 84 c0 74 08 <f3> 90 8b 07 84 c0 75 f8 b8 01 00 00 00 66 89 07 c3 f3 c3 f3 90

It would help to know which kernel you are running, in order to sort out a kernel problem…
It wouldn't be a preempt kernel, by any chance?
What is the machine used for (server? desktop?)

By the way, have you tried booting another kernel, to find out whether the current kernel is the buggy one?

Across all the crashes, is it always the same CPU that triggers the "soft lockup"?
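One way to answer that from the logs is to count the soft lockup reports per CPU and per process. Below is a rough Python sketch, not a definitive tool: the regular expression is only derived from the message format visible in this thread, and `summarize_lockups` is a made-up helper name. The sample lines are copied from the log above.

```python
import re
from collections import Counter

# Matches e.g. "soft lockup - CPU#5 stuck for 22s! [htop:13364]"
# (pattern assumed from the messages quoted in this thread).
LOCKUP_RE = re.compile(r"soft lockup - CPU#(\d+) stuck for (\d+)s! \[(.+?):(\d+)\]")


def summarize_lockups(lines):
    """Count soft lockup reports per (cpu, command, pid)."""
    counts = Counter()
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            cpu, _seconds, comm, pid = m.groups()
            counts[(int(cpu), comm, int(pid))] += 1
    return counts


sample = [
    "Mar  4 19:55:34 home-server kernel: [1317384.359000] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]",
    "Mar  4 19:56:02 home-server kernel: [1317412.360884] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]",
    "Mar  4 19:56:10 home-server kernel: [1317421.073469] INFO: rcu_sched self-detected stall on CPU",
]

summary = summarize_lockups(sample)
for (cpu, comm, pid), n in sorted(summary.items()):
    print(f"CPU#{cpu} {comm}[{pid}]: {n} report(s)")
```

To run it on real data, the `sample` list could be replaced by the lines of the kern.log files.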

That's a little over 15 days.
Hence my question: is the interval between crashes roughly the same each time?
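The "15 days" figure comes from the bracketed kernel timestamp, which counts seconds since boot. A minimal Python sketch of the conversion, assuming the line layout seen in this log (`uptime_days` is a hypothetical helper, and the value can drift slightly from the wall-clock uptime):

```python
import re

SECONDS_PER_DAY = 86400

# The bracketed value in a kernel log line is the time since boot, in seconds.
TIMESTAMP_RE = re.compile(r"kernel: \[\s*(\d+\.\d+)\]")


def uptime_days(line):
    """Return the time since boot, in days, for one kern.log line."""
    m = TIMESTAMP_RE.search(line)
    if m is None:
        raise ValueError("no kernel timestamp found")
    return float(m.group(1)) / SECONDS_PER_DAY


# First stall message of the log posted above.
first_stall = (
    "Mar  4 19:53:01 home-server kernel: [1317232.024737] "
    "INFO: rcu_sched self-detected stall on CPU"
)
days = uptime_days(first_stall)
print(f"{days:.2f} days since boot")
```

Applying the same conversion to the first lockup line of each incident would show whether the interval really is constant.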

Do you have physical access to the machine? Are you able to run things in a console?

All the call traces are identical, and it looks like the processor enters a _raw_spin_lock (the loop behind the soft lockup) during a proc_pid_make_inode, as far as I can tell.
I have to admit I don't really know what that corresponds to, but in any case it doesn't look like a misbehaving module (the messages would be different, a priori).
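To check that the traces really are identical without reading them by eye, the function names can be extracted and compared. This is only a sketch under assumptions: the frame pattern is inferred from the 4.9 trace format shown in this log, `frames` is a made-up helper, and only the first four frames of two of the traces above are used as sample data.

```python
import re

# Matches a frame such as " [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20"
# and captures the symbol name (format assumed from this log).
FRAME_RE = re.compile(r"\[<[0-9a-f]+>\] \??\s*([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+")


def frames(lines):
    """Return the list of function names found in call trace lines."""
    names = []
    for line in lines:
        m = FRAME_RE.search(line)
        if m:
            names.append(m.group(1))
    return names


trace_a = [
    "Mar  4 19:55:34 home-server kernel: [1317384.376895]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20",
    "Mar  4 19:55:34 home-server kernel: [1317384.377516]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0",
    "Mar  4 19:55:34 home-server kernel: [1317384.378165]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0",
    "Mar  4 19:55:34 home-server kernel: [1317384.378764]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0",
]
trace_b = [
    "Mar  4 19:56:02 home-server kernel: [1317412.384540]  [<ffffffff98e18ead>] ? _raw_spin_lock+0x1d/0x20",
    "Mar  4 19:56:02 home-server kernel: [1317412.385370]  [<ffffffff98a7e82b>] ? proc_pid_make_inode+0x7b/0xe0",
    "Mar  4 19:56:02 home-server kernel: [1317412.386288]  [<ffffffff98a7e96b>] ? proc_pident_instantiate+0x1b/0xa0",
    "Mar  4 19:56:02 home-server kernel: [1317412.387000]  [<ffffffff98a7ea7b>] ? proc_pident_lookup+0x8b/0xd0",
]

same = frames(trace_a) == frames(trace_b)
print("identical traces" if same else "traces differ")
print(" <- ".join(frames(trace_a)))
```

The innermost frame comes first, so the output reads from the spin lock outwards to the /proc lookup that requested it.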

Oh no, this is either the hardware or the kernel, but it isn't Debian.

Thanks for your reply!

It's a server.

Yes, sorry, I should have specified the kernel:

Linux home-server 4.9.0-8-amd64 #1 SMP Debian 4.9.144-3 (2019-02-02) x86_64 GNU/Linux

I have the previous version (4.9.0-7) available:

ls /boot
config-4.9.0-7-amd64  config-4.9.0-8-amd64  grub  initrd.img-4.9.0-7-amd64  initrd.img-4.9.0-8-amd64  lost+found  System.map-4.9.0-7-amd64  System.map-4.9.0-8-amd64  vmlinuz-4.9.0-7-amd64  vmlinuz-4.9.0-8-amd64

I haven't tried booting it.

Yes, indeed, a little over 15 days. Until now I hadn't dug very deep, but looking at the kern.log files today, I see that I had the same problem on 17 February and 4 December. I don't remember exactly whether I had to force-reboot the machine to get out of it.

Interesting find: every time, the problem involved htop:

cat kern.log* | grep -i "soft lockup"
Mar  4 19:53:26 home-server kernel: [1317256.350378] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:53:54 home-server kernel: [1317284.352266] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:54:30 home-server kernel: [1317320.354691] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [htop:13364]
Mar  4 19:54:58 home-server kernel: [1317348.356576] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [htop:13364]
Mar  4 19:55:34 home-server kernel: [1317384.359000] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:56:02 home-server kernel: [1317412.360884] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Mar  4 19:56:38 home-server kernel: [1317448.363304] NMI watchdog: BUG: soft lockup - CPU#5 stuck for 22s! [htop:13364]
Dec  5 23:04:55 home-server kernel: [181207.357589] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:05:23 home-server kernel: [181235.358135] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:05:59 home-server kernel: [181271.358838] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:06:27 home-server kernel: [181299.359384] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:07:03 home-server kernel: [181335.359811] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:07:31 home-server kernel: [181363.360137] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:08:03 home-server kernel: [181395.360520] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:08:31 home-server kernel: [181423.360866] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:09:07 home-server kernel: [181459.361324] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:09:35 home-server kernel: [181487.361689] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:10:11 home-server kernel: [181523.362172] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:10:15 home-server kernel: [181527.350227] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:10:39 home-server kernel: [181551.362555] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:10:43 home-server kernel: [181555.350610] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:11:11 home-server kernel: [181583.351001] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:11:15 home-server kernel: [181587.363058] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:11:39 home-server kernel: [181611.351399] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:12:07 home-server kernel: [181615.363457] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:12:08 home-server kernel: [181639.351803] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:12:20 home-server kernel: [181647.363919] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:12:35 home-server kernel: [181667.352212] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:12:43 home-server kernel: [181675.364330] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:13:03 home-server kernel: [181695.352629] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:13:19 home-server kernel: [181711.364868] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:13:31 home-server kernel: [181723.353049] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 23s! [top:6768]
Dec  5 23:13:47 home-server kernel: [181739.365292] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:13:59 home-server kernel: [181751.353475] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 23s! [top:6768]
Dec  5 23:14:23 home-server kernel: [181775.365843] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:14:27 home-server kernel: [181779.353905] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 23s! [top:6768]
Dec  5 23:14:51 home-server kernel: [181803.366278] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:14:55 home-server kernel: [181807.354341] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 23s! [top:6768]
Dec  5 23:15:23 home-server kernel: [181835.354779] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:15:27 home-server kernel: [181839.366842] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:15:51 home-server kernel: [181863.355222] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:15:55 home-server kernel: [181867.367286] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:16:19 home-server kernel: [181891.355669] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:16:27 home-server kernel: [181899.367797] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:16:47 home-server kernel: [181919.356121] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:16:55 home-server kernel: [181927.368248] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 23s! [htop:6630]
Dec  5 23:17:15 home-server kernel: [181947.356573] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Dec  5 23:17:31 home-server kernel: [181963.368834] NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [htop:6630]
Dec  5 23:17:43 home-server kernel: [181975.357030] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [top:6768]
Feb 17 12:25:29 home-server kernel: [701367.120480] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:25:57 home-server kernel: [701395.123473] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:26:29 home-server kernel: [701427.126892] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:26:57 home-server kernel: [701455.129883] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:27:33 home-server kernel: [701491.133728] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:28:01 home-server kernel: [701519.136717] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:28:37 home-server kernel: [701555.140559] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:29:05 home-server kernel: [701583.143547] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:29:41 home-server kernel: [701619.147387] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:30:09 home-server kernel: [701647.150373] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:30:41 home-server kernel: [701679.153785] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:31:09 home-server kernel: [701707.156770] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:31:45 home-server kernel: [701743.160607] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:32:13 home-server kernel: [701771.163591] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:32:49 home-server kernel: [701807.167426] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:33:17 home-server kernel: [701835.170409] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:33:53 home-server kernel: [701871.174243] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:34:21 home-server kernel: [701899.177225] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:34:53 home-server kernel: [701931.180632] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:35:21 home-server kernel: [701959.183613] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:35:57 home-server kernel: [701995.187446] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:36:25 home-server kernel: [702023.190426] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:37:01 home-server kernel: [702059.194258] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:37:29 home-server kernel: [702087.197238] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:38:05 home-server kernel: [702123.201069] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:38:33 home-server kernel: [702151.204048] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:39:05 home-server kernel: [702183.207452] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:39:33 home-server kernel: [702211.210431] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:40:09 home-server kernel: [702247.214261] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:40:37 home-server kernel: [702275.217239] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:41:13 home-server kernel: [702311.221068] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:41:41 home-server kernel: [702339.224046] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:42:17 home-server kernel: [702375.227875] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:42:45 home-server kernel: [702403.230853] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:43:17 home-server kernel: [702435.234256] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:43:45 home-server kernel: [702463.237233] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:44:21 home-server kernel: [702499.241061] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:44:49 home-server kernel: [702527.244038] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:45:25 home-server kernel: [702563.247866] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:45:53 home-server kernel: [702591.250843] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:46:29 home-server kernel: [702627.254670] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:46:57 home-server kernel: [702655.257647] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:47:29 home-server kernel: [702687.261048] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:47:57 home-server kernel: [702715.264025] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:48:33 home-server kernel: [702751.267852] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:49:01 home-server kernel: [702779.270828] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:49:37 home-server kernel: [702815.274655] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:50:05 home-server kernel: [702843.277632] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:50:41 home-server kernel: [702879.281458] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:51:09 home-server kernel: [702907.284434] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:51:41 home-server kernel: [702939.287835] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:52:09 home-server kernel: [702967.290811] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:52:45 home-server kernel: [703003.294637] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:53:13 home-server kernel: [703031.297613] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:53:49 home-server kernel: [703067.301439] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:54:17 home-server kernel: [703095.304428] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:54:53 home-server kernel: [703131.308454] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:55:21 home-server kernel: [703159.311577] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:55:53 home-server kernel: [703191.315139] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:56:21 home-server kernel: [703219.318249] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:56:57 home-server kernel: [703255.322238] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:57:25 home-server kernel: [703283.325335] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:58:01 home-server kernel: [703319.329309] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:58:29 home-server kernel: [703347.332395] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 12:59:05 home-server kernel: [703383.336356] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 12:59:33 home-server kernel: [703411.339431] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:00:05 home-server kernel: [703443.342941] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:00:33 home-server kernel: [703471.346008] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:01:09 home-server kernel: [703507.349947] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:01:37 home-server kernel: [703535.353006] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:02:13 home-server kernel: [703571.356935] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:02:41 home-server kernel: [703599.359988] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:03:17 home-server kernel: [703635.363909] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:03:45 home-server kernel: [703663.366955] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:04:17 home-server kernel: [703695.370434] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:04:45 home-server kernel: [703723.373475] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:05:21 home-server kernel: [703759.377382] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:05:49 home-server kernel: [703787.380419] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:06:25 home-server kernel: [703823.384320] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:06:53 home-server kernel: [703851.387353] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:07:29 home-server kernel: [703887.391249] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:07:57 home-server kernel: [703915.394277] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:08:33 home-server kernel: [703951.398169] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:09:01 home-server kernel: [703979.401194] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:09:33 home-server kernel: [704011.404650] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:10:01 home-server kernel: [704039.407673] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:10:37 home-server kernel: [704075.411557] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:11:05 home-server kernel: [704103.414578] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:11:41 home-server kernel: [704139.418459] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:12:09 home-server kernel: [704167.421477] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:12:45 home-server kernel: [704203.425355] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:13:13 home-server kernel: [704231.428371] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:13:45 home-server kernel: [704263.431817] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:14:13 home-server kernel: [704291.434831] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:14:49 home-server kernel: [704327.438705] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:15:17 home-server kernel: [704355.441717] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:15:53 home-server kernel: [704391.445590] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:16:21 home-server kernel: [704419.448601] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:16:57 home-server kernel: [704455.452471] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:17:25 home-server kernel: [704483.455481] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:17:57 home-server kernel: [704515.458920] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:18:25 home-server kernel: [704543.461929] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:19:01 home-server kernel: [704579.465797] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:19:29 home-server kernel: [704607.468805] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:20:05 home-server kernel: [704643.472672] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:20:33 home-server kernel: [704671.475679] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:21:09 home-server kernel: [704707.479544] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:21:37 home-server kernel: [704735.482551] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:22:09 home-server kernel: [704767.485986] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:22:37 home-server kernel: [704795.488992] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:23:13 home-server kernel: [704831.492856] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:23:41 home-server kernel: [704859.495861] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:24:17 home-server kernel: [704895.499724] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:24:45 home-server kernel: [704923.502729] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:25:21 home-server kernel: [704959.506591] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:25:49 home-server kernel: [704987.509595] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:26:21 home-server kernel: [705019.513028] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:26:49 home-server kernel: [705047.516032] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:27:25 home-server kernel: [705083.519894] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:27:53 home-server kernel: [705111.522897] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:28:29 home-server kernel: [705147.526679] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:28:57 home-server kernel: [705175.529328] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:29:33 home-server kernel: [705211.532759] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:30:01 home-server kernel: [705239.535444] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:30:33 home-server kernel: [705271.538532] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:31:01 home-server kernel: [705299.541248] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:31:37 home-server kernel: [705335.544759] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:32:05 home-server kernel: [705363.547504] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:32:41 home-server kernel: [705399.551050] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:33:09 home-server kernel: [705427.553820] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:33:45 home-server kernel: [705463.557397] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:34:13 home-server kernel: [705491.560189] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:34:45 home-server kernel: [705523.563392] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:35:13 home-server kernel: [705551.566203] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:35:49 home-server kernel: [705587.569829] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:36:17 home-server kernel: [705615.572657] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:36:53 home-server kernel: [705651.576305] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:37:21 home-server kernel: [705679.579149] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:37:57 home-server kernel: [705715.582814] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:38:25 home-server kernel: [705743.585672] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:38:57 home-server kernel: [705775.588945] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:39:25 home-server kernel: [705803.591814] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:40:01 home-server kernel: [705839.595510] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:40:29 home-server kernel: [705867.598390] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:41:05 home-server kernel: [705903.602099] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:41:33 home-server kernel: [705931.604988] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:42:09 home-server kernel: [705967.608708] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:42:37 home-server kernel: [705995.611606] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:43:09 home-server kernel: [706027.614922] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:43:37 home-server kernel: [706055.617826] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:44:13 home-server kernel: [706091.621566] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:44:41 home-server kernel: [706119.624476] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:45:17 home-server kernel: [706155.628223] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:45:45 home-server kernel: [706183.631140] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:46:21 home-server kernel: [706219.634894] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:46:49 home-server kernel: [706247.637816] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:47:21 home-server kernel: [706279.641158] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:47:49 home-server kernel: [706307.644084] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:48:25 home-server kernel: [706343.647849] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:48:53 home-server kernel: [706371.650779] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:49:29 home-server kernel: [706407.654549] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:49:57 home-server kernel: [706435.657482] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:50:33 home-server kernel: [706471.661257] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:51:01 home-server kernel: [706499.664194] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:51:33 home-server kernel: [706531.667551] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:52:01 home-server kernel: [706559.670491] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:52:37 home-server kernel: [706595.674272] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:53:05 home-server kernel: [706623.677214] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:53:41 home-server kernel: [706659.680998] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:54:09 home-server kernel: [706687.683942] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:54:45 home-server kernel: [706723.687728] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:55:13 home-server kernel: [706751.690674] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:55:45 home-server kernel: [706783.694042] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:56:13 home-server kernel: [706811.696989] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:56:49 home-server kernel: [706847.700780] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:57:17 home-server kernel: [706875.703729] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 23s! [htop:13245]
Feb 17 13:57:53 home-server kernel: [706911.707522] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:58:21 home-server kernel: [706939.710472] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:58:57 home-server kernel: [706975.714266] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]
Feb 17 13:59:25 home-server kernel: [707003.717218] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [htop:13245]

Yes, but the problem is that I have no screen connected and the server is not very easy to get to…

No, apparently 1, 3, 5, and 7 (see the logs above).
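To check which CPUs and processes show up without reading the log by eye, something along these lines can tally the soft lockup messages. This is only a sketch: `summarize_lockups` is a made-up helper name, and it assumes the message format seen in the logs above (`CPU#N stuck for Ns! [command:pid]`).

```shell
#!/bin/sh
# summarize_lockups: hypothetical helper, reads kernel log lines on stdin
# and prints "<count> CPU<n> <command>" sorted by decreasing count.
summarize_lockups() {
    grep 'soft lockup' \
        | sed -n 's/.*CPU#\([0-9][0-9]*\) stuck for [0-9]*s! \[\([^:]*\):[0-9]*\].*/CPU\1 \2/p' \
        | sort | uniq -c | sort -rn
}

# Usage (log path taken from the first post, adjust as needed):
# summarize_lockups < /var/log/kern.log.1
```

Each output line gives the number of messages for one CPU/command pair, which makes it easy to see whether the lockups are spread over several cores or always involve the same kind of process.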

The one you are running was built on 02/02/2019, so I assume you were running the previous version before that.
Yet there were already problems in December, with your other kernel.
You should try a really different kernel version.
I personally run a linux-image-4.19.0-0.bpo.2-amd64 kernel from stretch-backports:
its stability may be worth testing, until we know where the instability on 4.9 comes from.

There is also some top in there, which is interesting and consistent with the stack trace, which mentions proc_pid_lookup every time: both are commands that query PIDs heavily.

But I am reaching my limits; apart from recommending that you test another kernel to get rid of the problem, I do not really see what else to do:
OK, there is a problem when *top tries to lock something on the PIDs, but where that comes from…

Thanks for your reply!

I will try a more recent kernel as soon as I have a moment. The catch is that it can be stable for days before crashing, so the test may take a while :slight_smile:

Well, for now I have another problem… one that had already shown up twice recently. The filesystem was remounted read-only following a checksum error on an inode:

[28944.730610] EXT4-fs error (device dm-1): ext4_iget:4527: inode #109320614: comm du: checksum invalid
[28944.773196] Aborting journal on device dm-1-8.
[28944.831727] EXT4-fs (dm-1): Remounting filesystem read-only
[28944.874572] EXT4-fs error (device dm-1) in ext4_do_update_inode:4975: Journal has aborted
[28944.906541] EXT4-fs error (device dm-1) in ext4_da_write_end:3076: IO failure

Could the disk be dying? It is not very old, though.

While searching online, I saw suggestions to disable the metadata_csum feature to fix this problem:

tune2fs -O ^metadata_csum /dev/xxx

What do you think?
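Before changing anything, it may help to confirm that the feature is actually enabled on the filesystem. A minimal sketch, assuming the `Filesystem features:` line printed by `tune2fs -l`; `has_fs_feature` is a hypothetical helper name and `/dev/dm-1` is simply the device named in the error messages above:

```shell
#!/bin/sh
# has_fs_feature: hypothetical helper, reads `tune2fs -l` output on stdin
# and exits 0 if the given feature is listed, 1 otherwise.
has_fs_feature() {
    feature="$1"
    awk -v f="$feature" -F: '/^Filesystem features:/ {
        n = split($2, a, " ")
        for (i = 1; i <= n; i++) if (a[i] == f) found = 1
    } END { exit found ? 0 : 1 }'
}

# Usage (needs root; device name taken from the log above):
# tune2fs -l /dev/dm-1 | has_fs_feature metadata_csum && echo "metadata_csum is enabled"
```

Note that turning the checksums off would only hide the symptom reported by the kernel, not explain why the checksum was wrong in the first place.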

Normally, another problem => another topic.

Well, before crying wolf, you can try rebuilding the journal:

systemctl isolate rescue.target
mount -n -o remount,ro /dev/dm-1      # I am not sure of the partition's real name, but anyway
fsck -y /dev/dm-1
tune2fs -O ^has_journal /dev/dm-1     # drop the journal
mount -n -o remount,rw /dev/dm-1
tune2fs -j /dev/dm-1                  # recreate the journal
mount -n -o remount,ro /dev/dm-1
fsck -f /dev/dm-1
reboot

BUT

Oddly, this inode business shows up again; it was already there in the soft lockup.
Could the two be related?
What does df -i say?
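For reference, `df -i` reports inode usage per filesystem, and the column to watch is IUse%. A small sketch that flags filesystems at or above a given percentage; `flag_inode_usage` is a hypothetical helper name and the 90% default is an arbitrary assumption, not a value from this thread:

```shell
#!/bin/sh
# flag_inode_usage: hypothetical helper, reads `df -iP` output on stdin and
# prints "<filesystem> <IUse%> <mount point>" for entries at or above the threshold.
flag_inode_usage() {
    threshold="${1:-90}"
    awk -v t="$threshold" 'NR > 1 {
        use = $5
        sub(/%/, "", use)
        # skip pseudo filesystems that report "-" instead of a percentage
        if (use ~ /^[0-9]+$/ && use + 0 >= t) print $1, $5, $6
    }'
}

# Usage:
# df -iP | flag_inode_usage 90
```

An empty result means no filesystem is close to running out of inodes, which would rule out inode exhaustion as an explanation.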

I realize I never came back to this thread (shame on me).
Outcome: switching to the linux-image-4.19.0-0.bpo.2-amd64 kernel, as @mattotop suggested, solved the problem (whose cause will remain unknown).

As for the disk problem, it has not recurred. From memory, the df -i command did not return anything alarming.

Thanks @mattotop!

This topic can be marked as solved, then, but I do not know how to do that.

Use the little check mark you will find by clicking the three small dots on a post that actually answered your problem. I have just done it for you.
