Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[SOLVED] Kernel Oops / Can only boot one kernel
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
fidel
Guru
Guru


Joined: 16 Jul 2004
Posts: 407
Location: CH

PostPosted: Tue Jun 27, 2017 7:24 am    Post subject: [SOLVED] Kernel Oops / Can only boot one kernel Reply with quote

Lately I get lots of kernel oops that result in a stalled system. If it happens, I have to reboot the machine. Something very weird is that on this machine I haven't managed to boot any other kernel than 4.9.6-gentoo-r1. I wanted to update to 4.9.16 but after / when grub loads the kernel, the system immediately stalls. This also happened with the version 4.11.4, for both kernels I used make oldconfig with the config of the currently running (but oopsing) kernel 4.9.6-gentoo-r1.
The thing is that those problems started at some time - before I never had such issues, I could always update the system and I am using more or less the same system on a different machine, without problems.

I can find the following output in the journal (using systemd):
Code:
Jun 27 08:46:10 daves-gentoo kernel: BUG: unable to handle kernel paging request at ffffffffffffffff
Jun 27 08:46:10 daves-gentoo kernel: IP: [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel: PGD 2c09067
Jun 27 08:46:10 daves-gentoo kernel: PUD 2c0b067
Jun 27 08:46:10 daves-gentoo kernel: PMD 0
Jun 27 08:46:10 daves-gentoo kernel:
Jun 27 08:46:10 daves-gentoo kernel: Oops: 0000 [#1] PREEMPT SMP
Jun 27 08:46:10 daves-gentoo kernel: Modules linked in: tun veth bridge stp llc arc4 ecb md4 cifs fuse ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_log_ipv4 nf_log_common xt_LOG xt_limit xt_multiport xt_recent peak_usb can_dev snd_hda_codec_hdmi snd_hda_c
Jun 27 08:46:10 daves-gentoo kernel: CPU: 6 PID: 1200 Comm: xfce4-appfinder Not tainted 4.9.6-gentoo-r1 #1
Jun 27 08:46:10 daves-gentoo kernel: Hardware name: ASUS All Series/Z87-A, BIOS 2103 08/15/2014
Jun 27 08:46:10 daves-gentoo kernel: task: ffff8803fb5fd780 task.stack: ffffc90005e10000
Jun 27 08:46:10 daves-gentoo kernel: RIP: 0010:[<ffffffff8113e01d>]  [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel: RSP: 0018:ffffc90005e13d30  EFLAGS: 00010282
Jun 27 08:46:10 daves-gentoo kernel: RAX: 0000000000000000 RBX: ffff88040f0e4730 RCX: 00000000006722c6
Jun 27 08:46:10 daves-gentoo kernel: RDX: 0000000000672286 RSI: 0000000002000200 RDI: ffff880419fcb480
Jun 27 08:46:10 daves-gentoo kernel: RBP: ffffffffffffffff R08: 000000000001a4a0 R09: ffff8803e77217e8
Jun 27 08:46:10 daves-gentoo kernel: R10: 0000000000000045 R11: ffff8804000070eb R12: ffff88041f006c80
Jun 27 08:46:10 daves-gentoo kernel: R13: 0000000002000200 R14: ffffffff81124b31 R15: 00007f941bea0c50
Jun 27 08:46:10 daves-gentoo kernel: FS:  00007f941bea0980(0000) GS:ffff88042fb80000(0000) knlGS:0000000000000000
Jun 27 08:46:10 daves-gentoo kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 27 08:46:10 daves-gentoo kernel: CR2: ffffffffffffffff CR3: 00000003e7734000 CR4: 00000000001426e0
Jun 27 08:46:10 daves-gentoo kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 27 08:46:10 daves-gentoo kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jun 27 08:46:10 daves-gentoo kernel: Stack:
Jun 27 08:46:10 daves-gentoo kernel:  ffff8803fb5fd780 ffff88040f0e4730 0000000000000000 ffff8803d3767300
Jun 27 08:46:10 daves-gentoo kernel:  ffff88040f0e4730 ffff88040f0e47a8 ffffffff81124b31 ffff8803d83b14c8
Jun 27 08:46:10 daves-gentoo kernel:  ffff8803d83b1450 ffff8803fb5fd780 ffff88040f0e4730 ffff8803d83b1450
Jun 27 08:46:10 daves-gentoo kernel: Call Trace:
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff81124b31>] ? anon_vma_clone+0x61/0x1f0
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff81124ce8>] ? anon_vma_fork+0x28/0x130
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8104c590>] ? copy_process.part.39+0xc30/0x1b60
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8104d681>] ? _do_fork+0xd1/0x320
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8115e5f2>] ? __do_pipe_flags+0x52/0xd0
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8100144b>] ? do_syscall_64+0x4b/0xb0
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff816f492b>] ? entry_SYSCALL64_slow_path+0x25/0x25
Jun 27 08:46:10 daves-gentoo kernel: Code: 03 05 50 c1 ec 7e 49 8b 48 08 48 39 ca 75 e7 49 8b 28 48 85 ed 74 54 49 83 78 10 00 74 4d 49 63 44 24 20 48 8d 4a 40 4d 8b 04 24 <48> 8b 5c 05 00 48 89 e8 65 49 0f c7 08 0f 94 c0 84 c0 74 b3 49
Jun 27 08:46:10 daves-gentoo kernel: RIP  [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel:  RSP <ffffc90005e13d30>
Jun 27 08:46:10 daves-gentoo kernel: CR2: ffffffffffffffff
Jun 27 08:46:10 daves-gentoo kernel: ---[ end trace 89268a446c99e34d ]---
Jun 27 08:46:10 daves-gentoo kernel: BUG: unable to handle kernel paging request at ffffffffffffffff
Jun 27 08:46:10 daves-gentoo kernel: IP: [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel: PGD 2c09067
Jun 27 08:46:10 daves-gentoo kernel: PUD 2c0b067
Jun 27 08:46:10 daves-gentoo kernel: PMD 0
Jun 27 08:46:10 daves-gentoo kernel:
Jun 27 08:46:10 daves-gentoo kernel: Oops: 0000 [#2] PREEMPT SMP
Jun 27 08:46:10 daves-gentoo kernel: Modules linked in: tun veth bridge stp llc arc4 ecb md4 cifs fuse ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_log_ipv4 nf_log_common xt_LOG xt_limit xt_multiport xt_recent peak_usb can_dev snd_hda_codec_hdmi snd_hda_c
Jun 27 08:46:10 daves-gentoo kernel: CPU: 6 PID: 23922 Comm: chrome Tainted: G      D         4.9.6-gentoo-r1 #1
Jun 27 08:46:10 daves-gentoo kernel: Hardware name: ASUS All Series/Z87-A, BIOS 2103 08/15/2014
Jun 27 08:46:10 daves-gentoo kernel: task: ffff8803f8fd9900 task.stack: ffffc900181b4000
Jun 27 08:46:10 daves-gentoo kernel: RIP: 0010:[<ffffffff8113e01d>]  [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel: RSP: 0000:ffffc900181b7e00  EFLAGS: 00010282
Jun 27 08:46:10 daves-gentoo kernel: RAX: 0000000000000000 RBX: ffff88040f0e4ac8 RCX: 00000000006722c6
Jun 27 08:46:10 daves-gentoo kernel: RDX: 0000000000672286 RSI: 00000000024000c0 RDI: ffff880419fc12c0
Jun 27 08:46:10 daves-gentoo kernel: RBP: ffffffffffffffff R08: 000000000001a4a0 R09: 000000000001a3a8
Jun 27 08:46:10 daves-gentoo kernel: R10: 0000000000000002 R11: ffff88042fdf7000 R12: ffff88041f006c80
Jun 27 08:46:10 daves-gentoo kernel: R13: 00000000024000c0 R14: ffffffff811247dd R15: ffff880419fc12c0
Jun 27 08:46:10 daves-gentoo kernel: FS:  00007f632ba1ca40(0000) GS:ffff88042fb80000(0000) knlGS:0000000000000000
Jun 27 08:46:10 daves-gentoo kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 27 08:46:10 daves-gentoo kernel: CR2: ffffffffffffffff CR3: 00000004045c2000 CR4: 00000000001426e0
Jun 27 08:46:10 daves-gentoo kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 27 08:46:10 daves-gentoo kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jun 27 08:46:10 daves-gentoo kernel: Stack:
Jun 27 08:46:10 daves-gentoo kernel:  ffff88040f0e4ac8 ffff88040f0e4ac8 ffff88040f0e4ac8 ffff88040f0e4ac8
Jun 27 08:46:10 daves-gentoo kernel:  ffff880419fc12c0 ffffc900181b7e88 ffffffff811247dd 0000000000000002
Jun 27 08:46:10 daves-gentoo kernel:  ffff88040f0e4ac8 ffff88040f0e4ac8 0000000000000040 ffffc900181b7e88
Jun 27 08:46:10 daves-gentoo kernel: Call Trace:
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff811247dd>] ? anon_vma_prepare+0x2d/0x170
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8111a3e8>] ? handle_mm_fault+0xee8/0x1150
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff8103fbf7>] ? __do_page_fault+0x1f7/0x470
Jun 27 08:46:10 daves-gentoo kernel:  [<ffffffff816f5f28>] ? page_fault+0x28/0x30
Jun 27 08:46:10 daves-gentoo kernel: Code: 03 05 50 c1 ec 7e 49 8b 48 08 48 39 ca 75 e7 49 8b 28 48 85 ed 74 54 49 83 78 10 00 74 4d 49 63 44 24 20 48 8d 4a 40 4d 8b 04 24 <48> 8b 5c 05 00 48 89 e8 65 49 0f c7 08 0f 94 c0 84 c0 74 b3 49
Jun 27 08:46:10 daves-gentoo kernel: RIP  [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120
Jun 27 08:46:10 daves-gentoo kernel:  RSP <ffffc900181b7e00>
Jun 27 08:46:10 daves-gentoo kernel: CR2: ffffffffffffffff
Jun 27 08:46:10 daves-gentoo kernel: ---[ end trace 89268a446c99e34e ]---
Jun 27 08:46:11 daves-gentoo kernel: BUG: unable to handle kernel paging request at ffffffffffffffff
Jun 27 08:46:11 daves-gentoo kernel: IP: [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120


Its not a Skylake or Kaby lake CPU:

Code:
processor   : 7
vendor_id   : GenuineIntel
cpu family   : 6
model      : 60
model name   : Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz
stepping   : 3
microcode   : 0x19
cpu MHz      : 3399.169
cache size   : 8192 KB
physical id   : 0
siblings   : 8
core id      : 3
cpu cores   : 4
apicid      : 7
initial apicid   : 7
fpu      : yes
fpu_exception   : yes
cpuid level   : 13
wp      : yes
flags      : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm epb tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm xsaveopt dtherm ida arat pln pts
bugs      :
bogomips   : 6800.77
clflush size   : 64
cache_alignment   : 64
address sizes   : 39 bits physical, 48 bits virtual
power management:


Has anyone an idea where I should look at? Thanks in advance for any hints!


Last edited by fidel on Thu Jun 29, 2017 10:14 am; edited 1 time in total
Back to top
View user's profile Send private message
ian.au
Guru
Guru


Joined: 07 Apr 2011
Posts: 434
Location: Australia

PostPosted: Tue Jun 27, 2017 10:36 pm    Post subject: Reply with quote

Quote:
kernel: RIP: 0010:[<ffffffff8113e01d>] [<ffffffff8113e01d>] kmem_cache_alloc+0x6d/0x120

hi fidel,
I'm no kernel hacker but notice that all of those oops' occur in the same function: kmem_cache_alloc
Have you tested your ram? This could be hardware related, as it seems to have started suddenly and not after a significant update / hardware change.
If the ram checks out, you're in for a tough time finding which application / flaky driver / sputtering hardware is causing the behaviour, it looks like graphics paging of some sort fails, but that could be a symptom of the kmem_cache issue, or caused by firmware missing if you're booting an initramfs.
Only other thought is maybe something is expecting swap space before it is available (or if you don't have any swap allocated).
Hopefully something there to start on anyway,

Cheers
Back to top
View user's profile Send private message
fidel
Guru
Guru


Joined: 16 Jul 2004
Posts: 407
Location: CH

PostPosted: Wed Jun 28, 2017 1:11 pm    Post subject: Reply with quote

Thanks a lot! These are indeed useful hints! I will start with checking the ram.
Back to top
View user's profile Send private message
fidel
Guru
Guru


Joined: 16 Jul 2004
Posts: 407
Location: CH

PostPosted: Thu Jun 29, 2017 10:14 am    Post subject: Reply with quote

Thanks again, it was the RAM, memtest86 shows lots of errors.
Back to top
View user's profile Send private message
ian.au
Guru
Guru


Joined: 07 Apr 2011
Posts: 434
Location: Australia

PostPosted: Thu Jun 29, 2017 12:49 pm    Post subject: Reply with quote

Thanks for the feedback, fidel - I'm glad it helped.

Cheers
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum