Gentoo Forums
Only one core using kernel 4.9
axl
Guru


Joined: 11 Oct 2002
Posts: 537
Location: Romania

Posted: Tue Feb 21, 2017 9:18 pm

I've experienced the same thing, on an HP ProLiant DL380 G7.

https://bugzilla.kernel.org/show_bug.cgi?id=194501

This is my bug report. I suspect the problem is in the ACPI code. My dmesg shows:

Code:
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] ------------[ cut here ]------------
[    0.000000] WARNING: CPU: 0 PID: 0 at arch/x86/kernel/apic/apic.c:2065 __generic_processor_info+0x297/0x360
[    0.000000] Only 7 processors supported.Processor 8/0x2 and the rest are ignored.
[    0.000000] Modules linked in:
[    0.000000] CPU: 0 PID: 0 Comm: swapper Not tainted 4.9.8-gentoo #2
[    0.000000] Hardware name: HP ProLiant DL380 G7, BIOS P67 07/02/2013
[    0.000000]  0000000000000000 ffffffff8120c34d ffffffff81803dd0 0000000000000000
[    0.000000]  ffffffff810415d4 0000000000000008 ffffffff81803e20 0000000000000000
[    0.000000]  0000000000000008 0000000000000015 0000000000000001 ffffffff8104163a
[    0.000000] Call Trace:
[    0.000000]  [<ffffffff8120c34d>] ? dump_stack+0x46/0x59
[    0.000000]  [<ffffffff810415d4>] ? __warn+0xb4/0xd0
[    0.000000]  [<ffffffff8104163a>] ? warn_slowpath_fmt+0x4a/0x50
[    0.000000]  [<ffffffff8102bef7>] ? __generic_processor_info+0x297/0x360
[    0.000000]  [<ffffffff8103de81>] ? acpi_register_lapic+0x3d/0x6c
[    0.000000]  [<ffffffff818a44d1>] ? acpi_parse_lapic+0x3e/0x43
[    0.000000]  [<ffffffff818bbcb8>] ? acpi_parse_entries_array+0xf4/0x152
[    0.000000]  [<ffffffff818bbe43>] ? acpi_table_parse_entries_array+0xa8/0xc6
[    0.000000]  [<ffffffff818a4e0c>] ? acpi_boot_init+0xde/0x494
[    0.000000]  [<ffffffff818a4493>] ? acpi_parse_x2apic+0x6c/0x6c
[    0.000000]  [<ffffffff818a4427>] ? acpi_parse_ioapic+0x74/0x74
[    0.000000]  [<ffffffff8189e40c>] ? setup_arch+0x8b2/0x924
[    0.000000]  [<ffffffff81898aa0>] ? start_kernel+0x52/0x3af
[    0.000000] ---[ end trace 0000000000000000 ]---


I reverted to the 4.4 kernel series to fix the problem. I'm glad I'm not the only one this happened to.
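
For anyone comparing kernels, here is a minimal sketch of how to check how many cores a booted kernel actually brought up. It assumes the standard sysfs files /sys/devices/system/cpu/online and /sys/devices/system/cpu/present; the helper names parse_cpu_list and read_cpu_set are made up for this example.

```python
from pathlib import Path
from typing import List

# Standard sysfs location for CPU masks (assumed to be mounted at /sys).
CPU_SYSFS = Path("/sys/devices/system/cpu")


def parse_cpu_list(text: str) -> List[int]:
    """Expand a kernel CPU list such as "0-3,5" into [0, 1, 2, 3, 5]."""
    cpus: List[int] = []
    for part in text.strip().split(","):
        if not part:
            continue  # empty file or trailing comma
        if "-" in part:
            first, last = part.split("-", 1)
            cpus.extend(range(int(first), int(last) + 1))
        else:
            cpus.append(int(part))
    return cpus


def read_cpu_set(name: str) -> List[int]:
    """Read one of the CPU list files, e.g. "online" or "present"."""
    return parse_cpu_list((CPU_SYSFS / name).read_text())
```

On an affected boot one would expect read_cpu_set("online") to return a single entry while the machine physically has more cores. Comparing the result under 4.4 and 4.9 shows whether the regression is present.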
donmartio
Apprentice


Joined: 11 Dec 2004
Posts: 233

Posted: Wed Mar 01, 2017 7:21 am

Hey axl,

Did you try the patch? The error you get seems slightly different, and you have an Intel CPU, so there may be another problem.
If the patch works for you too, I would post that to the kernel thread to put a little more weight behind this matter.

kind regards
DonMartio
_________________
Always code as if the person who ends up maintaining your code will be a violent psychopath who knows where you live.
axl
Guru


Joined: 11 Oct 2002
Posts: 537
Location: Romania

Posted: Fri Mar 03, 2017 12:30 am

donmartio wrote:
Hey axl,

Did you try the patch? The error you get seems slightly different, and you have an Intel CPU, so there may be another problem.
If the patch works for you too, I would post that to the kernel thread to put a little more weight behind this matter.

kind regards
DonMartio


Hey

No. Unfortunately I don't have remote access to that machine, which is what made the situation so difficult. I was asked to put it in a working configuration and leave. AS SOON AS POSSIBLE. :)

It's not a setting where you can just "try" patches. As I mentioned, I solved it at the time with a kernel from the previous long-term line, the 4.4 series.

I'm scheduled to see that machine again in April. I will try the patch then if 4.10/4.4 fails, or if I have extra time.

I wasn't able to replicate the bug on any of my own machines. The weird thing (for me) was that it's a ProLiant server. You would expect more bug reports.

When you say "may be another problem", did you check dmesg for that trace, right after the "ACPI: Local APIC address" line?

WARNING: CPU: 0 PID: 0 at arch/x86/kernel/apic/apic.c:2065 __generic_processor_info+0x297/0x360

This. This is where it drops to 1 core. Do you have the same error?

Or this:

Only 7 processors supported.Processor 8/0x2 and the rest are ignored.

That message makes no sense. The machine has 8, it says only 7 are supported, but in fact it runs with 1 out of 8, which means 7 are disabled. So clearly an error right there.
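
To make that comparison easier, here is a rough sketch that scans dmesg text for the two lines quoted above. The regular expressions are written against the exact wording in my log, so they are an assumption and may need adjusting for other kernel versions; scan_dmesg is just a name picked for this example.

```python
import re
from typing import Dict, Optional, Union

# Patterns modelled on the two messages quoted in this thread (4.9.8-gentoo).
WARN_RE = re.compile(r"WARNING: CPU: \d+ PID: \d+ at (\S+):(\d+) (\S+)")
LIMIT_RE = re.compile(
    r"Only (\d+) processors supported\.\s*"
    r"Processor (\d+)/(0x[0-9a-fA-F]+) and the rest are ignored\."
)


def scan_dmesg(text: str) -> Dict[str, Optional[Union[str, int]]]:
    """Report where the first WARNING fired and what CPU limit was logged."""
    result: Dict[str, Optional[Union[str, int]]] = {
        "warning_at": None,     # "file:line" of the first WARNING
        "warning_func": None,   # symbol+offset printed after it
        "supported": None,      # N from "Only N processors supported"
        "first_ignored": None,  # processor number that was rejected
    }
    warn = WARN_RE.search(text)
    if warn:
        result["warning_at"] = f"{warn.group(1)}:{warn.group(2)}"
        result["warning_func"] = warn.group(3)
    limit = LIMIT_RE.search(text)
    if limit:
        result["supported"] = int(limit.group(1))
        result["first_ignored"] = int(limit.group(2))
    return result
```

Feed it the saved output of dmesg from the affected boot. If both warning_at and supported come back as None, the machine is probably hitting a different problem than mine.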

In April I'll be forced to interact with that machine again. I'll post updates then.
donmartio
Apprentice


Joined: 11 Dec 2004
Posts: 233

Posted: Sat Mar 04, 2017 11:05 am

If I get this right, your problem seems different but related.
I didn't get the warning you get. Just one core to work with.

My problem was that the detection of the CPUs relies on the order in which the logical CPUs are found.
As it seems, it finds the fpus first, marks them as disabled, and by the time it gets to the real cores it has already found
4 disabled CPUs. This leads to one 'physical' CPU with 4 real cores and 4 disabled virtual cores, which leaves just one.

You get this warning when it tries to allocate a logical CPU ID and has already found 8 logical CPU IDs.
My patch jumps in just before it gets to this allocation.

The whole process of allocating logical CPU IDs and mapping them to the real cores seems pretty error-prone to me.
I don't know how those virtual processors are detected, but relying on the order is always risky.
What makes me wonder is that this kind of error is apparently rare.
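
To illustrate why relying on the order is risky, here is a toy model in Python. It is not the kernel's code and does not reproduce its exact logic: the entry format, the limit of 5 IDs, and both function names are assumptions chosen only to show the effect.

```python
from typing import List, Tuple

# Hypothetical firmware table entry: (apic_id, enabled).
Entry = Tuple[int, bool]


def allocate_in_table_order(entries: List[Entry], max_ids: int) -> List[int]:
    """Hand out logical IDs strictly in table order, enabled or not.

    Returns the APIC IDs of the enabled entries that received an ID.
    """
    usable: List[int] = []
    used = 0
    for apic_id, enabled in entries:
        if used >= max_ids:
            break  # out of logical IDs: the rest are ignored
        used += 1  # a disabled entry still consumes an ID here
        if enabled:
            usable.append(apic_id)
    return usable


def allocate_enabled_only(entries: List[Entry], max_ids: int) -> List[int]:
    """Order-independent variant: only enabled entries consume an ID."""
    usable: List[int] = []
    for apic_id, enabled in entries:
        if enabled and len(usable) < max_ids:
            usable.append(apic_id)
    return usable


# Four disabled entries listed before four enabled ones, and the reverse.
disabled_first: List[Entry] = [
    (8, False), (9, False), (10, False), (11, False),
    (0, True), (1, True), (2, True), (3, True),
]
enabled_first: List[Entry] = disabled_first[4:] + disabled_first[:4]
```

With max_ids set to 5, allocate_in_table_order(disabled_first, 5) leaves a single usable CPU, while the same entries in enabled_first order keep all four. allocate_enabled_only gives four in both cases, which is the kind of behaviour you want from an allocation that does not depend on table order.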