Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[SOLVED] Kernel 5.3.11 hangs intel GPU
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Sat Nov 16, 2019 7:59 am    Post subject: [SOLVED] Kernel 5.3.11 hangs intel GPU Reply with quote

I recently upgraded from kernel 5.3.8 to 5.3.11, so it's just a minor upgrade.

Curiously, when I launch libreoffice, the system freezes for few seconds and I get this in dmesg output:
Code:
i915 0000:00:02.0: Resetting rcs0 for hang on rcs0

This is 100% reproducible

Switching back to 5.3.8 the problem disappears

Here my system specs:
Code:
heavensdoor ~ # inxi -Fzm
System:    Host: heavensdoor Kernel: 5.3.8-gentoo x86_64 bits: 64 Desktop: Xfce 4.14.1 Distro: Gentoo Base System release 2.6
Machine:   Type: Laptop System: Notebook product: P9XXRC v: N/A serial: N/A
           Mobo: Notebook model: P9XXRC serial: N/A UEFI: INSYDE v: 1.07.04 date: 05/03/2019
Battery:   ID-1: BAT0 charge: 59.2 Wh condition: 59.2/56.2 Wh (105%)
Memory:    RAM: total: 31.08 GiB used: 803.1 MiB (2.5%)
           Array-1: capacity: 32 GiB slots: 2 EC: None
           Device-1: ChannelA-DIMM0 size: 16 GiB speed: 3000 MT/s
           Device-2: ChannelB-DIMM0 size: 16 GiB speed: 3000 MT/s
CPU:       Topology: 6-Core model: Intel Core i7-9750H bits: 64 type: MT MCP L2 cache: 12.0 MiB
           Speed: 800 MHz min/max: 800/4500 MHz Core speeds (MHz): 1: 800 2: 800 3: 801 4: 800 5: 801 6: 800 7: 800 8: 800
           9: 800 10: 800 11: 801 12: 801
Graphics:  Device-1: Intel UHD Graphics 630 driver: i915 v: kernel
           Device-2: NVIDIA TU116M [GeForce GTX 1660 Mobile] driver: N/A
           Display: server: X.Org 1.20.5 driver: intel unloaded: modesetting,vesa resolution: 1920x1080~144Hz
           OpenGL: renderer: Mesa DRI Intel UHD Graphics 630 (Coffeelake 3x8 GT2) v: 4.5 Mesa 19.1.8
Audio:     Device-1: Intel Cannon Lake PCH cAVS driver: snd_hda_intel
           Device-2: NVIDIA driver: snd_hda_intel
           Sound Server: ALSA v: k5.3.8-gentoo
Network:   Device-1: Qualcomm Atheros AR9462 Wireless Network Adapter driver: ath9k
           IF: wlp8s0 state: up mac: <filter>
           Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet driver: r8169
           IF: enp9s0 state: down mac: <filter>
           IF-ID-1: sit0 state: down mac: <filter>
           IF-ID-2: tunl0 state: down mac: <filter>
Drives:    Local Storage: total: 1.38 TiB used: 173.88 GiB (12.3%)
           ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 970 PRO 512GB size: 476.94 GiB
           ID-2: /dev/sda vendor: Samsung model: SSD 860 EVO 1TB size: 931.51 GiB
Partition: ID-1: / size: 100.00 GiB used: 7.93 GiB (7.9%) fs: btrfs dev: /dev/dm-0
           ID-2: /home size: 276.45 GiB used: 4.57 GiB (1.7%) fs: btrfs dev: /dev/dm-1
Sensors:   System Temperatures: cpu: 58.0 C mobo: N/A
           Fan Speeds (RPM): N/A
Info:      Processes: 296 Uptime: 44m Shell: bash inxi: 3.0.36


Anyone has a clue about whant happened in 5.3.11?

[edit]
The problem is not present also in 5.3.10, so it must have appeared between these two versions...
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne


Last edited by Cazzantonio on Tue Nov 26, 2019 5:30 pm; edited 1 time in total
Back to top
View user's profile Send private message
Perfect Gentleman
l33t
l33t


Joined: 18 May 2014
Posts: 921

PostPosted: Sat Nov 16, 2019 8:40 am    Post subject: Reply with quote

Haswell, xf86-video-intel. No such problem.
Back to top
View user's profile Send private message
fturco
l33t
l33t


Joined: 08 Dec 2010
Posts: 898
Location: Italy

PostPosted: Sat Nov 16, 2019 5:43 pm    Post subject: Reply with quote

@Cazzantonio: you may try to use git bisect in order to find the first bad commit. See this Gentoo wiki article for details.
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 14957

PostPosted: Sat Nov 16, 2019 6:08 pm    Post subject: Reply with quote

Can you bisect between v5.3.10 and v5.3.11 to find the specific offending patch? Based on the error message, it's probably a patch to the i915 graphics code.
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Sun Nov 17, 2019 10:03 pm    Post subject: Reply with quote

Thanks for the tip. It will take a while but I'll try to find the incriminated patch.
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
Perfect Gentleman
l33t
l33t


Joined: 18 May 2014
Posts: 921

PostPosted: Mon Nov 18, 2019 2:55 am    Post subject: Reply with quote

Maybe this bug - https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00260.html
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Wed Nov 20, 2019 3:16 pm    Post subject: Reply with quote

Found!

Code:
77fc9100fc5768ca01ca2dd2cc5a515a4723a58a is the first bad commit
commit 77fc9100fc5768ca01ca2dd2cc5a515a4723a58a
Author: Jon Bloomfield <jon.bloomfield@intel.com>
Date:   Thu Sep 27 10:23:17 2018 -0700

    drm/i915/cmdparser: Use explicit goto for error paths
   
    commit 0546a29cd884fb8184731c79ab008927ca8859d0 upstream.
   
    In the next patch we will be adding a second valid
    termination condition which will require a small
    amount of refactoring to share logic with the BB_END
    case.
   
    Refactor all error conditions to jump to a dedicated
    exit path, with 'break' reserved only for a successful
    parse.
   
    Cc: Tony Luck <tony.luck@intel.com>
    Cc: Dave Airlie <airlied@redhat.com>
    Cc: Takashi Iwai <tiwai@suse.de>
    Cc: Tyler Hicks <tyhicks@canonical.com>
    Signed-off-by: Jon Bloomfield <jon.bloomfield@intel.com>
    Reviewed-by: Chris Wilson <chris.p.wilson@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

 drivers/gpu/drm/i915/i915_cmd_parser.c | 25 +++++++++++++------------
 1 file changed, 13 insertions(+), 12 deletions(-)


What do I have to do now? I signaled this both on gentoo and linux bugzilla...
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne


Last edited by Cazzantonio on Wed Nov 20, 2019 3:30 pm; edited 1 time in total
Back to top
View user's profile Send private message
fedeliallalinea
Bodhisattva
Bodhisattva


Joined: 08 Mar 2003
Posts: 23504
Location: here

PostPosted: Wed Nov 20, 2019 3:29 pm    Post subject: Reply with quote

Cazzantonio you aren't in Italian forum :P

it wrote:
Che faccio? Lo segnalo sul bugzilla di gentoo o di linux?

en wrote:
What I do? I report this on the gentoo or linux bugzilla?

In think that this problem is already reported in kernel bugzilla, in my opinion you can open new gentoo bug because can be useful to someone.
_________________
Questions are guaranteed in life; Answers aren't.
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Wed Nov 20, 2019 3:32 pm    Post subject: Reply with quote

fedeliallalinea wrote:
Cazzantonio you aren't in Italian forum :P


Yes sorry I realized a minute later :D

I did already sent the bug to the kernel bugzilla, now I updated with the git bisect log.
I also opened a bug on gentoo bugzilla.

In the meantime I have to stay clear of 5.3.11 and future updates...
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
fturco
l33t
l33t


Joined: 08 Dec 2010
Posts: 898
Location: Italy

PostPosted: Wed Nov 20, 2019 4:10 pm    Post subject: Reply with quote

@Cazzantonio: please post the URLs of those two bug reports you filed, so others with your same problem can benefit, too.
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Thu Nov 21, 2019 7:25 pm    Post subject: Reply with quote

https://bugzilla.kernel.org/show_bug.cgi?id=205545

https://bugs.gentoo.org/700782

The problem is still present in 5.3.12
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
tholin
Apprentice
Apprentice


Joined: 04 Oct 2008
Posts: 181

PostPosted: Thu Nov 21, 2019 7:48 pm    Post subject: Reply with quote

Cazzantonio wrote:
What do I have to do now? I signaled this both on gentoo and linux bugzilla...

You need to send an email jon.bloomfield@intel.com and CC intel-gfx@lists.freedesktop.org and describe your problem. Kernel developers almost never check the kernel bugzilla.
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Tue Nov 26, 2019 5:29 pm    Post subject: Reply with quote

The problem is still present in 5.3.13 but disappeared in 5.4.0, so I guess someone noticed and committed a patch.
I guess this closes the case :-)
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
mbar
Veteran
Veteran


Joined: 19 Jan 2005
Posts: 1983
Location: Poland

PostPosted: Wed Nov 27, 2019 11:06 am    Post subject: Reply with quote

5.4.0 still doesn't work for me (Skylake laptop), causes 100% power on Intel GPU.

I also found this: https://bugs.freedesktop.org/show_bug.cgi?id=111837 (please see last comment), but no solution.
How do I revert that commit?
Back to top
View user's profile Send private message
Cazzantonio
Bodhisattva
Bodhisattva


Joined: 20 Mar 2004
Posts: 4498
Location: Somewere around the world

PostPosted: Wed Nov 27, 2019 9:00 pm    Post subject: Reply with quote

I discovered that 5.4.0 now randomly hangs the whole notebook without any warning. No logs, just freeze.

I got a bit sick of debugging this mess, so I think I'll stick with 5.3.10 for a while...
_________________
Any man's death diminishes me, because I am involved in mankind, and therefore never send to know for whom the bell tolls; it tolls for thee
-John Donne
Back to top
View user's profile Send private message
mbar
Veteran
Veteran


Joined: 19 Jan 2005
Posts: 1983
Location: Poland

PostPosted: Fri Nov 29, 2019 6:02 am    Post subject: Reply with quote

https://bugs.freedesktop.org/show_bug.cgi?id=112315
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum