u/Far_Trick7562

r/pop_os

AMD GPU Driver and External Monitor causing instability

I switched from Linux Mint to Pop!_OS about two weeks ago. It has largely been fine, but today the system drivers broke and panicked the system. I believe I have narrowed it down to a conflict in the AMD GPU driver's handling of external monitors, which tripped and crashed COSMIC and Wayland and then locked the laptop on the login screen. The crash did not recur after I rebooted, but I am now getting intermittent graphical corruption when hovering over the taskbar (large red/yellow/purple blocks, only on the taskbar). Is anyone else having this issue? How exactly would I fix it, other than by disabling the AMD onboard video, which I would rather not do?

journalctl/dmesg output:

[ 146.651343] workqueue: dm_handle_vmin_vmax_update [amdgpu] hogged CPU for >10000us 5 times, consider switching to WQ_UNBOUND

[ 146.838427] ------------[ cut here ]------------

[ 146.838439] WARNING: CPU: 1 PID: 3751 at drivers/gpu/drm/amd/amdgpu/../display/dc/resource/dcn20/dcn20_resource.c:1735 dcn20_find_secondary_pipe+0x177/0x3f0 [amdgpu]

[ 146.839590] Modules linked in: ccm snd_seq_dummy rfcomm snd_hrtimer cmac algif_hash algif_skcipher af_alg qrtr nvidia_uvm(OE) bnep zram 842_decompress 842_compress lz4hc_compress lz4_compress binfmt_misc snd_sof_amd_acp70 snd_sof_amd_acp63 snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_sof intel_rapl_msr snd_sof_utils amd_atl snd_hda_codec_alc269 snd_pci_ps snd_hda_scodec_component intel_rapl_common snd_hda_codec_realtek_lib snd_soc_acpi_amd_match snd_hda_codec_generic snd_amd_sdw_acpi soundwire_amd soundwire_generic_allocation snd_hda_codec_nvhdmi iwlmvm snd_hda_codec_atihdmi soundwire_bus snd_hda_codec_hdmi snd_soc_sdca snd_hda_intel edac_mce_amd snd_soc_core snd_hda_codec snd_compress ac97_bus mac80211 snd_hda_core snd_pcm_dmaengine snd_intel_dspcfg snd_rpl_pci_acp6x nvidia_drm(OE) snd_intel_sdw_acpi snd_hwdep snd_acp_pci kvm_amd snd_amd_acpi_mach snd_acp_legacy_common snd_seq_midi nvidia_modeset(OE) libarc4 snd_pci_acp6x snd_seq_midi_event

[ 146.839833] snd_rawmidi btusb snd_pcm btmtk kvm iwlwifi snd_seq snd_pci_acp5x btrtl btbcm ee1004 btintel snd_rn_pci_acp3x snd_seq_device irqbypass nvidia(OE) wmi_bmof rapl asus_nb_wmi nls_iso8859_1 snd_timer i2c_piix4 snd_acp_config snd_soc_acpi cfg80211 bluetooth k10temp ccp i2c_smbus snd snd_pci_acp3x input_leds soundcore asus_wireless joydev mac_hid sch_fq_codel kyber_iosched msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 linear system76_io(OE) system76_acpi(OE) hid_asus asus_wmi platform_profile sparse_keymap amdgpu hid_multitouch amdxcp drm_panel_backlight_quirks gpu_sched drm_buddy drm_ttm_helper ttm drm_exec ucsi_acpi i2c_algo_bit nvme typec_ucsi drm_suballoc_helper drm_display_helper nvme_core r8169 hid_generic typec ahci cec nvme_keyring polyval_clmulni ghash_clmulni_intel usbhid libahci thunderbolt nvme_auth i2c_hid_acpi rc_core realtek hkdf i2c_hid video hid

[ 146.840191] wmi aesni_intel

[ 146.840208] CPU: 1 UID: 1000 PID: 3751 Comm: surface-HDMI-A- Tainted: G OE 6.18.7-76061807-generic #202601231045~1777506123~24.04~a891078 PREEMPT(voluntary)

[ 146.840222] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE

[ 146.840227] Hardware name: ASUSTeK COMPUTER INC. ROG Zephyrus G15 GA502IV_GA502IV/GA502IV, BIOS GA502IV.301 09/28/2023

[ 146.840234] RIP: 0010:dcn20_find_secondary_pipe+0x177/0x3f0 [amdgpu]

[ 146.841371] Code: bc 38 30 0b 00 00 00 74 ae 83 eb 01 49 81 ef 00 10 00 00 83 fb ff 75 d9 49 8b 86 78 06 00 00 44 8b a0 a8 02 00 00 41 83 ec 01 <0f> 0b 45 85 e4 78 0e 45 89 e6 49 c1 e6 0c 4d 01 ee eb 30 0f 0b 31

[ 146.841381] RSP: 0018:ffffd36250b8b848 EFLAGS: 00010206

[ 146.841393] RAX: ffff8a20df62c800 RBX: 00000000ffffffff RCX: ffff8a218cd812d8

[ 146.841401] RDX: ffff8a20df62c800 RSI: ffff8a218cd802d8 RDI: ffff8a20dec00000

[ 146.841409] RBP: ffffd36250b8b880 R08: ffff8a218cd812d8 R09: 00000000ffffffff

[ 146.841416] R10: ffffd36250b8b950 R11: ffff8a218cd802d8 R12: 0000000000000003

[ 146.841423] R13: ffff8a218cd802d8 R14: ffff8a20dec00000 R15: fffffffffffff000

[ 146.841429] FS: 000072ceb71fd6c0(0000) GS:ffff8a2643ace000(0000) knlGS:0000000000000000

[ 146.841436] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033

[ 146.841443] CR2: 0000788edddedfe0 CR3: 000000011fd01000 CR4: 0000000000350ef0

[ 146.841450] Call Trace:

[ 146.841455] <TASK>

[ 146.841468] dcn21_fast_validate_bw+0x42a/0x730 [amdgpu]

[ 146.842625] dcn21_validate_bandwidth_fp+0xcf/0x350 [amdgpu]

[ 146.843792] ? srso_return_thunk+0x5/0x5f

[ 146.843814] dcn21_validate_bandwidth+0x61/0xb0 [amdgpu]

[ 146.844908] ? dcn21_validate_bandwidth+0x61/0xb0 [amdgpu]

[ 146.845996] dc_validate_global_state.part.0+0x388/0x500 [amdgpu]

[ 146.847025] dc_validate_global_state+0x5c/0x80 [amdgpu]

[ 146.848038] amdgpu_dm_atomic_check+0x17ba/0x1820 [amdgpu]

[ 146.849120] ? drm_file_get_master+0x40/0x90

[ 146.849135] ? srso_return_thunk+0x5/0x5f

[ 146.849159] drm_atomic_check_only+0x1a4/0x400

[ 146.849174] drm_mode_atomic_ioctl+0x73e/0x870

[ 146.849195] ? __pfx_drm_mode_atomic_ioctl+0x10/0x10

[ 146.849205] drm_ioctl_kernel+0xb7/0x110

[ 146.849216] ? srso_return_thunk+0x5/0x5f

[ 146.849229] drm_ioctl+0x2ec/0x5b0

[ 146.849239] ? __pfx_drm_mode_atomic_ioctl+0x10/0x10

[ 146.849264] amdgpu_drm_ioctl+0x4b/0xa0 [amdgpu]

[ 146.850082] __x64_sys_ioctl+0xa5/0x100

[ 146.850098] x64_sys_call+0x1144/0x26a0

[ 146.850108] do_syscall_64+0x80/0x970

[ 146.850117] ? arch_exit_to_user_mode_prepare.isra.0+0xd/0xe0

[ 146.850126] ? srso_return_thunk+0x5/0x5f

[ 146.850136] ? do_syscall_64+0xb6/0x970

[ 146.850143] ? srso_return_thunk+0x5/0x5f

[ 146.850151] ? srso_return_thunk+0x5/0x5f

[ 146.850160] ? arch_exit_to_user_mode_prepare.isra.0+0xd/0xe0

[ 146.850168] ? srso_return_thunk+0x5/0x5f

[ 146.850177] ? do_syscall_64+0xb6/0x970

[ 146.850184] ? srso_return_thunk+0x5/0x5f

[ 146.850195] entry_SYSCALL_64_after_hwframe+0x76/0x7e

[ 146.850204] RIP: 0033:0x6374c0131420

[ 146.850214] Code: 89 45 a4 4c 89 45 a8 4c 89 6d b0 4c 89 65 b8 4c 89 75 c0 0f 57 c0 0f 11 45 c8 48 8d 55 a0 b8 10 00 00 00 be bc 64 38 c0 0f 05 <49> 89 c7 48 83 3b 00 74 09 4c 89 c7 ff 15 2e f5 c6 00 48 83 7b 18

[ 146.850221] RSP: 002b:000072ceb71f6a50 EFLAGS: 00000297 ORIG_RAX: 0000000000000010

[ 146.850232] RAX: ffffffffffffffda RBX: 000072ceb71f6cf0 RCX: 00006374c0131420

[ 146.850238] RDX: 000072ceb71f6a50 RSI: 00000000c03864bc RDI: 000000000000003e

[ 146.850244] RBP: 000072ceb71f6ab0 R08: 000072ce940ef450 R09: 0000000000000001

[ 146.850250] R10: 000072ceb71f69c0 R11: 0000000000000297 R12: 000072ce941f0cb0

[ 146.850255] R13: 000072ce941d07f0 R14: 000072ce94153dd0 R15: 00006374dbf1aa20

[ 146.850273] </TASK>

[ 146.850279] ---[ end trace 0000000000000000 ]---
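
In case it helps anyone searching for the same bug, here is a rough Python sketch for pulling the warning site and the reliable call-trace frames out of a log like the one above. It is not part of the original output: the function name `summarize_warning`, the two regexes, and the short `SAMPLE` excerpt are my own assumptions about the log layout, so treat it as a starting point rather than a proper parser.

```python
import re

# Assumed layout of the kernel's WARNING header, e.g.
# "WARNING: CPU: 1 PID: 3751 at path/file.c:1735 func+0x177/0x3f0 [amdgpu]"
WARNING_RE = re.compile(
    r"WARNING: CPU: \d+ PID: \d+ at (?P<path>\S+):(?P<line>\d+) "
    r"(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(?P<module>\w+)\])?"
)

# One call-trace frame. A leading "? " marks a frame the unwinder is unsure of.
FRAME_RE = re.compile(
    r"\]\s+(?P<unsure>\? )?(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)


def summarize_warning(log_text):
    """Return (warning fields or None, list of reliable call-trace functions)."""
    warning = None
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        match = WARNING_RE.search(line)
        if match:
            warning = match.groupdict()
            continue
        if "<TASK>" in line:
            in_trace = True
            continue
        if "</TASK>" in line:
            in_trace = False
            continue
        if in_trace:
            frame = FRAME_RE.search(line)
            # Skip "?" frames, which may be stale stack contents.
            if frame and not frame.group("unsure"):
                frames.append(frame.group("func"))
    return warning, frames


# A few lines copied from the log above, used only as sample input.
SAMPLE = """\
[ 146.838439] WARNING: CPU: 1 PID: 3751 at drivers/gpu/drm/amd/amdgpu/../display/dc/resource/dcn20/dcn20_resource.c:1735 dcn20_find_secondary_pipe+0x177/0x3f0 [amdgpu]
[ 146.841450] Call Trace:
[ 146.841455] <TASK>
[ 146.841468] dcn21_fast_validate_bw+0x42a/0x730 [amdgpu]
[ 146.843792] ? srso_return_thunk+0x5/0x5f
[ 146.843814] dcn21_validate_bandwidth+0x61/0xb0 [amdgpu]
[ 146.850273] </TASK>
"""

if __name__ == "__main__":
    warning, frames = summarize_warning(SAMPLE)
    print(warning)
    print(frames)
```

The function name and source line it reports (`dcn20_find_secondary_pipe` at `dcn20_resource.c:1735` here) are probably the most useful search terms when looking for existing bug reports.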

Specs:
ASUS ROG Zephyrus G15 GA502IV Laptop
CPU: AMD Ryzen 7 4800HS with Radeon Graphics
Video: NVIDIA RTX 2060 Max-Q
RAM: 24 GB DDR4-3200, swap 19.0 GB

Output of uname -a:
Linux pop-os 6.18.7-76061807-generic #202601231045~1777506123~24.04~a891078 SMP PREEMPT_DYNAMIC Thu A x86_64 x86_64 x86_64 GNU/Linux

u/Far_Trick7562 — 8 days ago