  1. Dec 29, 2021
  2. Dec 28, 2021
  3. Dec 23, 2021
  4. Dec 18, 2021
    • drm/amdgpu: add support for IP discovery gc_info table v2 · 5e713c6a
      Alex Deucher authored
      Used on gfx9-based systems. Fixes incorrect CU counts reported
      in the kernel log.
      
      Bug: https://gitlab.freedesktop.org/drm/amd/-/issues/1833
      
      
      Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
    • drm/amdgpu: When the VCN(1.0) block is suspended, powergating is explicitly enabled · b7865173
      chen gong authored
      Play a video on a raven (or PCO, raven2) platform, then run the S3
      test. On resume, the following error is reported:
      
      amdgpu 0000:02:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring
      vcn_dec test failed (-110)
      [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block
      <vcn_v1_0> failed -110
      amdgpu 0000:02:00.0: amdgpu: amdgpu_device_ip_resume failed (-110).
      PM: dpm_run_callback(): pci_pm_resume+0x0/0x90 returns -110
      
      [why]
      When playing the video: the power state flag of the VCN block is set to
      POWER_STATE_ON.

      When suspending: the power state flag of the VCN block is not changed;
      it is still POWER_STATE_ON.

      When resuming: the power gate of the VCN block needs to be opened and
      the power state flag set to POWER_STATE_ON. But at this point the flag
      is already POWER_STATE_ON, so the power state flag check added by the
      "8f2cdef drm/amd/pm: avoid duplicate powergate/ungate setting" patch
      makes amdgpu_dpm_set_powergating_by_smu return early.
      As a result, the power gate is not opened, causing the subsequent ring
      test to fail.
      
      [how]
      In the suspend function of the vcn block, explicitly change the power
      state flag of the vcn block to POWER_STATE_OFF.
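
The [why]/[how] reasoning above can be modeled with a small user-space sketch. This is only an illustration of the stale-flag problem, not the driver code: `struct vcn_block`, `set_power_state`, `suspend`, `resume`, and `powered_after_cycle` are hypothetical names, and the "skip if the cached flag already matches" check stands in for the duplicate powergate/ungate check mentioned in the commit message.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model; none of these names come from the amdgpu driver. */
enum power_state { POWER_STATE_OFF, POWER_STATE_ON };

struct vcn_block {
    enum power_state cached_state; /* software power state flag */
    bool hw_powered;               /* what the hardware is actually doing */
};

/* Skips the request when the cached flag already matches the wanted state. */
static void set_power_state(struct vcn_block *vcn, enum power_state want)
{
    if (vcn->cached_state == want)
        return; /* early return: hardware is left untouched */
    vcn->hw_powered = (want == POWER_STATE_ON);
    vcn->cached_state = want;
}

/* S3 suspend: the hardware loses power whatever the flag says. */
static void suspend(struct vcn_block *vcn, bool apply_fix)
{
    if (apply_fix)
        set_power_state(vcn, POWER_STATE_OFF); /* keep the flag in sync */
    vcn->hw_powered = false;
}

static void resume(struct vcn_block *vcn)
{
    set_power_state(vcn, POWER_STATE_ON);
}

/* Returns whether the block is really powered after play, suspend, resume. */
bool powered_after_cycle(bool apply_fix)
{
    struct vcn_block vcn = { POWER_STATE_OFF, false };

    set_power_state(&vcn, POWER_STATE_ON); /* video playback */
    suspend(&vcn, apply_fix);
    resume(&vcn);
    return vcn.hw_powered;
}

/* Self-check: stale flag leaves the block unpowered; the fix restores it. */
void check_cycle(void)
{
    assert(!powered_after_cycle(false));
    assert(powered_after_cycle(true));
}
```

Calling `powered_after_cycle(false)` models the failing case: the flag is still ON at resume, the request is skipped, and the block stays unpowered. With `apply_fix` set, the flag is reset during suspend, so the resume request is no longer treated as a duplicate.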
      
      BugLink: https://gitlab.freedesktop.org/drm/amd/-/issues/1828
      
      
      Signed-off-by: chen gong <curry.gong@amd.com>
      Reviewed-by: Evan Quan <evan.quan@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
    • drm/amd/pm: Fix xgmi link control on aldebaran · 19e66d51
      Lijo Lazar authored
      
      
      Fix the message argument.
      	0: Allow power down
      	1: Disallow power down
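
The argument encoding listed above is inverted relative to a plain boolean "allow" flag, which makes it easy to pass the wrong value. A minimal sketch of a mapping helper, assuming only the two values from the commit message; the enum names and `xgmi_power_down_arg` are hypothetical and do not come from the driver:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Argument values as listed in the commit message; names are hypothetical. */
enum {
    XGMI_ALLOW_POWER_DOWN    = 0,
    XGMI_DISALLOW_POWER_DOWN = 1,
};

/* Maps "allow power down?" to the message argument (true maps to 0). */
uint32_t xgmi_power_down_arg(bool allow)
{
    return allow ? XGMI_ALLOW_POWER_DOWN : XGMI_DISALLOW_POWER_DOWN;
}

/* Self-check that the inverted encoding is respected. */
void check_xgmi_arg(void)
{
    assert(xgmi_power_down_arg(true) == 0);
    assert(xgmi_power_down_arg(false) == 1);
}
```

Passing the boolean straight through as the argument would send 1 for "allow", the opposite of the encoding listed above.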
      
      Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
      Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    • drm/amdgpu: introduce new amdgpu_fence object to indicate the job embedded fence · bf67014d
      Huang Rui authored
      
      
      The job embedded fence doesn't have its flags initialized at
      dma_fence_init(). As a result, the amdgpu_fence_get_timeline_name
      callback takes the wrong path and triggers a NULL pointer panic
      once the trace event is enabled. So introduce a new amdgpu_fence
      object to indicate the job embedded fence.
      
      [  156.131790] BUG: kernel NULL pointer dereference, address: 00000000000002a0
      [  156.131804] #PF: supervisor read access in kernel mode
      [  156.131811] #PF: error_code(0x0000) - not-present page
      [  156.131817] PGD 0 P4D 0
      [  156.131824] Oops: 0000 [#1] PREEMPT SMP PTI
      [  156.131832] CPU: 6 PID: 1404 Comm: sdma0 Tainted: G           OE     5.16.0-rc1-custom #1
      [  156.131842] Hardware name: Gigabyte Technology Co., Ltd. Z170XP-SLI/Z170XP-SLI-CF, BIOS F20 11/04/2016
      [  156.131848] RIP: 0010:strlen+0x0/0x20
      [  156.131859] Code: 89 c0 c3 0f 1f 80 00 00 00 00 48 01 fe eb 0f 0f b6 07 38 d0 74 10 48 83 c7 01 84 c0 74 05 48 39 f7 75 ec 31 c0 c3 48 89 f8 c3 <80> 3f 00 74 10 48 89 f8 48 83 c0 01 80 38 00 75 f7 48 29 f8 c3 31
      [  156.131872] RSP: 0018:ffff9bd0018dbcf8 EFLAGS: 00010206
      [  156.131880] RAX: 00000000000002a0 RBX: ffff8d0305ef01b0 RCX: 000000000000000b
      [  156.131888] RDX: ffff8d03772ab924 RSI: ffff8d0305ef01b0 RDI: 00000000000002a0
      [  156.131895] RBP: ffff9bd0018dbd60 R08: ffff8d03002094d0 R09: 0000000000000000
      [  156.131901] R10: 000000000000005e R11: 0000000000000065 R12: ffff8d03002094d0
      [  156.131907] R13: 000000000000001f R14: 0000000000070018 R15: 0000000000000007
      [  156.131914] FS:  0000000000000000(0000) GS:ffff8d062ed80000(0000) knlGS:0000000000000000
      [  156.131923] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [  156.131929] CR2: 00000000000002a0 CR3: 000000001120a005 CR4: 00000000003706e0
      [  156.131937] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [  156.131942] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      [  156.131949] Call Trace:
      [  156.131953]  <TASK>
      [  156.131957]  ? trace_event_raw_event_dma_fence+0xcc/0x200
      [  156.131973]  ? ring_buffer_unlock_commit+0x23/0x130
      [  156.131982]  dma_fence_init+0x92/0xb0
      [  156.131993]  amdgpu_fence_emit+0x10d/0x2b0 [amdgpu]
      [  156.132302]  amdgpu_ib_schedule+0x2f9/0x580 [amdgpu]
      [  156.132586]  amdgpu_job_run+0xed/0x220 [amdgpu]
      
      v2: fix mismatch warning between the prototype and function name (Ray, kernel test robot)
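
The underlying pattern is that a callback recovering its owner from a fence pointer must know which kind of container the fence lives in. The sketch below shows one general way to keep a standalone fence and a job embedded fence apart, by giving each its own callback table. It illustrates the pattern only and is not the patch's code: every type and function name here is a hypothetical stand-in, and `container_of` is redefined locally for user space.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins; not the kernel's dma_fence or amdgpu types. */
struct ring { const char *name; };

struct fence;
struct fence_ops { const char *(*get_timeline_name)(struct fence *f); };
struct fence { const struct fence_ops *ops; };

#define container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

/* Standalone fence: the wrapper object itself carries the ring pointer. */
struct standalone_fence { struct fence base; struct ring *ring; };

/* Job with an embedded fence: the ring hangs off the job instead. */
struct job { struct ring *ring; struct fence hw_fence; };

static const char *standalone_timeline_name(struct fence *f)
{
    return container_of(f, struct standalone_fence, base)->ring->name;
}

static const char *job_timeline_name(struct fence *f)
{
    return container_of(f, struct job, hw_fence)->ring->name;
}

const struct fence_ops standalone_ops = { standalone_timeline_name };
const struct fence_ops job_ops = { job_timeline_name };

/* Dispatches through the ops table, so each fence uses the right container. */
const char *timeline_name(struct fence *f)
{
    return f->ops->get_timeline_name(f);
}

/* Self-check: both kinds of fence resolve to the same ring name. */
void check_timeline_names(void)
{
    struct ring r = { "sdma0" };
    struct standalone_fence sf = { { &standalone_ops }, &r };
    struct job j = { &r, { &job_ops } };

    assert(strcmp(timeline_name(&sf.base), "sdma0") == 0);
    assert(strcmp(timeline_name(&j.hw_fence), "sdma0") == 0);
}
```

If the embedded fence were handed to `standalone_timeline_name`, `container_of` would compute a pointer into the wrong structure, which is the kind of mistake that ends in a bad string pointer like the one passed to strlen in the trace above.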
      
      Signed-off-by: Huang Rui <ray.huang@amd.com>
      Reviewed-by: Christian König <christian.koenig@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
  5. Dec 17, 2021
  6. Dec 16, 2021
  7. Dec 15, 2021
  8. Dec 14, 2021
  9. Dec 13, 2021
  10. Dec 12, 2021