PDA

Archiv verlassen und diese Seite im Standarddesign anzeigen : Fehler der Atikmdag.sys - Lösung!?


Schlammsau
2008-09-15, 06:55:47
Introduction

One of the most common stability problems in graphics is when the system appears completely "frozen" or "hung" while processing an end-user command or operation. Users generally wait a few seconds and then reboot the system by pressing the Power button. Usually the graphics processing unit (GPU) is "busy" processing intensive graphical operations, typically during gameplay. This results in nothing being updated on the screen, thus appearing to the user that the system is frozen.

This paper briefly describes the timeout detection and recovery (TDR) process in Windows Vista. It also documents the registry controls so developers can easily debug problems.


Timeout Detection and Recovery

Windows Vista attempts to detect these problematic hang situations and recover a responsive desktop dynamically. In this process, the Microsoft Windows Display Driver Model (WDDM) driver is reinitialized and the GPU is reset. No reboot is necessary, which greatly enhances the user experience. The only visible artifact from the hang detection to the recovery is a screen flicker, which results from resetting some portions of the graphics stack, causing a screen redraw. Some older Microsoft DirectX applications may render to a black screen at the end of this recovery. The end user would have to restart these applications.


The following is a brief overview of the TDR process:

1.
Timeout detection: The Video Scheduler component of the Windows Vista graphics stack detects that the GPU is taking more than the permitted quantum time to execute the particular task and tries to preempt this particular task. The preempt operation has a "wait" timeout—the actual "TDR timeout." This step is thus the "timeout detection" phase of the process. The default timeout period in Windows Vista is 2 seconds. If the GPU cannot complete or preempt the current task within the TDR timeout, then the GPU is diagnosed as hung.

2.
Preparation for recovery: The operating system informs the WDDM driver that a timeout has been detected and it must reset the GPU. The driver is told to stop accessing memory and should not access hardware after this time. The operating system and the WDDM driver collect hardware and other state information that could be useful for post-mortem diagnosis.

3.
Desktop recovery: The operating system resets the appropriate state of the graphics stack. The Video Memory Manager component of the graphics stack purges all allocations from video memory. The WDDM driver resets the GPU hardware state. The graphics stack takes the final actions and restores the desktop to the responsive state. As mentioned earlier, some older DirectX applications may now render just black, and the user may be required to restart these applications. Well-written DirectX 9Ex and DirectX 10 applications that handle "Device Remove" continue to work correctly. The application must release and then recreate its Microsoft Direct3D device and all of its objects. DirectX application programmers can find more information in the Windows SDK.


Error Messaging

Throughout the process of GPU hang detection and recovery, the desktop is unresponsive and thus unavailable to the user. In the final stages of recovery, a brief screen flash occurs that is similar to the one when the screen resolution is changed. After the desktop has been successfully recovered, the following informational message appears to the user.

The message is also logged in the Windows Vista Event Viewer. Diagnosis information is collected in the form of a debug report that is returned to Microsoft through the Online Crash Analysis (OCA) mechanism if the user opts in to provide feedback.


Registry Keys

The following registry keys are documented for testing purposes only. These registry keys should not be manipulated by any applications outside targeted testing or debugging.


• The TDR-related registry keys are located under HKLM\System\CurrentControlSet\Control\GraphicsDrivers


• TdrLevel: REG_DWORD. The initial level of recovery. The possible values are:


• TdrLevelOff (0). – Detection disabled.


• TdrLevelBugcheck (1)– Bug check on detected timeout, for example, no recovery.


• TdrLevelRecoverVGA (2) – Recover to VGA (not implemented).


• TdrLevelRecover(3) – Recover on timeout. This is the default value.


• TdrDelay: REG_DWORD. The number of seconds that the GPU is allowed to delay the preempt request from the scheduler. This is effectively the timeout threshold. The default value is 2.


• TdrDdiDelay: REG_DWORD. The number of seconds that the operating system allows threads to leave the driver. After a specified time, the operating system bug checks the system with the code VIDEO_TDR_FAILURE (0x116). The default value is 5.

• TdrTestMode:REG_DWORD: Internal test usage.


• TdrDebugMode:REG_DWORD: The debugging-related behavior of the TDR process.


• TDR_DEBUG_MODE_OFF (0) breaks to kernel debugger before the recovery to allow investigation of the timeout.

• TDR_DEBUG_MODE_IGNORE_TIMEOUT (1) ignores any timeout.


• TDR_DEBUG_MODE_RECOVER_NO_PROMPT (2) recovers without break into the debugger. This is the default value.

• TDR_DEBUG_MODE_RECOVER_UNCONDITIONAL (3) recovers even if some recovery conditions are not met (for example, recovers on consecutive timeouts).



Graphics hardware vendors:


• Ensure that graphics operations (that is, DMA buffer completion) take no more than 2 seconds in end-user scenarios such as productivity and gameplay.


Graphics software vendors:

• Ensure that the DirectX graphics application does not run at a low frames per second (FPS) rate. As the FPS decreases, the likelihood of the GPU getting reset increases. If the application is running at 10 FPS or lower and a complex graphics operation is about to start, then a flush can be inserted.


• For running benchmark tests on low-end GPUs, use the aforementioned registry keys that control the TDR timeout. Remember that they should not be used in production systems because it would affect overall system stability and robustness. Use these keys only as a final solution.



System manufacturers:


• Work with the graphics hardware vendor to diagnose the TDR debug reports.


• Remember that any system that uses the aforementioned TDR registry keys to change the default values is a Windows Logo Program violation.




===================================================================
===================================================================





737-27116: Radeon™ Series - ATIKMDAG has stopped responding error messages

The information in this article applies to the following configuration(s):

(* Radeon™ HD 4800 series)
* Radeon™ HD 3800 series
* Radeon™ HD 2900 series
* Radeon™ HD 2600 series
* Radeon™ HD 2400 series
* Radeon™ X1900 series
* Radeon™ X1950 series
* Radeon™ X1900 series
* Radeon™ X1800 series
* Radeon™ X1650 series
* Radeon™ X1600 series
* Radeon™ X1550 series
* Radeon™ X1300 series
* Radeon™ X1050 series
* Radeon™ X850 series
* Radeon™ X800 series
* Radeon™ X700 series
* Radeon™ X600 series
* Radeon™ X550 series
* Radeon™ X300 series
* Radeon™ 9800 series
* Radeon™ 9700 series
* Radeon™ 9650 series
* Radeon™ 9600 series
* Radeon™ 9550 series
* Radeon™ 9500 series
* Windows Vista 32-bit Edition
* Windows Vista 64-bit Edition

Symptoms:

When running games or full screen video, some users may be shown a message stating ATIKMDAG has stopped responding but was successfully recovered. In some cases, the system will continue to work as normal. Alternately, this error message may not result in the system being recovered and the system may need to be reset.

Details:

The issue is generally not caused by the graphics driver. It is caused by hardware or system issues. The most common causes for this issue include:

* Insufficient Power
* Bad Memory Modules
* Over Clocking
* Motherboard/Memory Compatibility
* Overheating
* Defective Motherboard



Please refer to this Microsoft for an explanation of the message


Solution:

The following are suggestions you can try in order to isolate the cause of your issue:

* Remove some memory or move memory to a different slot
* Try a different memory brand or replace the memory
* Test with a different power supply
* Test the graphics card in a different system with the same driver version installed.
* If it works the issue is not the graphics card or graphics driver.
* If it does not work, obtain warranty service for the graphics card if still under warranty or try a replacement graphics card.

* Obtain warranty service for your graphics card if still under warranty or try a replacement graphics card if don’t have the option to test your card in another system.

If you have this issue and can reproduce it on command, it is not random or intermittent, please collect the following information to submit to us.

1. Your motherboard Make and Model # or System Make and Model #.
2. A System Information report. In the search box from the Vista menu type msinfo32. Save the file.
3. A Direct X Diagnostic report. In the search box from the Vista menu type dxdiag. Save the file.
4. If you have Catalyst Control Center installed, please collect the data for the Graphics Hardware from the Information section. Take a screen shot or copy the information down.
5. Specific details to reproduce the issue. Document each and every step taken before the issue occurs. Ensure to include your application version including all patch versions installed. For video what format of video, if possible provide a link to a video clip we can use for reproduction.
6. Please provide all your settings for both your application and Catalyst Driver settings if changed from default.
Gefunden hier: http://www.computerbase.de/forum/showthread.php?t=472479

new_vision
2008-09-15, 09:19:55
Sehe keine Lösung... :D

Ich habe meinen PCIE-Takt manuell auf 100 festgelegt (stand vorher auf Auto), seitdem dehören die atikmdag.sys Abstürze der Vergangenheit an, seit mittlerweile 1,5 Wochen läuft das System stabil.

LarsPower
2008-09-16, 10:59:34
Die Lösung findest du http://www.forumdeluxx.de/forum/showthread.php?t=428906. Hat bei mir geholfen, seit dem keine VPU Recoverys mehr. Viel Spaß.

DarkFox
2008-09-16, 17:38:40
Die Lösung findest du http://www.forumdeluxx.de/forum/showthread.php?t=428906. Hat bei mir geholfen, seit dem keine VPU Recoverys mehr. Viel Spaß.
Das wurde schon geklärt, dass das Quatsch ist. Bei 99% verbessert das überhaupt nichts. Statt einem Driver Restore bekommt man dann einen Bluescreen/oder muss resetten. Tolle Verbesserung

Edit: Auch die Erklärung macht keinen Sinn, denn das passiert nicht nur bei niedrigen fps sondern ziemlich willkürlich.

Schlammsau
2008-09-16, 17:42:17
Das wurde schon geklärt, dass das Quatsch ist. Bei 99% verbessert das überhaupt nichts. Statt einem Driver Restore bekommt man dann einen Bluescreen. Tolle Verbesserung.

Edit: Auch die Erklärung macht keinen Sinn, denn das passiert nicht nur bei niedrigen fps sondern ziemlich willkürlich.

Stimmt! Das ist das wichtigste!
The issue is generally not caused by the graphics driver. It is caused by hardware or system issues. The most common causes for this issue include:

* Insufficient Power
* Bad Memory Modules
* Over Clocking
* Motherboard/Memory Compatibility
* Overheating
* Defective Motherboard

Gorkon
2008-09-17, 21:36:27
Moin Moin...

Ich hatte diesen Fehler ja nun auch mit meiner alten X1900XT, allerdings erst nur unter Vista. Nur trat der Fehler dann auch irgendwann unter XP / XP64 auf und es stellte sich heraus, dass es die Graka selber war. Ob GPU und / oder VRAM jetzt schuld waren, war aber nicht herausfindbar.

Die Einzige Modifikation an der Karte war der VF-900 (voll verbaut, inkl. RAM-Kühler)...der sich wohl auf lange Zeit als zu schwachbrüstiig erwiesen hat, obwohl die Temperaturen alle im grünen Bereich lagen.

Ich kann allen nur empfehlen bei diesem Fehler jegliche Hardware gründlichst auf Standard-Takt durchzuchecken...ein Softwarefehler ist das mit Garantie nicht.

mfg