• Use PerfHUD ES to Do Frame Capture Android Game


    Author: http://www.cnblogs.com/open-coder/p/3898224.html

    Get Start

    This is short tutorial about how to do frame capture with Nvidia PrefHUD. You could find a detail tutorial from here.
    Before frame capture, some requirement should be fullfilled:
    1) OpenGL ES 2.0 at least!
    2) Internet permission required. <uses-permission android:name="android.permission.INTERNET"></uses-permission>
    3) Tegra 4.0 at least device.
    4) Must be Nvidia Android adb. You will failed using other adb version, Nvidia Android adb requried, no exception.
    5) Before run app, type following under command line:
         adb shell setprop debug.perfhudes 1
    The following are some screen shots that I frame Captured “ShadowGun”:
    Performance_Dashboard
    draw_call_duration
    Frame_Debugger
    gemetory

     

    With PrefHUD ES Frame Capture, you could find the bottle neck and find the optimaztion direction:
    1) How many trianges will be drawn within one frame: whether we need to reduce the scene trianges or character traingles;
    2) Which one draw call cost the longgest time, is it reasonable? too many traingles? too complicated shader? vertex shader or fragment shader? Can some shader operations could be done in application code? 
        Can we move some operation from fragment shader to vertex shader? Use vertex lighting instead of pixel shader light?  Precalcualte the result and save to the texture?
    3) Too many objects overlap draw with each other? Enable occlusion culling could fix this problem well.
    4) Too many draw calls? Batch the objects with the same material and atlas texture.
    5) Too much textures? Reduce texture size, use small texture to tiling instead of big texture to blend. Use Bilinear filter mode will be much faster than trilinear filter mode.
    6) Reduce ‘glClear’ calls, it will depends on the device, some device will consume much time with ‘glClear’. Generally, we could only clear depth buffer will work. One time for main scene, the other one for HUD.
    7) Avoid use complicated and switch statement in the shader. Later some super shader will support those feature well, but well those features are not supported well currently.
    8) By ingore the whole render loop, and check the FPS. You could figure out whether the game is CPU bind and GPU bind.
    9) LOD for scene objects.
    10) Batch some small objects that near to each other. Seperate some big objects into smaller one, let view volume to cull them.
    11) Above method will opimize under low level. But if you have a good level design, you could save a lot of FPS. Set up a door between a indoor area and outdoor area? Seperate the outdoor area with some view occlusion objects and equipments?
    ……

     

    PerfHUD Usage

    PerfHUD could allow you do some test directly and check whether those are the bottle necks of the game, just as the following diagram:
    direct_test
    You could uncheck one of them, and see fps on the dashboard window status bar. This will help you make sure whether one of them will be the bottle neck.
    2x2 Texture: too many and large textures and texture cache missing was the bottle neck:
    Ignore Draw Calls: too many draw calls and GPU is the bind;
    Null Fragment shader: fragment shader was too expensive;
    Null viewport: a very small view port; Vertex transform, too many draw calls, too many render state switch may be the reason.
    Disable Blending: alpah blend, particlse effects may be the reasons;
    Disable Clear: glClear function take too much time;
    Disable Texture Upload: texture content take too much bindwith, transform from Main memory to GPU memory;
    Disable Buffer Data: vertex data, static or dynamic, transform from Main memory to GPU memory;
    Disable Unfirm Upload: parameters that pass to material shader.

  • 相关阅读:
    在CentOS 6.7 64位安装PHP的PDO_OCI扩展 Installing PDO_OCI extension on CentOS 6.7 64bit
    Windows下Apache+PHP+MySQL开发环境的搭建(WAMP)
    在Window上用cmd创建.htaccess文件
    Magento 0元订单 支付方式 -- Magento 0 Subtotal Payment Method
    PHP使用OPENSSL RSA加密解密数据
    CentOS编译安装PHP 7.0
    [转] CentOS单独安装Apache Benchmark压力测试工具的办法
    [转] 基于MySQL的秒杀核心设计(减库存部分)-防超卖与高并发
    快速激活JetBrains PhpStorm WebStorm系列产品
    Mac OS的phpize空信息解决办法
  • 原文地址:https://www.cnblogs.com/open-coder/p/3898224.html
Copyright © 2020-2023  润新知