http://blog.csdn.net/solstice/archive/2010/01/16/5196544.aspx
多核时代不宜再用 x86 的 RDTSC 指令测试指令周期和时间
虽然 RDTSC 废掉了,性能测试用的高精度计时还是有办法的 [2],在 Windows 用 QueryPerformanceCounter 和 QueryPerformanceFrequency,Linux 下用 POSIX 的 clock_gettime 函数,以 CLOCK_MONOTONIC 参数调用。
QueryPerformanceCounter() 错误的情况我们也碰见过,用 SetThreadAffinityMask() 解决。
转载自:http://cooker.javaeye.com/blog/652604
MSDN上的描述:
- SetThreadAffinityMask
- The SetThreadAffinityMask function sets a processor affinity mask for the specified thread.
- DWORD_PTR SetThreadAffinityMask(
- HANDLE hThread,
- DWORD_PTR dwThreadAffinityMask
- );
Parameters
hThread
[in] Handle to the thread whose affinity mask is to be set.
This handle must have the THREAD_SET_INFORMATION and THREAD_QUERY_INFORMATION access rights. For more information, see Thread Security and Access Rights.
dwThreadAffinityMask
[in] Affinity mask for the thread.
Windows Me/98/95: This value must be 1.
Return Values
If the function succeeds, the return value is the thread's previous affinity mask.
Windows Me/98/95: The return value is 1. To succeed, hThread must be valid and dwThreadAffinityMask must be 1.
If the function fails, the return value is zero. To get extended error information, call GetLastError.
Remarks
A thread affinity mask is a bit vector in which each bit represents the processors that a thread is allowed to run on.
A thread affinity mask must be a proper subset of the process affinity mask for the containing process of a thread. A thread is only allowed to run on the processors its process is allowed to run on.
通过调用SetThreadAffinityMask,就能为各个线程设置亲缘性屏蔽:
该函数中的h T h r e a d参数用于指明要限制哪个线程, dwThreadAffinityMask用于指明该线程能够在哪个CPU上运行。dwThreadAffinityMask必须是进程的亲缘性屏蔽的相应子集。返回值是线程的前一个亲缘性屏蔽。
因此,若要将3个线程限制到CPU1、2和3上去运行,可以这样操作:
本文来自CSDN博客,转载请标明出处:http://blog.csdn.net/W511522329/archive/2010/03/06/5352597.aspx
start.QuadPart = 49102789906513
sleep for10(s)
end.QuadPart = 49127801303663
consume value = end.QuadPart - start.QuadPart = 25011397150
(consume value/(double)freq.QuadPart) Time consumed = 10.0046(s)
hThread
[in] Handle to the thread whose affinity mask is to be set.
This handle must have the THREAD_SET_INFORMATION and THREAD_QUERY_INFORMATION access rights. For more information, see Thread Security and Access Rights.
dwThreadAffinityMask
[in] Affinity mask for the thread.
Windows Me/98/95: This value must be 1.
Return Values
If the function succeeds, the return value is the thread's previous affinity mask.
Windows Me/98/95: The return value is 1. To succeed, hThread must be valid and dwThreadAffinityMask must be 1.
If the function fails, the return value is zero. To get extended error information, call GetLastError.
Remarks
A thread affinity mask is a bit vector in which each bit represents the processors that a thread is allowed to run on.
A thread affinity mask must be a proper subset of the process affinity mask for the containing process of a thread. A thread is only allowed to run on the processors its process is allowed to run on.
通过调用SetThreadAffinityMask,就能为各个线程设置亲缘性屏蔽:
- DWORD_PTR SetThreadAffinityMask(HANDLE hThread, DWORD_PTR dwThreadAffinityMask);
该函数中的h T h r e a d参数用于指明要限制哪个线程, dwThreadAffinityMask用于指明该线程能够在哪个CPU上运行。dwThreadAffinityMask必须是进程的亲缘性屏蔽的相应子集。返回值是线程的前一个亲缘性屏蔽。
因此,若要将3个线程限制到CPU1、2和3上去运行,可以这样操作:
- //Thread 0 can only run on CPU 0.
- SetThreadAffinityMask(hThread0, 0x00000001); //第0位是1
- //Threads 1, 2, 3 run on CPUs 1, 2, 3.//第1 2 3位是1
- SetThreadAffinityMask(hThread1, 0x00000002);
- SetThreadAffinityMask(hThread2, 0x00000003);
- SetThreadAffinityMask(hThread3, 0x00000004);
本文来自CSDN博客,转载请标明出处:http://blog.csdn.net/W511522329/archive/2010/03/06/5352597.aspx
- #include "stdafx.h"
- #include <windows.h>
- #include <string>
- #include <iostream>
- void running(int seconds) {
- Sleep(seconds*1000);
- std::cout<<"sleep for "<<seconds<<"(s)"<<std::endl;
- }
- int _tmain(int argc, _TCHAR* argv[])
- {
- SetThreadAffinityMask(GetCurrentThread(), 1);
- LARGE_INTEGER start, end;
- LARGE_INTEGER freq;
- //timeConsuming();
- QueryPerformanceFrequency(&freq);
- QueryPerformanceCounter(&start);//start
- std::cout<<"start.QuadPart = "<<start.QuadPart<<std::endl; //output start
- running(10); //running 10 seconds
- QueryPerformanceCounter(&end); //end
- std::cout<<"end.QuadPart = "<<end.QuadPart<<std::endl; //output end
- std::cout<<"consume value = end.QuadPart - start.QuadPart = "<<(end.QuadPart - start.QuadPart)<<std::endl;
- std::cout<<"(consume value/(double)freq.QuadPart) Time consumed = "<<(end.QuadPart - start.QuadPart)/(double)freq.QuadPart<<"(s)"<<std::endl; //output consumed time
- return 0;
- }
start.QuadPart = 49102789906513
sleep for10(s)
end.QuadPart = 49127801303663
consume value = end.QuadPart - start.QuadPart = 25011397150
(consume value/(double)freq.QuadPart) Time consumed = 10.0046(s)