c++ - 平方根倒数算法的效率比较的一个问题

浏览：34日期：2023-05-13

问题描述

问题源于这样一道作业题： c++ - 平方根倒数算法的效率比较的一个问题

作业中提到的的那个invSqrt的代码如下：

float Q_rsqrt( float number ){ long i; float x2, y; const float threehalfs = 1.5F; x2 = number * 0.5F; y = number; i = * ( long * ) &y; // evil floating point bit level hacking i = 0x5f3759df - ( i >> 1 ); // what the fuck? y = * ( float * ) &i; y = y * ( threehalfs - ( x2 * y * y ) ); // 1st iteration // y = y * ( threehalfs - ( x2 * y * y ) ); // 2nd iteration, this can be removed return y;}

然后就自己写了个比较性能和精度的程序：

#include<stdio.h>#include<math.h>#include<time.h>float Q_rsqrt(float number)//从wiki上获取的源代码{ long i; float x2, y; const float threehalfs = 1.5F; x2 = number * 0.5F; y = number; i = *(long *)&y; // evil floating point bit level hacking i = 0x5f3759df - (i >> 1); // what the fuck? y = *(float *)&i; y = y * (threehalfs - (x2 * y * y)); // 1st iteration y = y * (threehalfs - (x2 * y * y)); // 2nd iteration, this can be removed return y;}int main(){ int LoopCount = 1000000;//循环计算次数 float dur = 0;//时间间隔，初始化为0 float x;//待计算数 printf('Please input a number:');//输入提示 scanf_s('%f', &x, sizeof(float)); float y; clock_t t1 = clock(); for (int i = 0; i < LoopCount; i++) {y =(float)( 1 / sqrt(x)); } t1 = clock() - t1; printf('1/sqrt(%f)=%fn', x, y); printf('%ldn', t1); printf('Use Time_1:%.10f sn', ((float)t1 / CLOCKS_PER_SEC)); float z; clock_t t2 = clock(); for (int j = 0; j < LoopCount; j++) {z = Q_rsqrt(x); } t2 = clock() - t2; printf('Q_rsqrt(%f)=%fn', x, z); printf('%ldn', t2); printf('Use Time_2:%.10f sn', ((float)t2 / CLOCKS_PER_SEC));getchar(); getchar(); return 0;}

之后，出现了一个很奇怪的问题：

c++ - 平方根倒数算法的效率比较的一个问题

是的。。。C中的库函数效率基本上是下面那个算法的4倍！！！但是，如果用C++中的库函数：

#include<stdio.h>#include<cmath>#include<time.h>float Q_rsqrt(float number)//从wiki上获取的源代码{ long i; float x2, y; const float threehalfs = 1.5F; x2 = number * 0.5F; y = number; i = *(long *)&y; // evil floating point bit level hacking i = 0x5f3759df - (i >> 1); // what the fuck? y = *(float *)&i; y = y * (threehalfs - (x2 * y * y)); // 1st iteration y = y * (threehalfs - (x2 * y * y)); // 2nd iteration, this can be removed return y;}int main(){ int LoopCount = 1000000;//循环计算次数 float dur = 0;//时间间隔，初始化为0 float x;//待计算数 printf('Please input a number:');//输入提示 scanf_s('%f', &x, sizeof(float)); float y; clock_t t1 = clock(); for (int i = 0; i < LoopCount; i++) {y =( 1 / sqrt(x)); } t1 = clock() - t1; printf('1/sqrt(%f)=%fn', x, y); printf('%ldn', t1); printf('Use Time_1:%.10f sn', ((float)t1 / CLOCKS_PER_SEC)); float z; clock_t t2 = clock(); for (int j = 0; j < LoopCount; j++) {z = Q_rsqrt(x); } t2 = clock() - t2; printf('Q_rsqrt(%f)=%fn', x, z); printf('%ldn', t2); printf('Use Time_2:%.10f sn', ((float)t2 / CLOCKS_PER_SEC));getchar(); getchar(); return 0;}

然后跑出来的结果是这样的：

c++ - 平方根倒数算法的效率比较的一个问题

因此。。。似乎C的效率比C++高了很多很多？现在在VS2015中的实现就连那个神奇的快速算法也比不过了？（还是。。。我这样的比较是不对的。。。）

疑惑中。。。

问题解答

回答1：

凡是涉及到效率比较的问题，首先要确定的几个前提条件是：

是否使用了同样的编译器

是否使用了同样编译开关（比如优化开的是O几）

是否在同样的环境配置下运行的

运行的结果是否精确定量可重复

在以上条件都确定以后，才能确认结果的可靠性，然后再考虑结果产生的原因。

不知道题主的问题是否考虑了上述前提条件。

题目中只测试了两个算法在同一个 case 下的结果，也许这个 case 对于一个算法是最优情况、对另一个算法是最差情况，所以结果不能说明问题。例如对于一个一般情况下是O(n log n)复杂度的算法，有可能最坏情况是O(n^2)、最优情况是O(n)。不能仅凭一个 case 就说明某个算法的优劣。

回答2：

补充一下@jaege 的答案。你只算了两个数值，这样很难说明情况。要比较的话至少多跑一些数值，比如几万个数值，然后再比较。

上一条：c++11 - c++里面类成员为什么不能做为类成员函数的默认实参？下一条：c++ - STL的优先队列中为何没有删除某个任意元素的方法呢？

相关文章：
1. 为什么老师，你de button按钮处可以有两个ID是一样的id="loginbtn" 而不会报错啊2. javascript - 在移动端input type="number" 键盘没法弹起3. 为什么总是提示我说Template "movieTemplate" not found，我路径都引对了呀4. javascript - type="date"的默认值能改变为当天日期吗5. mysql - sphinx查询 "中国" 时也能查询到 "中华人民共和国"6. solaris基础和常用知识 (2)7. node.js mysql Cannot find module "net" 和 "tls"和"fs" 的问题8. javascript - 一个字符串转换成数字,例子就是a="2,322.222",b=’1,211.21’,如何在angualr中执行相减9. mysql - 使用hibernate连接数据库时,数据库版本过高不支持关键字"type" ；10. javascript - 关于input="file"，重复选择同一文件时不触发change事件的问题