我对PostgreSQL 中 index scan 与 seq scan 的对比学习

开始

数据量很小的时候，我们可以看到，seq scan 比 index scan 更加有效。那是因为 index scan 至少要发生两次I/O，一次是读取索引块，一次是读取数据块。当index 很大的时候，情况可能会更加复杂。

postgres=# select a.relpages, a.reltuples, a.relfilenode,a.reltype,b.typname from pg_class a, pg_type b where a.relname like 'gaotab%' and a.reltype=b.oid;    
 relpages | reltuples | relfilenode | reltype | typname     
----------+-----------+-------------+---------+---------    
        1 |      100 |       16396 |   16386 | gaotab

数据量为 100条记录。

预估成本：

postgres=# set session enable_seqscan=false;                    
SET                    
postgres=# explain select name from gaotab where id=50;                    
                             QUERY PLAN                                                  
---------------------------------------------------------------------                    
 Index Scan using idx_id on gaotab  (cost=0.00..8.27 rows=1 width=5)                    
   Index Cond: (id = 50)                    
(2 rows)

postgres=# set session enable_seqscan=true;        
SET        
        
postgres=# explain select name from gaotab where id=50;        
                      QUERY PLAN                              
------------------------------------------------------        
 Seq Scan on gaotab  (cost=0.00..2.25 rows=1 width=5)        
   Filter: (id = 50)        
(2 rows)

实际执行：

postgres=# set session enable_seqscan=false;        
SET        
postgres=# explain analyze select name from gaotab where id=50;        
                                                  QUERY PLAN                            
                                       
--------------------------------------------------------------------------------        
-------------------------------        
 Index Scan using idx_id on gaotab  (cost=0.00..8.27 rows=1 width=5) (actual tim        
e=0.112..0.113 rows=1 loops=1)        
   Index Cond: (id = 50)        
 Total runtime: 0.133 ms        
(3 rows)

postgres=# set session enable_seqscan=true;        
SET        
postgres=# explain analyze select name from gaotab where id=50;        
                                           QUERY PLAN                                   
                        
--------------------------------------------------------------------------------        
----------------        
 Seq Scan on gaotab  (cost=0.00..2.25 rows=1 width=5) (actual time=0.014..0.018         
rows=1 loops=1)        
   Filter: (id = 50)        
   Rows Removed by Filter: 99        
 Total runtime: 0.034 ms        
(4 rows)

等到数据量大的时候，就是截然不同了。

数据为1000条记录时，通过查询可以看到，已经跨越了7个page：

postgres=# analyze;    
ANALYZE    
postgres=# select a.relpages, a.reltuples, a.relfilenode,a.reltype,b.typname from pg_class a, pg_type b where a.relname like 'gaotab%' and a.reltype=b.oid;    
 relpages | reltuples | relfilenode | reltype | typname     
----------+-----------+-------------+---------+---------    
        7 |      1000 |       16396 |   16386 | gaotab    
(1 row)    
    
postgres=#

再次预估成本，此时seq scan 已经开始变得不划算了：

postgres=# set session enable_seqscan=false;
SET
postgres=# explain select name from gaotab where id=50;
                             QUERY PLAN                              
---------------------------------------------------------------------
 Index Scan using idx_id on gaotab  (cost=0.00..8.27 rows=1 width=6)
   Index Cond: (id = 50)
(2 rows)

postgres=# set session enable_seqscan=true;
SET
postgres=# explain select name from gaotab where id=50;
                             QUERY PLAN                              
---------------------------------------------------------------------
 Index Scan using idx_id on gaotab  (cost=0.00..8.27 rows=1 width=6)
   Index Cond: (id = 50)
(2 rows)

postgres=# set session enable_indexscan=false;
SET
postgres=# explain select name from gaotab where id=50;
                             QUERY PLAN                              
---------------------------------------------------------------------
 Bitmap Heap Scan on gaotab  (cost=4.26..8.27 rows=1 width=6)
   Recheck Cond: (id = 50)
   ->  Bitmap Index Scan on idx_id  (cost=0.00..4.26 rows=1 width=0)
         Index Cond: (id = 50)
(4 rows)

postgres=# set session enable_bitmapscan=false;
SET
postgres=# explain select name from gaotab where id=50;
                      QUERY PLAN                       
-------------------------------------------------------
 Seq Scan on gaotab  (cost=0.00..19.50 rows=1 width=6)
   Filter: (id = 50)
(2 rows)

postgres=#

实际执行

postgres=# set session enable_seqscan=false;
SET
postgres=# explain analyze select name from gaotab where id=50;
                                                            QUERY PLAN                                      
                       
------------------------------------------------------------------------------------------------------------
-----------------------
 Index Scan using idx_id on gaotab  (cost=10000000000.00..10000000008.27 rows=1 width=6) (actual time=0.020.
.0.022 rows=1 loops=1)
   Index Cond: (id = 50)
 Total runtime: 0.051 ms
(3 rows)

postgres=# set session enable_seqscan=true;
SET
postgres=# set session enable_indexscan=false;
SET
postgres=# set session enable_bitmapscan=false;
SET
postgres=# explain analyze select name from gaotab where id=50;
                                           QUERY PLAN                                            
-------------------------------------------------------------------------------------------------
 Seq Scan on gaotab  (cost=0.00..19.50 rows=1 width=6) (actual time=0.015..0.095 rows=1 loops=1)
   Filter: (id = 50)
   Rows Removed by Filter: 999
 Total runtime: 0.109 ms
(4 rows)

postgres=#

[作者：技术者高健@博客园 mail: luckyjackgao@gmail.com ]

结束

相关阅读:
Spring入门-对异常的处理
 Spring入门-Interceptor基本使用
 Spring入门-浏览器中文乱码问题
 Spring入门-使用SpringMVC完成一个登陆页面
 Spring入门-获取页面参数&向页面传值&重定向
 【Linux】【NodeJs】Centos7安装node-v10.16.3环境
 【Linux】【Go】Centos7安装go1.13环境
 【Linux】【Fabric】Centos7搭建Fabric运行环境
 【Linux】【ELK】利用elasticproxy对elasticsearch进行二次排序
 【Linux】【ELK】搭建Elasticsearch+Logstash+Kibana+Filebeat日志收集系统
原文地址：https://www.cnblogs.com/gaojian/p/2758750.html