平時在客戶業務的數據分析中,經常有這么一種需求,那就是希望得到某個組里面某些字段最大或最小的記錄.
比如雇員表:
[local]:5432 pg12@testdb=# \d employee
Table "public.employee"
Column | Type | Collation | Nullable | Default
------------+-----------------------+-----------+----------+---------
id | integer | | |
name | character varying(30) | | |
department | character varying(30) | | |
salary | double precision |
我們通過 MockData生成測試數據,共1000行,department共有12個.
[local]:5432 pg12@testdb=# select count(*) from employee;
count
-------
1000
(1 row)
Time: 22.747 ms
[local]:5432 pg12@testdb=# select distinct department from employee;
department
--------------------------
Marketing
Training
Sales
Business Development
Product Management
Research and Development
Support
Legal
Accounting
Services
Human Resources
Engineering
(12 rows)
Time: 2.616 ms
下面希望得到每個department中salary中最大的employee.
常規的做法是使用分組求得最大值/最小值,然后進行關聯查詢:
[local]:5432 pg12@testdb=# select a.* from employee a,(select department,max(salary) as salary from employee group by department) b
pg12@testdb-# where a.department = b.department and a.salary = b.salary order by a.department;
id | name | department | salary
-----+--------------------+--------------------------+---------
453 | Ericha Hendrikse | Accounting | 9958.5
307 | Kyle Hartegan | Business Development | 9754.93
969 | Odelinda Marsden | Engineering | 9942.3
201 | Glen Kasperski | Human Resources | 9559.54
892 | Mirabelle Lesslie | Legal | 9720.49
214 | Chane Koschek | Marketing | 9943.86
371 | Josy Ayliff | Product Management | 9975.48
191 | Meir Alvaro | Research and Development | 9870
770 | Adoree de Guerre | Sales | 9808.65
370 | Benoite Overlow | Services | 9884.79
866 | Shirlee McIlherran | Support | 9884.08
586 | Renae Jerromes | Training | 9904.24
(12 rows)
Time: 8.256 ms
[local]:5432 pg12@testdb=#
這種方法有個問題是如果max salary有多條記錄的話,上述查詢的結果會有多條.
PostgreSQL提供了DISTINCT ON,可簡單實現該需求
[local]:5432 pg12@testdb=# SELECT DISTINCT ON (department)
pg12@testdb-# *
pg12@testdb-# FROM
pg12@testdb-# employee
pg12@testdb-# ORDER BY
pg12@testdb-# department,
pg12@testdb-# salary DESC;
id | name | department | salary
-----+--------------------+--------------------------+---------
453 | Ericha Hendrikse | Accounting | 9958.5
307 | Kyle Hartegan | Business Development | 9754.93
969 | Odelinda Marsden | Engineering | 9942.3
201 | Glen Kasperski | Human Resources | 9559.54
892 | Mirabelle Lesslie | Legal | 9720.49
214 | Chane Koschek | Marketing | 9943.86
371 | Josy Ayliff | Product Management | 9975.48
191 | Meir Alvaro | Research and Development | 9870
770 | Adoree de Guerre | Sales | 9808.65
370 | Benoite Overlow | Services | 9884.79
866 | Shirlee McIlherran | Support | 9884.08
586 | Renae Jerromes | Training | 9904.24
(12 rows)
Time: 11.445 ms
Excellent Feature!
參考資料
The Many Faces of DISTINCT in PostgreSQL
免責聲明:本站發布的內容(圖片、視頻和文字)以原創、轉載和分享為主,文章觀點不代表本網站立場,如果涉及侵權請聯系站長郵箱:is@yisu.com進行舉報,并提供相關證據,一經查實,將立刻刪除涉嫌侵權內容。