MySQL supports window functions since version 8.0, which greatly improves the processing efficiency of complex queries. 1. The window function can calculate a group of rows without merging and paralleling, such as ranking, cumulative statistics, etc.; 2. Common functions include ROW_NUMBER(), RANK(), DENSE_RANK(), SUM() OVER(), AVG() OVER(), LAG(), LEAD(); 3. When using it, you must cooperate with the OVER() clause to specify PARTITION BY and ORDER BY; 4. Typical application scenarios include ranking query (such as Top N per Group), cumulative statistics (such as Running Total), context comparison (such as month-on-month analysis); 5. Pay attention to version compatibility (only 8.0), performance optimization, and GROUP BY The difference and frame settings are as well as the settings of the frames to ensure accurate results and efficient query.
MySQL supports window functions starting with version 8.0, which greatly enhances its ability to handle complex queries. If you could only rely on subqueries or temporary tables to implement ranking, cumulative statistics and other functions, now using window functions will be more concise and efficient.

What is a window function?
A window function is a function that can perform calculations on a set of rows, but does not merge multiple rows into one row like an aggregate function. It keeps the data of each row and analyzes them, such as "what does each row rank within the current group".
Common window functions include:

-
ROW_NUMBER()
: Assign a unique sequence number to each line -
RANK()
: The subsequent ranking will be skipped when there are parallel values. -
DENSE_RANK()
: The subsequent ranking will not be skipped when there are parallel values. -
SUM() OVER()
,AVG() OVER()
: You can do moving average, cumulative summing, etc. -
LAG()
,LEAD()
: Get the value of a certain line before and after the current line
These functions must be used in conjunction with OVER()
clause, you can specify the sorting method ( ORDER BY
) and the grouping method ( PARTITION BY
).
How to write a basic window function?
The syntax structure is roughly as follows:

function_name(...) OVER ( [PARTITION BY ...] [ORDER BY ...] [frame_clause] )
To give a simple example, you want to know how each employee ranks in their department:
SELECT emp_id, Dept_id, Salarary, ROW_NUMBER() OVER (PARTITION BY dept_id ORDER BY salary DESC) AS rank_in_dept FROM employees;
The meaning of this code is: divide the data windows by department, rank them from high to low in each department, and then number each person. In this way, you can know who is the first and second highest salary in the department.
Notice:
-
PARTITION BY
is optional, and if it is not added, the entire result set will be a big window. -
ORDER BY
is generally required, otherwise the expected results may not be obtained. - Different functions have different requirements for whether
ORDER BY
is required. For example,SUM() OVER()
may not be required, butRANK()
must be required.
Common application scenarios and techniques
1. Ranking questions (Top N per Group)
For example, find out the three people with the highest salary in each department:
SELECT * FROM ( SELECT *, ROW_NUMBER() OVER (PARTITION BY dept_id ORDER BY salary DESC) AS rn FROM employees ) t WHERE rn <= 3;
Here we will first place the rankings and then filter out the top three. If you want to allow a tie-up (such as two first places), you can use RANK()
or DENSE_RANK()
.
2. Cumulative statistics (Running Total)
For example, counting the cumulative sum of sales per month:
SELECT month, Sales, SUM(sales) OVER (ORDER BY month ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS running_total FROM monthly_sales;
This ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
means "from the beginning to the current row", which means adding up month by month.
3. Context comparison (month-on-month, year-on-year)
For example, want to know how much this month has increased compared to last month:
SELECT month, Sales, LAG(sales, 1) OVER (ORDER BY month) AS last_month_sales, sales - LAG(sales, 1) OVER (ORDER BY month) AS diff FROM monthly_sales;
LAG()
function can get the value of the previous line, which is suitable for month-on-month change analysis.
Things to note when using window functions
- Version compatibility : Window functions are only supported in MySQL 8.0 and cannot be used in old versions.
- Performance issues : Window functions may bring additional overhead, especially when there is a large amount of data, pay attention to indexing and partitioning design.
- Avoid misuse of
GROUP BY
: Window functions are not tools that replaceGROUP BY
, and the two use differently. If you just want statistics, you should still use aggregate functions first. - Understand the concept of frames (Frame) :
ROWS
andRANGE
control the window range, affecting the accuracy of the result, especially when it comes to time series.
Basically that's it. After mastering several common functions and syntax structures, you will find that window functions are clearer and more efficient than the original writing method in many scenarios.
The above is the detailed content of how to use window functions in mysql. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

When handling NULL values ??in MySQL, please note: 1. When designing the table, the key fields are set to NOTNULL, and optional fields are allowed NULL; 2. ISNULL or ISNOTNULL must be used with = or !=; 3. IFNULL or COALESCE functions can be used to replace the display default values; 4. Be cautious when using NULL values ??directly when inserting or updating, and pay attention to the data source and ORM framework processing methods. NULL represents an unknown value and does not equal any value, including itself. Therefore, be careful when querying, counting, and connecting tables to avoid missing data or logical errors. Rational use of functions and constraints can effectively reduce interference caused by NULL.

mysqldump is a common tool for performing logical backups of MySQL databases. It generates SQL files containing CREATE and INSERT statements to rebuild the database. 1. It does not back up the original file, but converts the database structure and content into portable SQL commands; 2. It is suitable for small databases or selective recovery, and is not suitable for fast recovery of TB-level data; 3. Common options include --single-transaction, --databases, --all-databases, --routines, etc.; 4. Use mysql command to import during recovery, and can turn off foreign key checks to improve speed; 5. It is recommended to test backup regularly, use compression, and automatic adjustment.

To view the size of the MySQL database and table, you can query the information_schema directly or use the command line tool. 1. Check the entire database size: Execute the SQL statement SELECTtable_schemaAS'Database',SUM(data_length index_length)/1024/1024AS'Size(MB)'FROMinformation_schema.tablesGROUPBYtable_schema; you can get the total size of all databases, or add WHERE conditions to limit the specific database; 2. Check the single table size: use SELECTta

GROUPBY is used to group data by field and perform aggregation operations, and HAVING is used to filter the results after grouping. For example, using GROUPBYcustomer_id can calculate the total consumption amount of each customer; using HAVING can filter out customers with a total consumption of more than 1,000. The non-aggregated fields after SELECT must appear in GROUPBY, and HAVING can be conditionally filtered using an alias or original expressions. Common techniques include counting the number of each group, grouping multiple fields, and filtering with multiple conditions.

Character set and sorting rules issues are common when cross-platform migration or multi-person development, resulting in garbled code or inconsistent query. There are three core solutions: First, check and unify the character set of database, table, and fields to utf8mb4, view through SHOWCREATEDATABASE/TABLE, and modify it with ALTER statement; second, specify the utf8mb4 character set when the client connects, and set it in connection parameters or execute SETNAMES; third, select the sorting rules reasonably, and recommend using utf8mb4_unicode_ci to ensure the accuracy of comparison and sorting, and specify or modify it through ALTER when building the library and table.

MySQL supports transaction processing, and uses the InnoDB storage engine to ensure data consistency and integrity. 1. Transactions are a set of SQL operations, either all succeed or all fail to roll back; 2. ACID attributes include atomicity, consistency, isolation and persistence; 3. The statements that manually control transactions are STARTTRANSACTION, COMMIT and ROLLBACK; 4. The four isolation levels include read not committed, read submitted, repeatable read and serialization; 5. Use transactions correctly to avoid long-term operation, turn off automatic commits, and reasonably handle locks and exceptions. Through these mechanisms, MySQL can achieve high reliability and concurrent control.

The most direct way to connect to MySQL database is to use the command line client. First enter the mysql-u username -p and enter the password correctly to enter the interactive interface; if you connect to the remote database, you need to add the -h parameter to specify the host address. Secondly, you can directly switch to a specific database or execute SQL files when logging in, such as mysql-u username-p database name or mysql-u username-p database name

The setting of character sets and collation rules in MySQL is crucial, affecting data storage, query efficiency and consistency. First, the character set determines the storable character range, such as utf8mb4 supports Chinese and emojis; the sorting rules control the character comparison method, such as utf8mb4_unicode_ci is case-sensitive, and utf8mb4_bin is binary comparison. Secondly, the character set can be set at multiple levels of server, database, table, and column. It is recommended to use utf8mb4 and utf8mb4_unicode_ci in a unified manner to avoid conflicts. Furthermore, the garbled code problem is often caused by inconsistent character sets of connections, storage or program terminals, and needs to be checked layer by layer and set uniformly. In addition, character sets should be specified when exporting and importing to prevent conversion errors
