news 2026/5/9 12:45:16

SQL窗口函数(使用场景)

作者头像

张小明

前端开发工程师

1.2k 24
文章封面图
SQL窗口函数(使用场景)

1. 常见排名和排序

-- 行号 ROW_NUMBER() OVER (ORDER BY sales DESC) AS rank -- 排名(允许并列) RANK() OVER (ORDER BY sales DESC) AS rank -- 密集排名 DENSE_RANK() OVER (ORDER BY sales DESC) AS dense_rank -- 分位数 NTILE(4) OVER (ORDER BY salary DESC) AS quartile

2. 时间序列分析

-- 移动平均 SELECT date, sales, AVG(sales) OVER ( ORDER BY date ROWS BETWEEN 6 PRECEDING AND CURRENT ROW ) AS 7_day_avg FROM daily_sales; -- 环比增长 SELECT month, revenue, (revenue - LAG(revenue, 1) OVER (ORDER BY month)) / LAG(revenue, 1) OVER (ORDER BY month) * 100 AS growth_rate FROM monthly_revenue;

3. 累计计算

-- 累计求和 SELECT date, sales, SUM(sales) OVER ( ORDER BY date ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW ) AS running_total FROM sales; -- 累计百分比 SELECT customer, revenue, revenue / SUM(revenue) OVER () * 100 AS pct_total, SUM(revenue) OVER ( ORDER BY revenue DESC ROWS UNBOUNDED PRECEDING ) * 100.0 / SUM(revenue) OVER () AS cum_pct FROM customers;

4. 数据比较

-- 与前一行比较 SELECT date, temperature, temperature - LAG(temperature, 1) OVER (ORDER BY date) AS diff_prev, LEAD(temperature, 1) OVER (ORDER BY date) - temperature AS diff_next FROM weather_data; -- 与分组内第一行比较 SELECT department, employee, salary, FIRST_VALUE(salary) OVER (PARTITION BY department ORDER BY salary DESC) as top_salary, salary - FIRST_VALUE(salary) OVER (PARTITION BY department ORDER BY salary DESC) as gap_from_top FROM employees;

5. 高级分析场景

-- 会话分析(找出用户连续访问) SELECT user_id, login_time, LAG(logout_time) OVER (PARTITION BY user_id ORDER BY login_time) as prev_logout, CASE WHEN login_time <= LAG(logout_time) OVER (PARTITION BY user_id ORDER BY login_time) THEN 0 ELSE 1 END as is_new_session FROM user_sessions; -- 查找重复记录 SELECT *, ROW_NUMBER() OVER (PARTITION BY email, name ORDER BY created_at) as dup_count FROM users WHERE dup_count > 1; -- 计算留存率 WITH user_activity AS ( SELECT user_id, login_date, MIN(login_date) OVER (PARTITION BY user_id) as first_login, LEAD(login_date) OVER (PARTITION BY user_id ORDER BY login_date) as next_login FROM logins ) SELECT first_login as cohort_date, COUNT(DISTINCT user_id) as cohort_size, COUNT(DISTINCT CASE WHEN next_login = first_login + INTERVAL '1 day' THEN user_id END) as day1_retained FROM user_activity GROUP BY first_login;

6. 复杂业务场景

-- 漏斗分析 WITH funnel AS ( SELECT user_id, MAX(CASE WHEN event = 'view' THEN 1 ELSE 0 END) as viewed, MAX(CASE WHEN event = 'click' THEN 1 ELSE 0 END) as clicked, MAX(CASE WHEN event = 'purchase' THEN 1 ELSE 0 END) as purchased FROM events GROUP BY user_id ) SELECT COUNT(*) as total_users, SUM(viewed) as viewers, SUM(clicked) as clickers, SUM(purchased) as buyers, 100.0 * SUM(clicked) / NULLIF(SUM(viewed), 0) as click_rate FROM funnel; -- 间隔计算 SELECT user_id, event_time, EXTRACT(EPOCH FROM (event_time - LAG(event_time) OVER (PARTITION BY user_id ORDER BY event_time) )) as seconds_since_last_event FROM events;

7. 性能优化技巧

-- 避免自连接 -- 传统方式(需要自连接) SELECT a.id, a.value, MAX(b.value) as max_so_far FROM table a JOIN table b ON a.id >= b.id GROUP BY a.id, a.value; -- 使用窗口函数(更高效) SELECT id, value, MAX(value) OVER (ORDER BY id ROWS UNBOUNDED PRECEDING) as max_so_far FROM table;

最佳实践建议:

  1. 注意性能:窗口函数在大量数据上可能较慢,合理使用分区

  2. 结合索引:ORDER BY子句中的字段建议有索引

  3. 使用FILTER(如果数据库支持):

    AVG(salary) FILTER (WHERE department = 'Sales') OVER () as avg_sales_salary
  4. 明确窗口范围:明确指定ROWS或RANGE避免歧义

版权声明: 本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若内容造成侵权/违法违规/事实不符,请联系邮箱:809451989@qq.com进行投诉反馈,一经查实,立即删除!
网站建设 2026/5/9 2:52:42

从繁琐到高效:我的 SpringBoot 学习心得与实战感悟

作为一名后端开发者&#xff0c;我曾被 SSM&#xff08;Spring SpringMVC MyBatis&#xff09;框架的配置折磨得 “痛不欲生”—— 无数的 XML 文件、繁杂的依赖配置、bean 的手动注入&#xff0c;稍有不慎就会出现各种奇奇怪怪的异常。直到接触了SpringBoot&#xff0c;我才…

作者头像 李华
网站建设 2026/5/9 2:13:19

Langflow:拖拽式AI工作流构建神器

Langflow&#xff1a;拖拽式AI工作流构建神器 在大模型浪潮席卷各行各业的今天&#xff0c;越来越多团队试图将 LLM 能力融入产品——从智能客服到知识问答&#xff0c;从自动化报告生成到多智能体协作系统。但现实往往令人头疼&#xff1a;写链式调用代码像在拼乐高却没说明书…

作者头像 李华
网站建设 2026/5/8 16:42:48

Ubuntu上快速部署Dify+蓝耘MaaS打造AI应用

Ubuntu上快速部署Dify蓝耘MaaS打造AI应用 在企业级AI应用开发门槛不断降低的今天&#xff0c;越来越多团队开始尝试将大模型能力集成到实际业务场景中。但问题也随之而来&#xff1a;如何在不组建专业AI工程团队的前提下&#xff0c;快速构建一个稳定、可扩展且成本可控的智能…

作者头像 李华
网站建设 2026/5/9 3:05:08

EmotiVoice易魔声:开源情感语音合成引擎

EmotiVoice易魔声&#xff1a;开源情感语音合成引擎 在虚拟主播情绪饱满地讲述故事、游戏NPC因剧情转折而愤怒咆哮的今天&#xff0c;传统的“机械朗读式”语音合成早已无法满足用户对沉浸感的期待。人们不再只想听一段话——他们想听见情绪&#xff0c;听见性格&#xff0c;甚…

作者头像 李华
网站建设 2026/5/9 0:47:09

LangChain与Anything-LLM协同工作的底层逻辑与接口调用方式

LangChain与Anything-LLM协同工作的底层逻辑与接口调用方式 在构建企业级AI知识助手的实践中&#xff0c;一个常见的困境是&#xff1a;研发团队用几十行Python代码就能跑通RAG流程&#xff0c;但最终交付给业务部门时却只有命令行输出。用户需要的是能直接上传PDF、点击提问、…

作者头像 李华