SQL Notes I

This file records SQL usage I encountered(mainly from LeetCode). Notation outside of code block: keywords in SQL will be in bold, variables will be in Italic.

SELECT

SELECT basic structure:

1
2
3

SELECT something
From some_table
Where requirements

Here requirements support keyword in sometable but not specified in something.

Example:

Write an SQL query to find the ids of products that are both low fat and recyclable

1
2
3

select product_id
from Products
where low_fats = 'Y' and recyclable = 'Y'

If we want to rename something in our result, we can use as keyword.

select name as Customers
from Customers
where id not in 
(
    select customerId
    from Orders
)

Union

Union will allow us to pile two tables with the same colomn together.

Example:

Write an SQL query to calculate the bonus of each employee. The bonus of an employee is 100% of their salary if the ID of the employee is an odd number and the employee name does not start with the character 'M'. The bonus of an employee is 0 otherwise.

(select employee_id , salary as bonus 
from employees 
where employee_id%2 <>0 and name not like 'M%'
)
union
(
select employee_id , 0 as bonus
from employees
where employee_id%2 = 0 or name like 'M%'
)
order by employee_id;

Inside this query, we use order by, which will sort the result based on employee_id.

Update

UPDATE is used to update entities inside a table.

UPDATE salary
SET
    sex = CASE sex
        WHEN 'm' THEN 'f'
        ELSE 'm'
    END;

In this example, CASE is used to update the value of sex bases on its original value.

DELETE

1 2	DELETE p1 FROM Person p1, Person p2 WHERE p1.Email = p2.Email AND p1.Id > p2.Id

This example shows how we can construct an inner comparision requirement within one query. Also it shows how to declear variable p1, p2 inside a query.

The above query can also be achieved by using GROUP BY keyword.

1 2	DELETE FROM Person WHERE Id NOT IN (SELECT * FROM(SELECT MIN(Id) FROM Person GROUP BY Email) as p);

GROUP_CONCAT

GROUP_CONCAT is an aggregate_function used to concat all items with in the same group specified in group by. DISTINCT is used to delete repeated entities given a colomn number.

select sell_date, COUNT( DISTINCT product) as num_sold, GROUP_CONCAT(DISTINCT product ORDER BY product) as products
from activities
group by sell_date
order by sell_date

GROUP BY, HAVING

GROUP BY is used to clasify the table based on attributes. HAVING is then used to specify the filter to GROUP BY. Usually, HAVING is used with aggregate function SUM(), COUNT, etc.

SELECT Department, sum(Salary) as Salary
FROM employee
GROUP BY department
HAVING SUM(Salary) >= 50000;

LIKE

LIKE is used to match strings. It supports '%' or '_' etc.

1
2
3

select patient_id, patient_name, conditions
from Patients
where conditions like 'DIAB1%' or conditions like '% DIAB1%'

WITH

WITH is used to decleare a subquery used later

1	WITH xxx as (subquery)

Then in later queries one can use this xxx as reference. WITH is usually used with partition by, rank(), dense_rank(), rownumber().

JOIN, ON

JOIN is used to include two tabls in one query and ON is used to specify the constraints. There are several JOIN methods: LEFT JOIN, RIGHT JOIN, FULL JOIN, INNER JOIN.

The example below shows useage of WITH, JOIN and ON. Write an SQL query to find the employees who are high earners in each of the departments.

with maxsalary as(
    select *, dense_rank() over(
        partition by departmentId order by salary desc
    )as R from Employee
)
select D.name as 'Department', E.name as 'Employee', E.salary as 'Salary'
from Employee as E join Department as D
on E.departmentId = D.id
where
E.id in (select id from maxsalary as M where M.R <= 3)

LIMIT, OFFSET

LIMIT tells DBSM how much results you finally want. OFFSET tells DBSM where you want to start to search. This is generally used when you are trying to query a range of data within a sorted table.

CASE

CASE is used to dynamic deal with cases. For example, CASE can be used to switch attributes between different rows in a table. Also, it can be used to deal with different cases.

select id, 
    case
        when T.p_id is null then 'Root'
        when T.id in (select p_id from Tree) then 'Inner'
        else 'Leaf'
    end as type
from Tree as T

Execution Order

FROM -> ON -> JOIN -> WHERE -> GROUP BY -> WITH CUBE or WITH ROLLUP -> HAVING -> SELECT -> DISTINCT -> ORDER BY -> TOP