Ciro Santilli (@cirosantilli) - undefined articles

SQL histogram  Updated 2025-05-23  +Created 1970-01-01

OK, there's a billion questions:

SQL Server
- stackoverflow.com/questions/485409/generating-a-histogram-from-column-values-in-a-database OP did not know the difference between count and histogram :-) But it's the number one Google result.
- stackoverflow.com/questions/19103991/create-range-bins-from-sql-server-table-for-histograms has a minor extra group by twist, but otherwise fine
- stackoverflow.com/questions/16268441/generate-histogram-in-sql-server
SQLite
- stackoverflow.com/questions/67514208/how-to-optimise-creating-histogram-bins-in-sqlite perf only, benchmarking would be needed. SQLite.
- stackoverflow.com/questions/32155449/create-a-histogram-with-a-dynamic-number-of-partitions-in-sqlite variable bin size, same number of entries per bin
- stackoverflow.com/questions/60348109/histogram-for-time-periods-using-sqlite-regular-buckets-1h-wide time
MySQL: stackoverflow.com/questions/1764881/getting-data-for-histogram-plot MySQL appears to extend ROUND to also round by integers: ROUND(numeric_value, -2), but this is not widely portable which is a shame
stackoverflow.com/questions/72367652/populating-empty-bins-in-a-histogram-generated-using-sql specifically asks about empty bins, which is amazing. Amazon Redshift dialect unfortunately, but answer provided works widely, and Redshift was forked from PostgreSQL, so there's hope. Those newb open source server focused projects that don't use AGPL!

Let's try it on SQLite 3.40.1, Ubuntu 23.04. Data setup:

sqlite3 tmp.sqlite 'create table t(x integer)'
sqlite3 tmp.sqlite <<EOF
insert into t values (
  0,
  2,
  2,
  3,

  5,
  6,
  6,
  8,
  9,

  17,
)
EOF
sqlite3 tmp.sqlite 'create index tx on t(x)'

For a bin size of 5 ignoring empty ranges we can:

sqlite3 tmp.sqlite <<EOF
select floor(x/5)*5 as x,
       count(*) as cnt
from t
group by 1
order by 1
EOF

which produces the desired:

0|4
5|5
15|1

And to consider empty ranges we can use SQL genenerate_series + as per stackoverflow.com/questions/72367652/populating-empty-bins-in-a-histogram-generated-using-sql:

sqlite3 tmp.sqlite <<EOF
select x, sum(cnt) from (
  select floor(x/5)*5 as x,
         count(*) as cnt
    from t
    group by 1
  union
  select *, 0 as cnt from generate_series(0, 15, 5)
)
group by x
EOF

which outputs the desired:

0|4
5|5
10|0
15|1

 Read the full article

sqlite3 Node.js package  Updated 2025-05-23  +Created 1970-01-01

 View more

Includes its own copy of sqlite3, you don't use the system one, which is good to ensure compatibility. The version is shown at: github.com/mapbox/node-sqlite3/blob/918052b538b0effe6c4a44c74a16b2749c08a0d2/deps/common-sqlite.gypi#L3 SQLite source is tracked compressed in-tree: github.com/mapbox/node-sqlite3/blob/918052b538b0effe6c4a44c74a16b2749c08a0d2/deps/sqlite-autoconf-3360000.tar.gz horrendous. This explains why it takes forever to clone that repository. People who don't believe in git submodules, there's even an official Git mirror at: github.com/sqlite/sqlite

It appears to spawn its own threads via its C extension (since JavaScript is single threaded and and SQLite is not server-based), which allows for parallel queries using multiple threads: github.com/mapbox/node-sqlite3/blob/v5.0.2/src/threading.h

Hello world example: nodejs/node-sqlite3/index.js.

As of 2021, this had slumped back a bit, as maintainers got tired. Unmerged pull requests started piling more, and better-sqlite3 Node.js package started pulling ahead a little.

github.com/mapbox/node-sqlite3/issues/1381 FATAL ERROR: Error::ThrowAsJavaScriptException napi_throw with Node.js worker_threads vs better-sqlite3 Node.js package github.com/JoshuaWise/better-sqlite3/issues/237

 Read the full article

SQL RECURSIVE prevent infinite recursion  Updated 2025-05-23  +Created 1970-01-01

 View more

Example under: nodejs/sequelize/raw/tree.js

 Read the full article

SQL TRIGGER  Updated 2025-05-23  +Created 1970-01-01

 View more

SQL's implementation of database triggers.

This feature is really cool, as it allows you to keep caches up to date!

In particular, everything that happens in a trigger happens as if it were in a transaction. This way, you can do less explicit transactions when you use triggers. It is a bit like the advantages of SQL CASCADE.

DBMS:

ORM:

Sequelize: SQL TRIGGER in Sequelize

 Read the full article

SQL window RANGE  Updated 2025-05-23  +Created 1970-01-01

 View more

SQLite: www.sqlite.org/windowfunctions.html#exprrange

rm -f tmp.sqlite
sqlite3 tmp.sqlite "create table t (id integer, val integer)"
sqlite3 tmp.sqlite <<EOF
insert into t values
  (0, 0),
  (1, 5),
  (2, 10),
  (3, 14),
  (4, 15),
  (5, 16),
  (6, 20),
  (7, 25),
  (8, 29),
  (9, 30),
  (10, 30),
  (11, 31),
  (12, 35),
  (13, 40)
EOF

Show how many neighbours each column has with val between val - 2 and val + 2 inclusive:

sqlite3 tmp.sqlite <<EOF
SELECT id, val, COUNT(*) OVER (
  ORDER BY val RANGE BETWEEN 2 PRECEDING AND 2 FOLLOWING
) FROM t;
EOF

Output:

0|0|1
1|5|1
2|10|1
3|14|3
4|15|3
5|16|3
6|20|1
7|25|1
8|29|4
9|30|4
10|30|4
11|31|4
12|35|1
13|40|1

val - 1 and val + 1 inclusive instead:

sqlite3 tmp.sqlite <<EOF
SELECT id, val, COUNT(*) OVER (
  ORDER BY val RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING
) FROM t;
EOF

Output:

0|0|1
1|5|1
2|10|1
3|14|2
4|15|3
5|16|2
6|20|1
7|25|1
8|29|3
9|30|4
10|30|4
11|31|3
12|35|1
13|40|1

There seems to be no analogue to HAVING for window functions, so we can just settle for a subquery for once, e.g.:

sqlite3 tmp.sqlite <<EOF
SELECT * FROM (
  SELECT id, val, COUNT(*) OVER (
    ORDER BY val RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING
  ) as c FROM t
) WHERE c > 2
EOF

which outputs:

4|15|3
8|29|3
9|30|4
10|30|4
11|31|3

 Read the full article

The horrors of Sequelize  Updated 2025-05-23  +Created 1970-01-01

 View more

foreign keys are capitalized:
- stackoverflow.com/questions/55273091/why-use-the-uppercase-key-when-creating-association-model-in-sequelize
- github.com/sequelize/sequelize/issues/5828
you must give foreignKey when using aliases, otherwise it fails subtely. That would be derived automatically.
stackoverflow.com/questions/41502699/return-flat-object-from-sequelize-with-association can't auto-flatten to reuse the database's ORDER
limit and offset don't work without subQuery: false when doing includes! It is just too buggy. Examples of this can be found e.g. under nodejs/sequelize/many_to_many_same_model.js.
stackoverflow.com/questions/34059081/how-do-i-reference-an-association-when-creating-a-row-in-sequelize-without-assum hard to not duplicate foreign keys values everywhere
stack traces permanently broken or requiring non-obvious configs:
does not automatically update fields on hooks: github.com/sequelize/sequelize/issues/8586#issuecomment-422877555
cannot change columns when other columns have constraints due to the backup table?
- stackoverflow.com/questions/64533732/process-of-changing-a-table-with-sequelize-migration-if-foreign-key-constraint-i
you have to use .get() for attribute aliased fields, why? stackoverflow.com/questions/32649218/how-do-i-select-a-column-using-an-alias/69890944#69890944
.id gets added to SELECT no matter what, breaking GROUP BY unless you do horrible workarounds:
- github.com/sequelize/sequelize/issues/3256
- github.com/sequelize/sequelize/issues/5481
no simple built-in mechanism for transaction retries: Sequelize transaction retries
impossible to do subqueries in general. Docs just tell you to use literals. This in particular prevents single query deletes with join as done at nodejs/sequelize/raw/many_to_many.js:
- sequelize.org/master/manual/sub-queries.html: the docs actually just tell you to use literals, lol
- stackoverflow.com/questions/45354001/nodejs-sequelize-delete-with-nested-select-query
Also, you can't get query strings either: github.com/sequelize/sequelize/issues/2325
migrations. Generally speaking, anything but the simplest migrations are exceedingly hard to get right, as you have to go very low level when doing migrations. Syntax can be very different from regular DB operations.
- no way to do (non-raw) queries during migrations, e.g. to update fields based on other fields in a complex way?
  github.com/sequelize/cli/issues/862
  stackoverflow.com/questions/18742962/add-data-in-sequelize-migration-script
  stackoverflow.com/questions/38671483/sequelize-migration-update-model-after-updating-column-attributes
  stackoverflow.com/questions/38998397/can-i-use-sequelize-models-in-migration-scripts
  stackoverflow.com/questions/45286429/custom-query-on-sequelize-seeder
  queryInterface.sequelize.models contains only SequelizeMeta. Not sure why they have this limitation.
  Edit: actually things will likely just work if immediately after making table changes you just instantiate a new sequelize and do any data changes.
- stackoverflow.com/questions/56043246/node-js-sequelize-no-primary-keys-when-migrating/56046101#56046101
- SQLite changeColumn migrations do on delete cascades of other tables. SQLite does not have change column statements, so they have to drop and recreate tables, but they don't temporarily remove cascades, so you lose data: stackoverflow.com/questions/62667269/sequelize-js-how-do-we-change-column-type-in-migration/70486686#70486686
- associations require full explicit index construction: stackoverflow.com/questions/39651853/how-to-create-join-table-with-foreign-keys-with-sequelize-or-sequelize-cli
ability to iterate over a large result without blowing up memory and without using limit + offset (which is inneficient e.g. when looping over recursive queries). This is also known as cursor or streaming interfaces:
- stack overflow
- issue tracker
E.g. the Python SQLite interface supports this just fine: stackoverflow.com/questions/29582736/python3-is-there-a-way-to-iterate-row-by-row-over-a-very-large-sqlite-table-wi
empty attributes: [] breaks some nested queries: github.com/sequelize/sequelize/issues/16436
does not expose a iteration API that supports large arrays?
- github.com/sequelize/sequelize/issues/8550
- stackoverflow.com/questions/57164242/perform-sequelize-findall-in-a-huge-array
E.g. Python SQLite does: stackoverflow.com/questions/29582736/python3-is-there-a-way-to-iterate-row-by-row-over-a-very-large-sqlite-table-wi

 Read the full article

The most awesome systems programmers  Updated 2025-05-23  +Created 1970-01-01

 View more

Notable mentions:

Tom Tromey from Red Hat: www.youtube.com/watch?v=RwDA3oIOtWw Dude's a GDB God! He might be gay from that talk.

Other notable people that are likely also awesome but Ciro has less familiarity with their contributions:

Dwayne Richard Hipp from SQLite
Daniel Stenberg from cURL
Michael Niedermayer also from FFmpeg. ikaruga.co.uk/~snacky/mn.html highlights his brutal directness and efficiency, and sometimes sense of humour

 Read the full article

UNION (SQL)  Updated 2025-05-23  +Created 1970-01-01

 View more

Basic example tested on SQLite 3.40.1, Ubuntu 23.04:

sqlite3 :memory: 'select 1 union select 2'

output:

1
2

Two columns two rows:

sqlite3 :memory: <<EOF
select * from (values (1, 2), (2, 3))
union
select * from (values (2, 3), (3, 4))
EOF

output:

1|2
2|3
3|4

Note how duplicates are removed, to keep them we UNION ALL instead:

sqlite3 :memory: <<EOF
select * from (values (1, 2), (2, 3))
union all
select * from (values (2, 3), (3, 4))
EOF

output:

1|2
2|3
2|3
3|4

 Read the full article

UPDATE with JOIN (SQL)  Updated 2025-05-23  +Created 1970-01-01

 View more

Dumping examples under nodejs/sequelize/raw/many_to_many.js.

Not possible without subqueries in the standard syntax, a huge shame: stackoverflow.com/questions/1293330/how-can-i-do-an-update-statement-with-join-in-sql-server

The UPDATE + FROM extension exists in a few DBMS s:

ORM:

Sequelize: UPDATE with JOIN in Sequelize

 Read the full article

Upsert  Updated 2025-05-23  +Created 1970-01-01

 View more

UPSERT is extremely handy, and reduces the number of find, check on server, update loops. But RETURNING is a fundamental part of that (to get the updated/existing) ID. Can't believe SQL hasn't standardized it yet as of 2022. But both SQLite and Postgres support it with similar syntax thankfully.

nodejs/sequelize/raw/upsert.js

 Read the full article

Ciro Santilli @cirosantilli 37

 Incoming links: SQLite