You just do raw byte copies from sample DB, no SQL or "inserts" or anything simi...

tyingq · on July 18, 2021

Do you mean crafting all the various database page btree structures and entries yourself? I'd be concerned about subtle bugs.

ndepoel · on July 18, 2021

An SQLite database is just a file. You can build the empty database with schema and base values ahead of time, save it to a file (or an in-memory byte buffer) and then every time you want to create a new database, you just copy that file. No need to do any expensive initialization queries that way. If raw high-speed throughput is needed, skipping that step can make a significant difference.

tyingq · on July 18, 2021

Yes, that approach makes sense. I thought what I was replying to was suggesting writing b-tree pages themselves, outside of sqlite, for the new data.

emmelaich · on July 19, 2021

I guess if you wanted the fastest creation you could make a custom backend format for sqlite and use that. Especially if query speed was not important.

geofft · on July 19, 2021

Assuming SQLite has any internal counters or indexes, let alone B-trees or anything fancy, then this approach won't work. It will work for a raw record format (CSV, JSON, Protobuf, an mmapped array, etc.), but the author wants to actually interact with a real SQLite database. Generating a billion rows in some non-SQLite format still leaves problem to converting that format by loading it into SQLite, which isn't really a reduction of the original problem.

eismcc · on July 18, 2021

This idea is mentioned as one of the future work at the bottom.