minisql

Published: Jan 18, 2026 License: Apache-2.0 Imports: 15 Imported by: 0

MiniSQL is an embedded single-file database written in Go, inspired by SQLite (while also borrowing features from other databases). It started as a research project aimed at learning about the internals of relational databases and has grown into its current form over time. It is still an early-stage project: it may contain bugs and is not battle tested, so use it with caution.

Donate Bitcoin

To use minisql in your Go code, import the driver:

import (
  _ "github.com/RichardKnop/minisql"
)

And create a database instance:

// Simple path
db, err := sql.Open("minisql", "./my.db")

// With connection parameters
db, err := sql.Open("minisql", "./my.db?journal=false")

// Multiple parameters
db, err := sql.Open("minisql", "./my.db?journal=true&log_level=debug")

Connection Pooling

MiniSQL is an embedded, single-file database (like SQLite). Always configure your connection pool to use a single connection and serialize all writes through it.

db.SetMaxOpenConns(1)
db.SetMaxIdleConns(1)

Why? Multiple connections to the same file cause:

  • Lock contention at the OS level
  • Connection pool overhead (97% lock time in benchmarks)
  • No performance benefit (writes are serialized anyway)

This is the same recommendation as for SQLite.

Connection String Parameters

MiniSQL supports optional connection string parameters:

 Parameter        | Values                   | Default | Description
------------------+--------------------------+---------+-----------------------------------------------------
 journal          | true, false              | true    | Enable/disable rollback journal for crash recovery
 log_level        | debug, info, warn, error | warn    | Set logging verbosity level
 max_cached_pages | positive integer         | 2000    | Maximum number of pages to keep in the memory cache

Examples:

// Disable journaling for better performance (no crash recovery)
db, err := sql.Open("minisql", "./my.db?journal=false")

// Enable debug logging
db, err := sql.Open("minisql", "./my.db?log_level=debug")

// Set cache size to 500 pages (~2MB memory)
db, err := sql.Open("minisql", "./my.db?max_cached_pages=500")

// Combine multiple parameters
db, err := sql.Open("minisql", "/path/to/db.db?journal=true&log_level=info&max_cached_pages=2000")

Note: Disabling journaling (journal=false) improves performance but removes crash recovery protection. If the application crashes during a transaction commit, the database may become corrupted.

Write-Ahead Rollback Journal

MiniSQL uses a rollback journal to achieve atomic commit and rollback. Before committing a transaction, a journal file with a -journal suffix is created in the same directory as the database file. It contains the original state of the pages (and of the database header, if applicable) from before the transaction began. If an error is encountered while flushing the pages changed by the transaction to disk, the database quits and recovers the pre-transaction state from the journal file.

Storage

Each page is 4096 bytes. Rows larger than a page are not supported, so the largest allowed row size is 4065 bytes (see the breakdown below). The only exception is root page 0, whose first 100 bytes are reserved for configuration.

4096 (page size) 
- 7 (base header size) 
- 8 (internal / leaf node header size) 
- 8 (null bit mask) 
- 8 (internal row ID / key) 
= 4065

All tables are tracked via a system table, minisql_schema, which stores the table name, the CREATE TABLE SQL documenting the table structure, and a root page index indicating which page contains the root node of the table's B+ tree.

Each row has an internal row ID, an unsigned 64-bit integer starting at 0. These IDs are used as keys in the B+ tree data structure.

Moreover, each row starts with a 64-bit NULL mask that records which values are NULL. Because the mask is an unsigned 64-bit integer, tables are limited to a maximum of 64 columns.

Concurrency

MiniSQL uses Optimistic Concurrency Control (OCC), which is close to PostgreSQL's SERIALIZABLE isolation level. The transaction manager follows a simple process:

  1. Track read versions - Record which version of each page was read
  2. Check at commit time - Verify no pages were modified during the transaction
  3. Abort on conflict - If a page changed, abort with ErrTxConflict

You can check for ErrTxConflict to decide whether to retry after a serialization conflict or to return the error.

SQLite uses snapshot isolation with MVCC (Multi-Version Concurrency Control). Read how SQLite handles isolation. I have chosen a basic OCC model for now because of its simplicity.

An example of snapshot isolation:

Time 0: Read TX1 starts, sees version V1
Time 1: Write TX2 modifies page and commits → creates version V2
Time 2: TX1 continues reading, still sees V1 (not V2!)
Time 3: TX1 completes successfully

System Table

All tables and indexes are tracked in the system table minisql_schema. An empty database contains only the system table's own reference:

 type   | name               | table_name         | root_page   | sql                                                
--------+--------------------+--------------------+-------------+----------------------------------------
 1      | minisql_schema     |                    | 0           | create table "minisql_schema" (        
        |                    |                    |             | 	type int4 not null,                  
        |                    |                    |             | 	name varchar(255) not null,          
        |                    |                    |             | 	table_name varchar(255),             
        |                    |                    |             | 	root_page int4,                      
        |                    |                    |             | 	sql text                             
        |                    |                    |             | )                                      

Let's say you create a table such as:

create table "users" (
	id int8 primary key autoincrement,
	email varchar(255) unique,
	name text,
	age int4,
	created timestamp default now()
);
create index "idx_created" on "users" (
	created
);

The new table is added to the system table, along with its primary key and any unique indexes. The secondary index on the created TIMESTAMP column, created separately, is also added.

You can inspect the current objects in the minisql_schema system table with a simple SELECT query:

// type schema struct {
// 	Type      int
// 	Name      string
// 	TableName *string
// 	RootPage  int
// 	SQL       *string
// }

rows, err := db.QueryContext(context.Background(), `select * from minisql_schema;`)
if err != nil {
	return err
}
defer rows.Close()

var schemas []schema
for rows.Next() {
	var aSchema schema
	if err := rows.Scan(&aSchema.Type, &aSchema.Name, &aSchema.TableName, &aSchema.RootPage, &aSchema.SQL); err != nil {
		return err
	}
	schemas = append(schemas, aSchema)
}
if err := rows.Err(); err != nil {
	return err
}
 type   | name               | table_name         | root_page   | sql                                                
--------+--------------------+--------------------+-------------+----------------------------------------
 1      | minisql_schema     |                    | 0           | create table "minisql_schema" (        
        |                    |                    |             | 	type int4 not null,                  
        |                    |                    |             | 	name varchar(255) not null,          
        |                    |                    |             | 	table_name varchar(255),             
        |                    |                    |             | 	root_page int4,                      
        |                    |                    |             | 	sql text                             
        |                    |                    |             | )                                      
 1      | users              |                    | 1           | create table "users" (                 
        |                    |                    |             | 	id int8 primary key autoincrement,   
        |                    |                    |             | 	email varchar(255) unique,           
        |                    |                    |             | 	name text,                           
        |                    |                    |             | 	age int4,                            
        |                    |                    |             | 	created timestamp default now()      
        |                    |                    |             | );                                     
 2      | pkey__users        | users              | 2           | NULL                                   
 3      | key__users_email   | users              | 3           | NULL                                   
 4      | idx_created        | users              | 4           | create index "idx_created" on "users" (
        |                    |                    |             | 	created                              
        |                    |                    |             | );                                     

Data Types And Storage

 Data type  | Description
------------+----------------------------------------------------------------
 BOOLEAN    | 1-byte boolean value (true/false).
 INT4       | 4-byte signed integer (-2,147,483,648 to 2,147,483,647).
 INT8       | 8-byte signed integer (-9,223,372,036,854,775,808 to 9,223,372,036,854,775,807).
 REAL       | 4-byte single-precision floating-point number.
 DOUBLE     | 8-byte double-precision floating-point number.
 TEXT       | Variable-length text (UTF-8 encoded). If the length is <= 255, the text is stored inline; otherwise it is stored in overflow pages.
 VARCHAR(n) | Stored the same way as TEXT but limits the length of inserted/updated text to n.
 TIMESTAMP  | 8-byte signed integer counting microseconds from 2000-01-01 00:00:00 UTC (the Postgres epoch). Supported range is 4713 BC to 294276 AD inclusive.

Supported SQL Features

  • CREATE TABLE, CREATE TABLE IF NOT EXISTS
  • PRIMARY KEY support, only single column, no composite primary keys
  • AUTOINCREMENT support, primary key must be of type INT8 for autoincrement
  • UNIQUE index can be specified when creating a table
  • composite primary key or unique constraint as part of CREATE TABLE
  • NULL and NOT NULL support (via null bit mask included in each row/cell)
  • DEFAULT support for all columns including NOW() for TIMESTAMP
  • DROP TABLE
  • CREATE INDEX, DROP INDEX - only for secondary non unique indexes (primary and unique can be declared as part of CREATE TABLE)
  • INSERT (single row or multiple rows via comma-separated value tuples)
  • SELECT (all fields with *, only specific fields or count rows with COUNT(*))
  • UPDATE
  • DELETE
  • simple WHERE conditions with AND and OR; no support for more complex nested conditions using parentheses
  • supported operators: =, !=, >, >=, <, <=, IN, NOT IN
  • LIMIT and OFFSET clauses for basic pagination
  • ORDER BY supported for a single column only

For WHERE clauses, a maximum of one level of nesting is currently supported. You can define multiple groups where the items within each group are joined with AND and the groups themselves are joined with OR. For example, you could create two condition groups such as:

(a = 1 and b = 'foo') or (c is null and d in ('bar', 'qux'))

Prepared statements are supported using ? as a placeholder. For example:

insert into users("name", "email") values(?, ?), (?, ?);

Planned features:

  • date/time functions to make working with TIMESTAMP type easier
  • joins such as INNER, LEFT, RIGHT
  • foreign keys
  • support GROUP BY and aggregation functions such as MAX, MIN, SUM
  • UPDATE from a SELECT
  • upsert (insert on conflict)
  • rollback journal file
  • more complex WHERE clauses
  • support altering tables, creating and dropping of indexes outside of create table query
  • more sophisticated query planner
  • vacuuming
  • benchmarks

DDL SQL Commands

CREATE TABLE

Let's start by creating your first table:

_, err := db.Exec(`create table "users" (
	id int8 primary key autoincrement,
	email varchar(255) unique,
	name text,
	age int4,
	created timestamp default now()
);`)

DROP TABLE

_, err := db.Exec(`drop table "users";`)

CREATE INDEX

Currently you can only create secondary, non-unique indexes. Unique and primary indexes can be created as part of CREATE TABLE.

_, err := db.Exec(`create index "idx_created" on "users" (created);`)

DROP INDEX

Currently you can only drop secondary, non-unique indexes.

_, err := db.Exec(`drop index "idx_created";`)

DML Commands

INSERT

Insert test rows:

tx, err := db.Begin()
if err != nil {
	return err
}
aResult, err := tx.ExecContext(context.Background(), `insert into users("email", "name", "age") values('[email protected]', 'Danny Mason', 35),
('[email protected]', 'Johnathan Walker', 32),
('[email protected]', 'Tyson Weldon', 27),
('[email protected]', 'Mason Callan', 19),
('[email protected]', 'Logan Flynn', 42),
('[email protected]', 'Beatrice Uttley', 32),
('[email protected]', 'Harry Johnson', 25),
('[email protected]', 'Carl Thomson', 53),
('[email protected]', 'Kaylee Johnson', 48),
('[email protected]', 'Cristal Duvall', 27);`)
if err != nil {
	return err
}
rowsAffected, err := aResult.RowsAffected()
if err != nil {
	return err
}
// rowsAffected = 10
if err := tx.Commit(); err != nil {
	if errors.Is(err, minisql.ErrTxConflict) {
		// transaction conflict, you might want to retry here
	}
	return err
}

When trying to insert a duplicate primary key, you will get an error:

_, err := db.ExecContext(context.Background(), `insert into users("id", "name", "email", "age") values(1, 'Danny Mason', '[email protected]', 35);`)
if err != nil {
	if errors.Is(err, minisql.ErrDuplicateKey) {
		// handle duplicate primary key
	}
	return err
}

SELECT

Selecting from the table:

// type user struct {
// 	ID      int64
// 	Email   string
// 	Name    string
// 	Age     int32
// 	Created time.Time
// }

rows, err := db.QueryContext(context.Background(), `select * from users;`)
if err != nil {
	return err
}
defer rows.Close()
var users []user
for rows.Next() {
	var aUser user
	err := rows.Scan(&aUser.ID, &aUser.Email, &aUser.Name, &aUser.Age, &aUser.Created)
	if err != nil {
		return err
	}
	users = append(users, aUser)
}
if err := rows.Err(); err != nil {
	return err
}
// continue

Table should have 10 rows now:

 id     | email                            | name                    | age    | created                       
--------+----------------------------------+-------------------------+--------+-------------------------------
 1      | [email protected]       | Danny Mason             | 35     | 2025-12-21 22:31:35.514831    
 2      | [email protected]   | Johnathan Walker        | 32     | 2025-12-21 22:31:35.514831    
 3      | [email protected]     | Tyson Weldon            | 27     | 2025-12-21 22:31:35.514831    
 4      | [email protected]       | Mason Callan            | 19     | 2025-12-21 22:31:35.514831    
 5      | [email protected]        | Logan Flynn             | 42     | 2025-12-21 22:31:35.514831    
 6      | [email protected]    | Beatrice Uttley         | 32     | 2025-12-21 22:31:35.514831    
 7      | [email protected]    | Harry Johnson           | 25     | 2025-12-21 22:31:35.514831    
 8      | [email protected]      | Carl Thomson            | 53     | 2025-12-21 22:31:35.514831    
 9      | [email protected]  | Kaylee Johnson          | 48     | 2025-12-21 22:31:35.514831    
 10     | [email protected]   | Cristal Duvall          | 27     | 2025-12-21 22:31:35.514831    

You can also count rows in a table:

var count int
if err := db.QueryRow(`select count(*) from users;`).Scan(&count); err != nil {
	return err
}

UPDATE

Let's try using a prepared statement to update a row:

stmt, err := db.Prepare(`update users set age = ? where id = ?;`)
if err != nil {
	return err
}
aResult, err := stmt.Exec(int64(36), int64(1))
if err != nil {
	return err
}
rowsAffected, err := aResult.RowsAffected()
if err != nil {
	return err
}
// rowsAffected = 1

Select to verify update:

 id     | email                            | name                    | age    | created                       
--------+----------------------------------+-------------------------+--------+-------------------------------
 1      | [email protected]       | Danny Mason             | 36     | 2025-12-21 22:31:35.514831    

DELETE

You can also delete rows:

aResult, err := db.ExecContext(context.Background(), `delete from users;`)
if err != nil {
	return err
}
rowsAffected, err := aResult.RowsAffected()
if err != nil {
	return err
}

Development

MiniSQL uses mockery to generate mocks for interfaces. Install mockery:

go install github.com/vektra/mockery/[email protected]

Then to generate mocks:

mockery

To run unit tests:

LOG_LEVEL=info go test ./... -count=1

Benchmarking

Some benchmarking commands I have used just for my own reference.

go test -bench=BenchmarkPageAccess -benchtime=100000x ./internal/minisql 2>&1 | grep -A 20 "Benchmark"
go test -bench=BenchmarkRow -benchmem ./internal/minisql 2>&1 | grep -E "(Benchmark|B/op)"
go test -bench=BenchmarkFlush -benchmem -benchtime=5s ./internal/minisql 2>&1 | grep -E "(Benchmark|B/op)"
go test -bench=BenchmarkFlush -benchmem -benchtime=3s ./internal/minisql 2>&1 | grep -E "(Benchmark|B/op)"
go test -bench=BenchmarkPageAccess -benchmem ./internal/minisql 2>&1 | grep -E "Benchmark|alloc"

# CPU profile concurrent workload
go test -cpuprofile=cpu.prof -bench=BenchmarkConcurrent -benchtime=10s ./e2e_tests
go tool pprof -top cpu.prof | head -30

# When CPU profile is dominated by runtime scheduling overhead, look at specific database operations
go tool pprof -cum -top cpu_reads.prof | grep "minisql" | head -20

# Memory profile
go test -memprofile=mem.prof -bench=BenchmarkConcurrent -benchtime=10s ./e2e_tests  
go tool pprof -alloc_space -top mem.prof | head -30

# Mutex contention
go test -mutexprofile=mutex.prof -bench=BenchmarkConcurrent -benchtime=10s ./e2e_tests
go tool pprof -top mutex.prof | head -25

Acknowledgements

Shout out to some great repos and other resources that were invaluable while figuring out how to get this all working together:

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type Conn

type Conn struct {
	// contains filtered or unexported fields
}

Conn implements the database/sql/driver.Conn interface.

func (*Conn) Begin deprecated

func (c *Conn) Begin() (driver.Tx, error)

Begin starts and returns a new transaction.

Deprecated: Drivers should implement ConnBeginTx instead (or additionally).

func (*Conn) BeginTx

func (c *Conn) BeginTx(ctx context.Context, opts driver.TxOptions) (driver.Tx, error)

BeginTx starts and returns a new transaction.

func (*Conn) Close

func (c *Conn) Close() error

Close invalidates and potentially stops any current prepared statements and transactions, marking this connection as no longer in use.

Because the sql package maintains a free pool of connections and only calls Close when there's a surplus of idle connections, it shouldn't be necessary for drivers to do their own connection caching.

Drivers must ensure all network calls made by Close do not block indefinitely (e.g. apply a timeout).

func (*Conn) ExecContext

func (c *Conn) ExecContext(ctx context.Context, query string, args []driver.NamedValue) (driver.Result, error)

ExecContext executes a query that doesn't return rows.

func (*Conn) HasActiveTransaction

func (c *Conn) HasActiveTransaction() bool

func (*Conn) Ping

func (c *Conn) Ping(ctx context.Context) error

func (*Conn) Prepare

func (c *Conn) Prepare(query string) (driver.Stmt, error)

Prepare returns a prepared statement, bound to this connection.

func (*Conn) PrepareContext

func (c *Conn) PrepareContext(ctx context.Context, query string) (driver.Stmt, error)

PrepareContext returns a prepared statement, bound to this connection. context is for the preparation of the statement, it must not store the context within the statement itself.

func (*Conn) QueryContext

func (c *Conn) QueryContext(ctx context.Context, query string, args []driver.NamedValue) (driver.Rows, error)

QueryContext executes a query that may return rows.

func (*Conn) SetTransaction

func (c *Conn) SetTransaction(tx *minisql.Transaction)

func (*Conn) TransactionContext

func (c *Conn) TransactionContext(ctx context.Context) context.Context

type ConnectionConfig

type ConnectionConfig struct {
	FilePath       string // Database file path
	JournalEnabled bool   // Enable/disable rollback journal (default: true)
	LogLevel       string // Log level: debug, info, warn, error (default: warn)
	MaxCachedPages int    // Maximum number of pages to cache (default: 1000, 0 = use default)
}

ConnectionConfig holds parsed connection string parameters

func DefaultConnectionConfig

func DefaultConnectionConfig(filePath string) *ConnectionConfig

DefaultConnectionConfig returns default configuration

func ParseConnectionString

func ParseConnectionString(connStr string) (*ConnectionConfig, error)

ParseConnectionString parses a connection string with optional query parameters.

Format: /path/to/database.db?param1=value1&param2=value2

Supported parameters:

  • journal=true|false : Enable/disable rollback journal (default: true)
  • log_level=debug|info|warn|error : Set logging level (default: warn)

Examples:

  • "./my.db" : Default settings
  • "./my.db?journal=false" : Disable journaling
  • "./my.db?log_level=debug" : Enable debug logging
  • "./my.db?journal=true&log_level=info" : Both settings

func (*ConnectionConfig) GetZapLevel

func (c *ConnectionConfig) GetZapLevel() zap.AtomicLevel

GetZapLevel converts log level string to zap.Level

type Driver

type Driver struct {
	// contains filtered or unexported fields
}

Driver implements the database/sql/driver.Driver interface.

func (*Driver) Open

func (d *Driver) Open(name string) (driver.Conn, error)

Open returns a new connection to the database. The name is a connection string with optional parameters:

  • "./my.db" - simple path
  • "./my.db?journal=false" - disable journaling
  • "./my.db?log_level=debug" - enable debug logging
  • "./my.db?journal=true&log_level=info" - multiple parameters

type Result

type Result struct {
	// contains filtered or unexported fields
}

func (Result) LastInsertId

func (r Result) LastInsertId() (int64, error)

LastInsertId returns the database's auto-generated ID after, for example, an INSERT into a table with primary key.

func (Result) RowsAffected

func (r Result) RowsAffected() (int64, error)

RowsAffected returns the number of rows affected by the query.

type Rows

type Rows struct {
	// contains filtered or unexported fields
}

func (Rows) Close

func (r Rows) Close() error

Close closes the rows iterator.

func (Rows) Columns

func (r Rows) Columns() []string

Columns returns the names of the columns. The number of columns of the result is inferred from the length of the slice. If a particular column name isn't known, an empty string should be returned for that entry.

func (Rows) Next

func (r Rows) Next(dest []driver.Value) error

Next is called to populate the next row of data into the provided slice. The provided slice will be the same size as the Columns() are wide.

Next should return io.EOF when there are no more rows.

The dest should not be written to outside of Next. Care should be taken when closing Rows not to modify a buffer held in dest.

type Stmt

type Stmt struct {
	// contains filtered or unexported fields
}

func (Stmt) Close

func (s Stmt) Close() error

Close closes the statement.

As of Go 1.1, a Stmt will not be closed if it's in use by any queries.

Drivers must ensure all network calls made by Close do not block indefinitely (e.g. apply a timeout).

func (Stmt) Exec deprecated

func (s Stmt) Exec(args []driver.Value) (driver.Result, error)

Exec executes a query that doesn't return rows, such as an INSERT or UPDATE.

Deprecated: Drivers should implement StmtExecContext instead (or additionally).

func (Stmt) ExecContext

func (s Stmt) ExecContext(ctx context.Context, args []driver.NamedValue) (driver.Result, error)

func (Stmt) NumInput

func (s Stmt) NumInput() int

NumInput returns the number of placeholder parameters.

If NumInput returns >= 0, the sql package will sanity check argument counts from callers and return errors to the caller before the statement's Exec or Query methods are called.

NumInput may also return -1, if the driver doesn't know its number of placeholders. In that case, the sql package will not sanity check Exec or Query argument counts.

func (Stmt) Query deprecated

func (s Stmt) Query(args []driver.Value) (driver.Rows, error)

Query executes a query that may return rows, such as a SELECT.

Deprecated: Drivers should implement StmtQueryContext instead (or additionally).

func (Stmt) QueryContext

func (s Stmt) QueryContext(ctx context.Context, args []driver.NamedValue) (driver.Rows, error)

type Tx

type Tx struct {
	// contains filtered or unexported fields
}

func (Tx) Commit

func (tx Tx) Commit() error

func (Tx) Rollback

func (tx Tx) Rollback() error

Directories

Path Synopsis
internal
pkg
