How CKB Works
To understand CKB, you need to understand BTC first. If you understand BTC, you already get 80% knowledge about CKB since CKB inhrerintes most of its design and philosiphy from Bitcoin. In that case, you can jump direct to the rest 20% difference.
If you don't, just keep reading and take 1 minute to understand the UTXO, the essentials of Bitcoin.
Understand UTXO
BTC works just like cash. To tell how much money you have, you simply count how much cash are in your pocket.
In BTC world, such a cash is called UTXO. It looks more like an magic box instead of a piece of paper. Each box(UTXO) can carry a piece of codes that can only be unlocked by some pre-defined conditions.
You can image this piece of codes as a lock hanging on the box. If you have the keys to unlock the lock, then the box is yours and you own the boxs. And just like the piece paper of cash, each box records the money denomination on its surface so you can know how much money the box(UTXO) represents.
There can be small money box and large money box. You can take some large boxes and break it into multiple small boxes, just like exchange the cash.
// picture of utxo boxes
Now you understand the 80% of CKB, the rest 20% difference is that CKB uses generalized UTXOs called Cell. Cell is the basic unit of CKB, like UTXOs to BTC.
But what does generalized UTXOs really mean?
Generalized UTXOs
Let's go back to the little box analogy.
UTXOs are boxes that carry a lock made up by codes with money denomination recorded on its surface.
The cells are boxes too, but more powerful ones:
The box has storage space
You can't do much things with the BTC boxes(UTXOs) since each box have no extra space and it can only use to record number data that presents the denomination of money.
However, the boxes of CKB(Cells) have dynamic storage spaces and the large denomination the box is, the large space of the box has. 1 CKB equals 1 Byte of storage. If you have 50k CKB tokens, then you got 50k bytes of on-chain storage space.
You can put anything you want into the box as long as you have the space to hold it. The data type is also not limited, any type of data works, as long as you can understand and interpret those data by yourself.
The box can carry powerful codes
The lock of BTC boxes(UTXOs) can only use very simple and limited codes. You can only do certain things with the codes so you end up with certain types of locks in BTC.
On the other hand, The lock of CKB boxes(Cells) can use complex codes just like the codes running in your computer. It is the difference of limited scripts and turing-complete scripts.
The box has two locks
BTC only have one lock to guard the ownership of the box(UTXO) while CKB can have two locks for one box(Cell). The first lock is called lock script, used to guard ownership just like BTC. The second lock is called type script, used to determine how boxes can be spent and updated in the future.
Some people are trying to do things like type script to give BTC the ability to limit and determine how the UTXO can be spent and updated in the future transaction, it is called covenant. However, CKB has such ability from the very first beginning due to the design of it. This also makes CKB the ideal layer 2 for Bitcoin since they share the same ideololegy but CKB has more powerful programability.
Good! We have master the most important ideas of CKB.
Now let's get to meet its real face.
Data structure of CKB
The entire cell data structure looks like this:
Cell: {
capacity: HexString; # represent the total storage space size of the cell. The basic unit for capcaity is shannon, 1 CKB equals 10**8 shannon.
lock: Script; // a piece of codes
type: Script; // a piece of codes
data: HexString; // this field can store arbitrary bytes, which means any type of data
}
Note:A cell's total size for all four fields above must be less than or equal to the capacity of the cell. As shown below
capacity = Cell's total space >= Sum of the 4 fields' byte length
A script‘s structure looks like this:
Script: {
code_hash: HexString
args: HexString
hash_type: Uint8, there are 4 allowed values: {0: "data", 1: "type", 2: "data1"i, 3: "data2"}
}
You may notice that the code_hash is not the actual code, but some kind of index of the code. This index allows us to retrieve the code. So, where is the code anyway?
The answer is simple: the code is stored in another cell!
We know that the data field of a cell can contain arbitrary data, so we can put the real code in the data field of another cell and implement this cell as a dependency to a transaction. This dependency cell is called CellDep.
Depending on the value of hash_type, code_hash has different interpretations:
- If hash_type is "data", "data1" or "data2", code_hash should match
blake2b_ckbhash(data)
of a dep cell; - If hash_type is "type", code_hash should instead match
blake2b_ckbhash(type script)
of a dep cell.
Please keep in mind that the code_hash and hash_type fields are used to locate the code. When unlocking a cell, a transaction simply imports the dep cell, and CKB will follow the rules above to find and execute the corresponding code.
So why not just put in the real code, but use this indexing approach?
One of the major advantages of this design is that if everyone needs the same type of lock, the lock code will be identical, and so will the code_hash value. Then it is just a matter of introducing the same dep cell rather than deploying the same code all over again for each case.
What Is A Transaction?
A transaction in CKB is just an action to destroy some cells and create some more.
inputs:
some cells...
|
|
|
\/
outputs:
some new cells...
The cells in the inputs must all be live cells. The input cells will be spent and become dead cells after a transaction is committed. The newly created output cells will become new live cells.
When the transaction is submit on-chain, CKB will run all the scripts in the cells of that transaction to see if all the script runs successfully.
CKB will also ensure that the capacity summary of all the output cells must be less than the capacity summary of all the input cells, which means that a transaction cannot mint capacities from the air. The difference between the output capacities and input capacities is the transaction fee for miners.
Note: In practice, for storage optimization reasons, we do not put the complete cell in an input; instead, we just put the cell's index that leads us to the real input cell. This index structure is called OutPoint, which points to a particular cell.
OutPoint: {
tx_hash: The hash value of the transaction to which the target cell belongs
index: The cell position in the transaction to which the target cell belongs
}
Difference between lock and type script
These two locks are fundamentally the same, but they are given different names because of their different usages.
- Lock script: use to guard the ownership of a Cell. In a transaction, the lock scripts run for all inputs by group.
- Type script: use to guard the transformation of cells. In a transaction, the type scripts run for all inputs and outputs by group.
Congratulations!
Let's review all the concepts we have learned:
- A cell is a box that can be used to store any type of data. 1 CKB = 1 Byte storage.
- The lock's code_hash and hash_type fields are used to locate code, which is stored in the data field of a dep cell.
- Each cell can carry two scripts, one is called lock script (default) and the other, type script (optional).
- Lock scripts are often used to protect the ownership of the cell. Type scripts often used to handle the cell transformation rules.
That's right, with the above theoretical knowledge, you're ready to hit the road.