But 15 is an average to start with.īased on the above sizes, we calculate the size of each row and then multiply by the number of rows to calculate the total size of the dataset. So columns like Country code will be of size 2 bytes and columns having user comments will have 20 to 30 bytes. The correct way for String calculation is based on the max length of the string in the column. String: 15 bytes for the String columns.Boolean: 1 byte for the Boolean columns.Double: 8 bytes for the Double columns.The range of values for this is 1.1754 E-38 to 3.4028 E+38 Bigint: 4 bytes for the Big int or Long columns.Below are the number of bytes for each data type Tellius calculates the size of the dataset closer to the CSV format using a specific number of bytes for each data type. The purpose of this document is to describe the data size calculation algorithm used in Tellius to calculate the size of a dataset as per the Tellius specifications.