I wrote this article in Arabic too. Check out the Arabic version here.
Binary Large Objects (BLOBs) are pieces of data that have -usually- exceptionally large size (such as pictures or audio tracks). These values are stored in SQL Server in an image column.
Sometimes, the term BLOB is also applied to large character data values, such as those stored in text
or ntext
columns.
Also, you can store BLOB data in a binary column, but it doesn't take larger than 8000 bytes. And image
columns are more flexible.
Working with BLOB data is a bit strange because:
- You don't know how much size will be the retrieved data.
- The data may be very large so we need to retrieve it in chunks.
Our example is fairly simple. This example stores files in a database (FileStore
) and retrieves it by name. The example relies on a database that contains one table, MyFiles
. And the table itself contains two columns, one for filename
(PK) and the other is an image
column for the file itself.
Storing BLOB Data
Storing BLOB data in a database is the easiest part:
In order to run this code, you must add using
statements to Sql.Data.SqlClient
and System.IO
.
static void StoreFile(string filename)
{
SqlConnection connection = new SqlConnection
("Server=(local) ; Initial Catalog = FileStore ; Integrated Security = SSPI");
SqlCommand command = new SqlCommand
("INSERT INTO MyFiles VALUES (@Filename, @Data)", connection);
command.Parameters.AddWithValue("@Filename", Path.GetFileName(filename));
command.Parameters.AddWithValue("@Data", File.ReadAllBytes(filename));
connection.Open();
command.ExecuteNonQuery();
connection.Close();
}
Code Explanation
First, we created a connection to the SQL Server database. And then, we created the SqlCommand
object that will hold the T-SQL Insert
statement. After that, we filled the command parameters with required values. Finally, we executed the command.
Well, for avoiding SQL-Injection attacks, it’s recommended that you use parameters instead of hard-coding the argument. Moreover, you can't represent binary values as string
s.
Frankly, it’s recommended using stored procedures instead of coding the commands.
It’s highly recommended that you dispose disposable objects like SqlConnection
and SqlCommand
. Try encapsulating it in a using
statement.
Retrieving BLOB Data
Retrieving BLOB data is a bit complex than storing it. The following method demonstrates this:
static byte[] RetrieveFile(string filename)
{
SqlConnection connection = new SqlConnection
("Server=(local) ; Initial Catalog = FileStore ; Integrated Security = SSPI");
SqlCommand command = new SqlCommand
("SELECT * FROM MyFiles WHERE Filename=@Filename", connection);
command.Parameters.AddWithValue("@Filename", filename);
connection.Open();
SqlDataReader reader =
command.ExecuteReader(System.Data.CommandBehavior.SequentialAccess);
reader.Read();
MemoryStream memory = new MemoryStream();
long startIndex = 0;
const int ChunkSize = 256;
while (true)
{
byte[] buffer = new byte[ChunkSize];
long retrievedBytes = reader.GetBytes(1, startIndex, buffer, 0, ChunkSize);
memory.Write(buffer, 0, (int)retrievedBytes);
startIndex += retrievedBytes;
if (retrievedBytes != ChunkSize)
break;
}
connection.Close();
byte[] data = memory.ToArray();
memory.Dispose();
return data;
}
Code Explanation
After connecting to the database and writing our query, we executed the query by calling ExecuteReader()
method of the command object to get read-only forward-only pointer to the retrieved rows.
By default, SqlDataReader
reads entire rows -that can be gigabytes of data.- By specifying CommandBehavior.SequentialAccess
, it reads the data sequentially in a given chunk size by calling the GetBytes()
-or GetChars
for BLOB textual data- method.
Calling Read()
of the SqlDataReader
objects advances the pointer to the next row which is the first single row -if found- in our example.
The GetBytes()
method takes five arguments:
- The column index
- The index of which to start reading
- The buffer object that will keep current retrieved data
- Index in buffer of which to begin writing t
- The length (chunk size) of the data to retrieve
It is worth mentioning that this method returns number of bytes retrieved.
After calling this method, we used a MemoryStream
object to write all data retrieved to.
Finally, we retrieve data by calling MemoryStream
’s ToArray()
function. (I think the code is now clear.)
It’s not recommended to use MemoryStream
if the data is very huge.
SqlConnection
, SqlCommand
, SqlDataReader
, and MemoryStream
are all disposable objects.
Because the MemoryStream
object may contain the retrieved data, it’s highly recommended that you dispose it as soon as possible.
For a complete example, download the sample project FileStore
. This project uses a database for storing files and retrieving it. This database contains only one table, its definition is as follows:
For creating the database, the project also includes a SQL Query file that contains the commands for creating it. Simply execute the file.
The project has been created using Visual Studio 2008 and .NET Framework 2.0.
Posted in Data Access Tagged: .NET, CodeProject, CSharp, SQL Server