Working with SQL Server BLOB Data in .NET

Mohammad Elsheimy

4.75/5 (6 votes)

11 May 2010CPL3 min read

542

Binary Large Objects (BLOBs) are pieces of data that have -usually- exceptionally large size (such as pictures or audio tracks).

Download sample - 131.5 KB

I wrote this article in Arabic too. Check out the Arabic version here.

Binary Large Objects (BLOBs) are pieces of data that have -usually- exceptionally large size (such as pictures or audio tracks). These values are stored in SQL Server in an image column.

Sometimes, the term BLOB is also applied to large character data values, such as those stored in text or ntext columns.

Also, you can store BLOB data in a binary column, but it doesn't take larger than 8000 bytes. And image columns are more flexible.

Working with BLOB data is a bit strange because:

You don't know how much size will be the retrieved data.
The data may be very large so we need to retrieve it in chunks.

Our example is fairly simple. This example stores files in a database (FileStore) and retrieves it by name. The example relies on a database that contains one table, MyFiles. And the table itself contains two columns, one for filename (PK) and the other is an image column for the file itself.

Storing BLOB Data

Storing BLOB data in a database is the easiest part:

In order to run this code, you must add using statements to Sql.Data.SqlClient and System.IO.

static void StoreFile(string filename)
{
    SqlConnection connection = new SqlConnection
        ("Server=(local) ; Initial Catalog = FileStore ; Integrated Security = SSPI");

    SqlCommand command = new SqlCommand
        ("INSERT INTO MyFiles VALUES (@Filename, @Data)", connection);

    command.Parameters.AddWithValue("@Filename", Path.GetFileName(filename));
    command.Parameters.AddWithValue("@Data", File.ReadAllBytes(filename));

    connection.Open();

    command.ExecuteNonQuery();

    connection.Close();
}

Code Explanation

First, we created a connection to the SQL Server database. And then, we created the SqlCommand object that will hold the T-SQL Insert statement. After that, we filled the command parameters with required values. Finally, we executed the command.

Well, for avoiding SQL-Injection attacks, it’s recommended that you use parameters instead of hard-coding the argument. Moreover, you can't represent binary values as strings.
Frankly, it’s recommended using stored procedures instead of coding the commands.

It’s highly recommended that you dispose disposable objects like SqlConnection and SqlCommand. Try encapsulating it in a using statement.

Retrieving BLOB Data

Retrieving BLOB data is a bit complex than storing it. The following method demonstrates this:

static byte[] RetrieveFile(string filename)
{
    SqlConnection connection = new SqlConnection
       ("Server=(local) ; Initial Catalog = FileStore ; Integrated Security = SSPI");

    SqlCommand command = new SqlCommand
        ("SELECT * FROM MyFiles WHERE Filename=@Filename", connection);

    command.Parameters.AddWithValue("@Filename", filename);

    connection.Open();

    SqlDataReader reader =
        command.ExecuteReader(System.Data.CommandBehavior.SequentialAccess);

    reader.Read();

    MemoryStream memory = new MemoryStream();

    long startIndex = 0;
    const int ChunkSize = 256;
    while (true)
    {
        byte[] buffer = new byte[ChunkSize];

        long retrievedBytes = reader.GetBytes(1, startIndex, buffer, 0, ChunkSize);

        memory.Write(buffer, 0, (int)retrievedBytes);

        startIndex += retrievedBytes;

        if (retrievedBytes != ChunkSize)
            break;
    }

    connection.Close();

    byte[] data = memory.ToArray();

    memory.Dispose();

    return data;
}

Code Explanation

After connecting to the database and writing our query, we executed the query by calling ExecuteReader() method of the command object to get read-only forward-only pointer to the retrieved rows.

By default, SqlDataReader reads entire rows -that can be gigabytes of data.- By specifying CommandBehavior.SequentialAccess, it reads the data sequentially in a given chunk size by calling the GetBytes() -or GetChars for BLOB textual data- method.

Calling Read() of the SqlDataReader objects advances the pointer to the next row which is the first single row -if found- in our example.

The GetBytes() method takes five arguments:

The column index
The index of which to start reading
The buffer object that will keep current retrieved data
Index in buffer of which to begin writing t
The length (chunk size) of the data to retrieve

It is worth mentioning that this method returns number of bytes retrieved.
After calling this method, we used a MemoryStream object to write all data retrieved to.
Finally, we retrieve data by calling MemoryStream’s ToArray() function. (I think the code is now clear.)

It’s not recommended to use MemoryStream if the data is very huge.

SqlConnection, SqlCommand, SqlDataReader, and MemoryStream are all disposable objects.
Because the MemoryStream object may contain the retrieved data, it’s highly recommended that you dispose it as soon as possible.

For a complete example, download the sample project FileStore. This project uses a database for storing files and retrieving it. This database contains only one table, its definition is as follows:

FileStore File Table Definition

For creating the database, the project also includes a SQL Query file that contains the commands for creating it. Simply execute the file.
The project has been created using Visual Studio 2008 and .NET Framework 2.0.

Posted in Data Access Tagged: .NET, CodeProject, CSharp, SQL Server

License

This article, along with any associated source code and files, is licensed under The Common Public License Version 1.0 (CPL)