Introduction
In Part 1, we discussed about the facts and logic behind the Excel 2007 file. In this article, we will see how we write the Excel 2007 file.
As we have already seen, sharedString.xml and Sheet1.xml now let us explore some other files that are important in Excel 2007 file writing.
1. [ContentType].xml
="1.0" ="UTF-8" ="yes"
<Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types">
<Override PartName="/xl/theme/theme1.xml"
ContentType="application/vnd.openxmlformats-officedocument.theme+xml" />
<Override PartName="/xl/styles.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.styles+xml" />
<Default Extension="rels"
ContentType="application/vnd.openxmlformats-package.relationships+xml" />
<Default Extension="xml" ContentType="application/xml" />
<Override PartName="/xl/workbook.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet.main+xml" />
<Override PartName="/docProps/app.xml"
ContentType="application/vnd.openxmlformats-officedocument.extended-properties+xml" />
<Override PartName="/xl/worksheets/sheet2.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.worksheet+xml" />
-->
<Override PartName="/xl/worksheets/sheet3.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.worksheet+xml" />
-->
<Override PartName="/xl/worksheets/sheet1.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.worksheet+xml" />
-->
<Override PartName="/xl/sharedStrings.xml"
ContentType="application/vnd.openxmlformats-officedocument.spreadsheetml.sharedStrings+xml" />
<Override PartName="/docProps/core.xml"
ContentType="application/vnd.openxmlformats-package.core-properties+xml" />
</Types>
As the name shows, this file contains the content type information of every file. When we write Excel file, the rest of the file would remain the same but the portion which is related with sheet information would be changed because it depends upon the number of sheets that Excel file contains. For example, if the Excel file contains only one sheet, then it contains only one line that would be representing sheet content information.
2. App.xml
="1.0" ="UTF-8" ="yes"
<Properties xmlns="http://schemas.openxmlformats.org/officeDocument/2006/
extended-properties"
xmlns:vt="http://schemas.openxmlformats.org/officeDocument/2006/docPropsVTypes">
<Application>Microsoft Excel</Application>
<DocSecurity>0</DocSecurity>
<ScaleCrop>false</ScaleCrop>
<HeadingPairs>
<vt:vector size="2" baseType="variant">
<vt:variant>
<vt:lpstr>Worksheets</vt:lpstr>
</vt:variant>
<vt:variant>
<vt:i4>3</vt:i4> -->
</vt:variant>
</vt:vector>
</HeadingPairs>
<TitlesOfParts>
<vt:vector size="3" baseType="lpstr"> -->
<vt:lpstr>Sheet1</vt:lpstr> -->
<vt:lpstr>Sheet2</vt:lpstr> -->
<vt:lpstr>Sheet3</vt:lpstr> -->
</vt:vector>
</TitlesOfParts>
<LinksUpToDate>false</LinksUpToDate>
<SharedDoc>false</SharedDoc>
<HyperlinksChanged>false</HyperlinksChanged>
<AppVersion>12.0000</AppVersion>
</Properties>
It is placed at docProp\app.xml, this file contains number and names of the Sheets. As you can see the comments on the above code, when we will write this file we need to change the value of the number of sheets and sheet names according to our Excel file. Suppose we have two Sheets in an Excel file and their names are SampleSheet1 and SampleSheet2, then this file would be like:
="1.0" ="UTF-8" ="yes"
<Properties xmlns="http://schemas.openxmlformats.org/officeDocument/
2006/extended-properties"
xmlns:vt="http://schemas.openxmlformats.org/officeDocument/2006/docPropsVTypes">
<Application>Microsoft Excel</Application>
<DocSecurity>0</DocSecurity>
<ScaleCrop>false</ScaleCrop>
<HeadingPairs>
<vt:vector size="2" baseType="variant">
<vt:variant>
<vt:lpstr>Worksheets</vt:lpstr>
</vt:variant>
<vt:variant>
<vt:i4>2</vt:i4>
</vt:variant>
</vt:vector>
</HeadingPairs>
<TitlesOfParts>
<vt:vector size="2" baseType="lpstr">
<vt:lpstr>SampleSheet1</vt:lpstr>
<vt:lpstr>SampleSheet2</vt:lpstr>
</vt:vector>
</TitlesOfParts>
<Company />
<LinksUpToDate>false</LinksUpToDate>
<SharedDoc>false</SharedDoc>
<HyperlinksChanged>false</HyperlinksChanged>
<AppVersion>12.0000</AppVersion>
</Properties>
Here we can see the difference between these two XML files, first the Total number of sheet is changed from 3 to 2, and secondly sheet names are changed.
3. WorkBook.xml.rels
="1.0" ="UTF-8" ="yes"
<Relationships xmlns="http://schemas.openxmlformats.org/package/2006/relationships">
<Relationship Id="rId3"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/worksheet
Target="worksheets/sheet3.xml" />
<Relationship Id="rId2"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/worksheet
Target="worksheets/sheet2.xml" />
<Relationship Id="rId1"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/worksheet
Target="worksheets/sheet1.xml" />
<Relationship Id="rId6"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/sharedStrings
Target="sharedStrings.xml" />
<Relationship Id="rId5"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/styles
Target="styles.xml" />
<Relationship Id="rId4"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/theme
Target="theme/theme1.xml" />
</Relationships>
You can find this file at xl\_rels\workbook.xml.rels. It contains all the relationship information that contains by the WorkBook, and every relationship is assigned an unique Id which set in the value of Id attribute. Let's consider the Excel file which contains two sheets, then how this file would be:
="1.0" ="UTF-8" ="yes"
<Relationships xmlns="http://schemas.openxmlformats.org/package/2006/relationships">
<Relationship Id="rId3"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/theme
Target="theme/theme1.xml" />
<Relationship Id="rId2"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/worksheet
Target="worksheets/sheet2.xml" />
<Relationship Id="rId1"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/worksheet
Target="worksheets/sheet1.xml" />
<Relationship Id="rId5"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/sharedStrings
Target="sharedStrings.xml" />
<Relationship Id="rId4"
Type=http://schemas.openxmlformats.org/officeDocument/2006/relationships/styles
Target="styles.xml" />
</Relationships>
If you compare both the XMLs, the difference you get is that the second XML contains information only for two sheets while the first one contains for three, and of course, the relationship ids are changed. so when we will write this file we should take care of the Id generation and should generate sheet relations according to the number of sheets which Excel file contains.
4. WorkBook.xml
="1.0" ="UTF-8" ="yes"
<workbook xmlns=http://schemas.openxmlformats.org/spreadsheetml/2006/main
xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships">
<fileVersion appName="xl" lastEdited="4" lowestEdited="4" rupBuild="4505" />
<workbookPr defaultThemeVersion="124226" />
<bookViews>
<workbookView xWindow="120" yWindow="105" windowWidth="14295" windowHeight="5130" />
</bookViews>
<sheets>
<sheet name="Sheet1" sheetId="1" r:id="rId1" />
<sheet name="Sheet2" sheetId="2" r:id="rId2" />
<sheet name="Sheet3" sheetId="3" r:id="rId3" />
</sheets>
<calcPr calcId="124519" />
</workbook>
Workbook contains the information about sheets. This is the file where name of sheets are defined, rest of the files consume these sheet names. This file also contains the mapping between relationships and Sheet names by defining the value of attribute r:Id. We have to be careful when generating this file because we have to map correct relationship ids for the sheets else Excel file would be corrupted.
Logic
So the logic is very simple if we write all those files according to our data properly and compress these files into one file, then it can be read by Excel 2007 engine.
Using the Code
As we know, Excel file is a WorkBook which contains multiple sheets and every sheet contains a collection of rows and columns, while intersection of row and column is called cell.
So to make our code easy to adapt, let's break up our code into three main classes:
WorkBook
(that is the root class, like Excel file which contains all the information about sheets and data.)
WorkSheet
(this class represents single WorkSheet of Excel file).
Cell
(this class represents Cell information of Excel file).
Let's see how we use the code:
Excel2007.WorkBook workBook = new Excel2007.WorkBook();
Response.Clear();
Response.BufferOutput = false;
string archiveName = "test.xlsx"; Response.ContentType = "application/zip";
Response.AddHeader("content-disposition", "filename=" +
archiveName);
workBook.WorkSheets.Add("FirstSheet"); workBook.WorkSheets["FirstSheet"].Cells[0, 0].Value =
"first value"; workBook.WorkSheets["FirstSheet"].Cells[0, 1].Value = "second value";
workBook.WorkSheets.Add("SecondSheet");
workBook.WorkSheets["SecondSheet"].Cells[0, 0].Value = "third value";
workBook.Save(Response.OutputStream); Response.End();
So what our code did is, first it took all the information about the data and Sheets and saved it in memory, and when WorkBook.Save
method called it wrote all the information to the required XML files and then compress those files via DotNetZip library into a single file.
So this is a very simple library that can write basic Excel 2007 file.
Some More Solutions
For complex Excel writing, you can use different solutions which are available for free. Some are as follows:
- ExcelPackage
- EPPlus
- Open XML
Microsoft also provided System.IO.Packaging
library from which you can write Excel file, you can use that library just by adding WindowsBase DLL reference in your project.
History
- 27th July, 2011: Initial version