Click here to Skip to main content
65,938 articles
CodeProject is changing. Read more.
Articles
(untagged)

Dell SharePoint Infrastructure Optimization with AvePoint

2 Apr 2012 1  
This article provides an overview of Dell's SharePoint Infrastructure Optimization solution. Details of the solution stack, which includes components of AvePoint's DocAve Software Platform and Dell hardware components, are discussed along with the set of functional and performance tests that were us

This article is in the Product Showcase section for our sponsors at CodeProject. These articles are intended to provide you with information on products and services that we consider useful and of value to developers.

and the SharePoint Zone

The Dell SharePoint Infrastructure Optimization Solution

SharePoint is a multi-faceted platform that is being utilized at an increasing rate by corporations as a mechanism to manage and gain value from enterprise content. The growth of this content in the form of unstructured data and the increased reliance on SharePoint is challenging IT to adequately architect the underlying infrastructure to match and grow with business usage. Business as usual is not going to address SharePoint requirements especially considering that the typical backup/restoration requirements commonly associated with SQL Server usage are inefficient.

Organizations are realizing storage and data management must be brought into their main SharePoint operations environment to provide for unified SharePoint operations to reduce capital costs, operating expenses, and outage risk. What is required is a comprehensive SharePoint infrastructure approach which can seamlessly grow with the business, support tier 1 applications and their SLA’s, improve end user performance, improve manageability, and consolidate SharePoint unstructured content, backup, and archive datasets.

Dell’s SharePoint Infrastructure Optimization solution has the capability to interoperate and optimize existing SharePoint deployments seamlessly with no disruption to the applications or end users. As well leveraging the differentiated DX Object Storage Platform to consolidate SharePoint back-end data allows IT to scale the infrastructure according to the business demands.

The Dell DX Object Storage Platform and AvePoint’s DocAve Software Platform can deliver a comprehensive, fully integrated solution for complete SharePoint Enterprise Content Lifecycle Management and Migration that can include all or some of the components based on the enterprise’s needs.

  • Simplify the migration process with the DocAve Migration Solution and the DX Object Storage Platform
    • A highly efficient and cost-effective solution to migrate existing enterprise content, Exchange Public Folders, File Systems and previous versions of SharePoint into SharePoint 2010.
  • Improve SQL Performance and reduce SQL capacity requirements by as much as 90+% effectively eliminating backup and restoration challenges with DocAve Extender and the DX Object Storage Platform
    • Offloads Binary Large Objects (BLOBs) data from SQL Server databases to the DX Object Storage Platform - leveraging Microsoft's fully supported External BLOB Storage (EBS) or Remote BLOB Storage (RBS) APIs - to improve SQL performance while providing for seamless end-user access and interaction. To better understand the potential benefits and considerations for incorporating BLOB externalization into SharePoint content storage strategies, please reference “Optimize SharePoint Storage with BLOB Externalization” by SharePoint MVPs Dan Holme, Randy Williams, and Jeremy Thake.
  • Automate SharePoint archive via business-driven rules to automatically apply content lifecycle management with DocAve Archiver and the DX Object Storage Platform
    • Empowers administrators to automate the process of offloading fixed or stale SharePoint content that would ordinarily take up valuable SharePoint resources to the easily manageable peer-scaling DX Object Storage.
  • Robust data protection and quick restore with DocAve Backup and Restore and the DX Object Storage Platform
  • Quickly restore lost or corrupt assets, including content externalized with the DX Object Storage Platform and DocAve storage management solutions, with all metadata and dependencies intact to help ensure minimal business disruption.

Dell DX Object Storage Platform

The DX Object Storage Platform is a new generation of storage platform that combines ease of management with scalability and resilience. It’s unique manner of storing data, be it SharePoint content, a backup dataset, or an archived SharePoint site as an object and not a collection of bytes, provides surprising ROI and TCO benefits that can be breakthroughs for IT departments.

The capabilities of the DX Object Storage Platform include:

Ability to Scale with Ease

  • One storage pool with a global address space to consolidate SharePoint BLOBs, backup, and archive datasets
  • Scale SharePoint to multi-Petabyte's – seamlessly with no service disruption
  • All SharePoint capacity is virtualized and all data is automatically thin provisioned

Powerful Data and Storage Management Services

  • Self-Managing – SharePoint access is inherently and automatically load balanced
  • Self-Healing – all SharePoint content is automatically verified for consistency and repaired if necessary
  • Policy based transactional replication for SharePoint BLOBs, no RPO scheduling required

Future Proofed Design

  • No database complexity, no file system, or LUNs to manage
  • Optimized data protection for SharePoint content without the complexities of RAID
  • No SharePoint application connection limit or application connection state to manage
  • Storage technology and protocol impacts are removed from the SharePoint application and IT environments

Solution Software Data-Control Flow

The resultant Data Flow after the deployment of the SharePoint Infrastructure Optimization solution is enhanced by offloading the BLOB, backup, and archive flows from the main SharePoint environment. This augmented Data Flow serves to provide a parallelism of access to SharePoint, both for the end user as well as IT services, and also allows the optimal operation of the underlying SQL Server.

Figure 1. Detailed Data-Control Flow

AvePoint’s DocAve Software Platform

AvePoint's DocAve Software Platform is a truly integrated platform for SharePoint management, protection, and storage optimization, allowing organizations to scale SharePoint's architecture while simplifying SharePoint management. DocAve's uniquely distributed architecture not only allows for unmatched scalability, but also provides unified storage management of global SharePoint content on the DX Object Storage Platform.

DocAve provides a fully integrated and scalable software solution for organizations to streamline and simplify tasks associated with managing SharePoint environments, help comply with stringent SLAs including aggressive RTOs and RPOs for protecting SharePoint content, improve SharePoint performance for global users, optimize SQL resources, and can reduce risks associated with managing complex SharePoint environments by:

  • Providing the tools necessary to streamline and automate common, yet time-consuming tasks that burden IT administrators, such as daily permissions management.
  • Giving IT Managers insight into how SharePoint is being leveraged by end-users, so IT Managers can make better decisions on how to structure, manage, and grow SharePoint environments.
  • Optimizing SQL resources to improve performance for end-users, and can lower total cost of ownership
  • Providing IT Managers with robust solutions to quickly recover accidentally deleted or corrupted SharePoint content in order to help IT comply with stringent corporate SLAs and minimize business disruption.
  • Allowing architects to expand SharePoint's storage and server architectures, improving quality of service for all global users while keeping costs in check.

The DocAve Software Platform components are shown in Table 1 and Table 2.

Table 1. AvePoint’s DocAve Software Platform Components – DocAve Manager

Component Role
DocAve Manager
Web Service - Represents the access point to the DocAve GUI, allowing users to interact with the software. The web service can be installed on multiple machines and allows administrators to access the DocAve GUI using a web browser.
Auditing Service - Monitors SharePoint activities and returns data to the Control Services for processing.
Control Service - Manages all DocAve operations. All agent machines use the control service port and address to communicate with the manager, This service can be run on a server cluster to ensure high availability.
Media Service - Manages all data storage for the DocAve platform. These can be installed on multiple machines and can manage multiple Logical Drives. Using multiple media agents will allow for high availability and load balanced access to the data storage locations.
Search Service - Manages all data by generating a full text index.
Report Service - Manages all SharePoint data collection and management for the DocAve Report Center. This must be configured with the associated SQL databases and SQL Report Services.

Table 2. AvePoint’s DocAve Software Platform Components – DocAve Agent

Component Role
DocAve Manager
Storage Optimization –Archiver, Content & Media Library Connectors, Extender
Data Protection – Granular Backup, High Availability Sync, Platform Recovery
Administration – Administrator, Content Manager, Deployment Manager, Replicator
Reporting – Report Center for Infrastructure Monitoring, Usage Analysis, and Alerting
Compliance – Auditor, eDiscovery, Vault
Migration – Exchange Public Folder, File Share, EMC Documentum, eRoom, Livelink, Lotus Notes & QuickPlace, SharePoint, Stellent, Vignette Migrators

Solution Hardware Configuration

The Dell SharePoint Infrastructure Optimization with AvePoint’s DocAve Platform Reference Architecture is based on the Medium SharePoint 2010 Farm, supporting up to 5,000 users, as defined by Dell’s Global Solutions Engineering team (for details see Microsoft SharePoint Server 2010: Designing and Implementing a Medium Farm). SharePoint is implemented in four servers: two SharePoint Front End Web Servers for high availability, a SharePoint Application Server, and a SharePoint SQL Server Database server. (For full high availability, a second Application Server can be added and a second SQL Server Database server can be added to form a SQL Server Failover Cluster).

Figure 2. Dell SharePoint Optimization with AvePoint’s DocAve Platform Reference Architecture

To the SharePoint infrastructure is added a highly available DX Object Storage Cluster, consisting of primary and secondary DX6000 Cluster Services Nodes, and two DX6012 Storage Nodes with 12 1 TB disks each.

The SharePoint servers and DX6000 Cluster Services Nodes are connected via a public network subnet (192.168.1.x). Additionally, a private network for the DX Object Storage Nodes (10.10.1.x) connects the DX Object Storage Cluster to the server running the DocAve Media Services (SP-WFE1 in this case).

Solution SharePoint Configuration

SharePoint roles are assigned to the various servers to maximize performance and availability. Looking specifically at the Search application (see Figure 2), the Admin and Crawl components are assigned to the application server while the search Index is spread across both web front end servers, with each web front end server’s Index portion mirrored on the other web front end server. This provides higher performance (by spreading the work across multiple servers) as well as high availability. Additional web front end servers can be added for more Index servers and additional application servers can be added to provide additional search Crawlers, as well as to provide high availability. The two web front end servers are load balanced using Microsoft Network Load Balancing.

Figure 3. SharePoint Roles

SharePoint Site Collections

To facilitate testing, an automated method was developed to quickly populate large SharePoint site collections. First a content database with one site collection was created using the SharePoint administrator web site. Then a PowerShell script (available from authors) was run to create and load a specified number of team sites within that site collection. The content for each site is the same and consists of approximately 1 GB of typical document files: Microsoft Word documents, PDFs, text files, pictures (see Table 2 for details). For uniqueness each file has the site index number appended to the filename, for example QuickStart_Installation_001.pdf.

Table 3. Files in each SharePoint site used in tests

Number of files 173
Size range of files 21 B – 554 MB
Number of files larger than 1 MB 24
File types .docx, .pdf, .txt, .bmp, .zip

The site collections built for testing are shown in the next table. Two large (200GB) site collections were built for testing the Remote BLOB Storage (RBS) feature of SQL Server as implemented by DocAve Extender. Two smaller (10GB) sites were built for testing DocAve Archive and Backup/Restore. The content databases holding the large site collections are stored on their own 400GB volumes on the Dell EqualLogic storage. The smaller site collections’ content databases are stored on a third volume.

Table 4. SharePoint Site Collections Created

Site Collection Name SQL Server Content Database No of Sites Files per Site Size of each Site Total Site Collection Size (approx.) Test
SC1 WSS_Content_1 200 173 1 GB 200 GB BLOBs remain in SQL, Crawled
SC2 WSS_Content_2 200 173 1 GB 200 GB BLOBs Extended, DB Shrunk, Crawled
SC3 WSS_Content_3 10 173 1 GB 10 GB Archived, DB Shrunk
SC4 WSS_Content_4 10 173 1 GB 10 GB Backed up/Restored

DocAve Functionality Testing

Extender

The Remote BLOB Store (RBS) feature of SQL Server 2008 can improve SharePoint functionality immensely by removing the binary large objects (BLOBs) such as large PDF files from the SQL Server database and storing them on external storage, replacing them with stubs. DocAve software contains an RBS provider through their Extender functionality, which has been certified with the Dell DX Object Storage Platform.

To test the Extender functionality and simulate BLOB removal from an existing SharePoint site, one of the large (200GB) site collections, SC2, had all BLOBs larger than 1MB externalized to the DX Object Storage Cluster and then its content database was compacted using the SQL Serve Shrink function. The results are shown in Table 8. In just over 7 hours, 4,800 BLOBs (24 per site) were moved from SQL Server to the DX Object Store Cluster storage. Upon completion, the content database was compacted to 6.5 GB from its original 206 GB, a reduction of 97%.

Table 5. Extender Test - 200 GB Site Collection

Site Collection Size Time for Extender Operation Number of BLOBs Extended Database Size after Shrink
206 GB 7 hours 22 min 4800 6.5 GB

To demonstrate that externalizing the BLOBs did not impact SharePoint functionality, both the Extended site collection (SC2) as well as the non-Extended site SC1 were crawled for search indexing. The crawls on both site collections took the same amount of time (1 hour, 3 minutes) and generated the same search results.

In the next section the performance impact of reducing the content database size on SQL Server backup and restore with Extender will be shown. Load tests in the final section demonstrate the impact of Extender on typical end user SharePoint operations.

Archiver

With DocAve Archiver objects meeting certain criteria – when last accessed, size, etc. - can be moved from SharePoint to the DX Object Storage Platform and maintained there according to a specified retention policy. To test this, all objects greater than 1MB from the 10 GB site collection SC3 was archived. As seen in Table 5, the operation took 24 minutes, 240 files were archived, and the Content database ended taking up less than 1 GB after shrink.

Table 6. Archiver Test - 10 GB Site Collection

Site Collection Size Time for Archive Operation Number of Files Archived Database Size after Shrink
10 GB 24 min 240 .35 GB

Backup/Restore

DocAve Granular Backup and Restore provides for full or incremental backup of any specified sites, site collections, or content databases, to the DX Object Storage Platform. To test this a full backup of a 10 GB site collection, SC4, was created and then restored. Results are in Table 6.

Table 7. Backup and Restore of 10GB Site Collection

Site Collection Size Number of Items Backed Up Backup Time Restore Time
10 GB 2642 22 min 55 sec 17 min 39 sec

Extender Impact on SQL Backup and Restore Performance

To quantify the impact of BLOB externalization on SQL Server database performance, SQL Server backups and restores were performed on the content databases of the two 200 GB site collections. As seen in the next two tables, backing up the Extended site collection was 29x faster than backing up the non-Extended site collection, and restoring it was 32x faster.

Table 8. Backup Performance with and without Extender

Site Collection Name Status Backup Time (sec) Size of Backup File (GB) Backup Performance Improvement over Not Extended DB
SC1 Not Extended 2811 207 -
SC2 Extended 98.3 6.42 29X

Table 9. Restore Performance with and without Extender

Site Collection Name Status Restore Time (sec) Restore Performance Improvement over Not Extended DB
SC1 Not Extended 1533 -
SC2 Extended 47.2 32X

SharePoint Load Performance with Extender

To further evaluate the effect of Extender on SharePoint end user performance, a load test was built using Microsoft Visual Studio 2010 Test to simulate typical SharePoint use, and run against both the site collection with BLOBs still in SQL Server (SC1) and the site collection with BLOBs externalized to the DX Object Storage Platform store with DocAve Extender (SC2).

Four typical use cases – Browse, Download File, Upload File and Search – were modeled. Browses involved users going to the various site home pages as well as certain list pages. Downloads and uploads involved the same set of files of varying sizes that were used to build each site. Search used a common set of technical search terms relevant to the file set. All tests were run against all 200 sites of each site collection.

The four use cases were run in a proportion to model the Microsoft SharePoint 2010 Capacity Planner (SCP) “Heavy Collaboration” usage profile, as shown the next table. In each test 1,000 simulated users were run for 60 minutes, with a 60 second think time inserted between operations, so that each user would issue roughly one request per minute, again according to the SCP.

Table 10. Microsoft SharePoint Capacity Planner Heavy Collaboration Workload

Operation Mix
Browse 57
Download file 17
Upload file 9
Search 17
Total 100

The results of the test are shown in the next table. Using DocAve Extender to move BLOBs from SQL Server to the Dell DX Object Storage Platform results in performance improvements of from 17% to 40%.

Table 11. Extender Load Test Results

SC1 – BLOBs remain in SQL Server SC2 –BLOBs Extended to DX Object Storage Extended Faster than Non-Extended
Request Start Mix Requests Completed % Average Test Time (s) Requests Completed % Average Test Time (s)
Browse 57 33,243 56.8% 0.91 33,705 57.1% 0.64 30%
Download 17 10,197 17.4% 1.26 10,035 17.0% 0.76 40%
Upload 9 5,146 8.8% 4.53 5,296 9.0% 3.42 25%
Search 17 9,913 16.9% 1.53 9,964 16.9% 1.27 17%

Acknowledgements

Dave Jaffe would like to thank Quocdat Nguyen and Ravi Chaganti of the Dell Global Solutions Engineering team for much SharePoint and PowerShell assistance, Omar Rawashdeh of the Dell Storage Engineering team for help setting up the DX Object Storage Platform with DocAve, and Roger Yu of AvePoint for much help with installing and running DocAve.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here