Programming Languages White Papers

Using Formats, MP Connect, and Other SAS Efficiency Techniques to Save Time and Disk Space

Overview Working with historic household Census data can be quite challenging. Semi-annual data sets can exceed 50 gigabytes and when working with multi-year data it is often necessary to extract information from yearly clusters of data sets exceeding 300 gigabytes. These yearly clusters, until recently, have been processed sequencally, each independent process relying on the completion of the previous process before continuing. The sorting alone of some of these data sets can take over an hour of clock time. Processing six data set representing three years of data can take six hours. By using format tables and eliminating sorts and merges, reducing repeated passes through the data, and the use of MP connect for independent processes processing time and temporary storage can be reduced by over 50 percent.

Further White Paper Details
PublisherSAS Institute File FormatPDF
Date PublishedFebruary 2009
FormatWhite Papers   
Topics

Market-Leading Data-Modeling Tools: Research Report from the Burton Group

The Burton Group provides an in-depth research report on Market-Leading Data-Modeling Tools. According to their research, basic data modeling tools have become commoditized - basic features are yesterday's...

The Converging Paths of SQL Server and SharePoint - Don't Wait Until It's Too Late!

SharePoint and SQL server have much in common, and understanding their similarities will help you streamline your day-to-day tasks and help you work more efficiently. Do you know what those...

Supporting Employees Anytime, Anywhere

New business demands require a new approach to end-user support.  This is leading organizations to a remote service delivery model that leverages the Web and Saas technology

The Pursuit of a Standardized Solution for Secure Enterprise RBAC

Each RBAC implementation varies in its capabilities and method of management. In a multi-platform environment, these differences introduce higher administration hours and costs because the various RBAC models are not...

Massive But Agile: Best Practices for Scaling the Next-Generation Enterprise Data Warehouse - Forrester Report

Information and knowledge management (I&KM) professionals continue to expand the scale, scope, and deployment roles for their enterprise data warehouse (EDW) investments. Information managers are adopting EDW best practices that...


Quick Sitemap Links: