Fri, 07 Jun 2013 15:38:00 GMT
Thu, 06 Jun 2013 16:00:00 GMT
Thu, 06 Jun 2013 15:54:00 GMT
One of the significant differentiators of the STATISTICA family of data analysis software is its performance on large data sets and computationally intensive applications, such as analyses requiring recursive access to data or complex data management and database query operations.
For example, in a recent carefully designed and conducted comparison of competing analytic software packages performed on a quad-core 64-bit machine running under a 64-bit Microsoft Windows operating system, STATISTICA outperformed other widely used data analysis packages by a wide margin:
Read more about Performance comparison of STATISTICA Version 9 on multi-core 64-bit machines with current 64-bit releases of SAS (Version 9.2) and PASW (formerly SPSS) Statistics Version 18; basic data management, basic statistics, and aggregation operations.
The current version of STATISTICA software, including STATISTICA Data Miner , takes full advantage of state-of-the-art hardware and software technologies, as well as proprietary performance optimization technologies developed at StatSoft. STATISTICA is available as a native 64-bit application, and most STATISTICA computational (statistical) routines, as well as the key predictive modeling algorithms available in STATISTICA Data Miner, will take full advantage of multi-processor computing platforms.
Shown below are some performance benchmark data collected as part of the STATISTICA and STATISTICA Data Miner software validation and release process. Each analysis was repeated multiple times on 64 bit computers with either 1, 2, 3 or 4 processors (and otherwise identical hardware configurations). STATISTICA was designed to take advantage of available hardware resources to achieve maximum performance for complex predictive modeling analyses (e.g., via regression trees, stochastic gradient boosting, or random forests analyses), as well as common statistical analyses (e.g., computing correlation coefficients).
STATISTICA Data Miner contains multithreaded implementations of Classification and Regression Trees, CHAID, stochastic gradient boosting of trees (Boosted Trees), Random Forests (voting trees), and others, as well as multithreaded implementation of traditional generalized linear modeling techniques (e.g., logit regression, etc.). The performance of these predictive modeling algorithms on modern 64-bit multi-core hardware and 64-bit operating system platforms is spectacular, and as of this writing not matched by any other general software platform for predictive modeling (see also graphs shown above). Analyses with hundreds of variables and millions of cases will complete in minutes.
In part, the unmatched performance of STATISTICA and STATISTICA Data Miner computational algorithms was achieved through carefully redesigned intelligent data access, storage, and buffering methods. Data can be read asynchronously in multiple threads servicing different parallel computations for a single (e.g., classification and regression trees) analysis. Data arrays are never stored explicitly in memory, so there are no limitations on file sizes; yet, the available memory is used intelligently to buffer the data (read by multiple threads) to make them available for computation.
Using these technologies, STATISTICA data analysis and STATISTICA Data Miner software has leapfrogged the competition.