![]() |
|
The Sheerpower Cluster Statistics Package provides a powerful set of statistical functions to analyze data stored in cluster arrays. These functions enable users to extract meaningful insights—such as trends, variability, and relationships—from large datasets, making them invaluable for applications in finance, manufacturing, science, and more.
This package works seamlessly with cluster arrays, allowing
you to perform calculations on columns of data (e.g.,
sample->value1
) without manual iteration. Each
function returns a specific statistic, paired with practical
examples and decision-making guidance.
Note: The statistical function examples in
this tutorial use a cluster array named sample
,
which contains two fields: value1
and
value2
. The value1
field holds 100
rows with values increasing from 1 to 100, while
value2
holds values decreasing from 100 to 1.
These functions are optimized for high performance, leveraging
SheerPower’s high-speed in-memory processing capabilities.
With enough RAM, even billion row clusters are supported -- making the Sheerpower Statistics Package a very powerful tool.
Best Practice: When performing statistical analysis on a subset of data, first copy that subset into a separate cluster. This method allows you to compute statistics on any chosen data set without limitations. Since SheerPower can copy millions of rows per second, this process is highly efficient, making execution time negligible. For details, see: Copy Cluster
Below is a detailed table of the statistical functions available in the Sheerpower Statistics Package, including when to use them, example scenarios, returned values, and actionable decisions based on the results.
(Show/Hide Functions Table)Function | Purpose |
---|---|
Basic Descriptive Statistics | |
stats$max(cluster->var) |
Highest value in a column |
stats$min(cluster->var) |
Lowest value in a column |
stats$range(cluster->var) |
Difference between max and min |
stats$sum(cluster->var) |
Total of all values in a column |
stats$mean(cluster->var) |
Arithmetic mean of a column |
stats$median(cluster->var) |
Middle value of a column |
stats$mode(cluster->var) |
Most frequent value |
stats$midrange(cluster->var) |
Average of max and min values |
Dispersion & Distribution Metrics | |
stats$stddev(cluster->var) |
Standard deviation of a column |
stats$var(cluster->var) |
Variance of a column |
stats$iqr(cluster->var) |
Interquartile range (Q3 - Q1) |
stats$cv(cluster->var) |
Coefficient of variation |
Percentiles & Risk Measures | |
stats$percentile(cluster->var, 50) |
Specific quantile (e.g., median) |
stats$zscore(cluster->var, 50) |
Standardized score |
stats$drawdown(cluster->var) |
Peak-to-trough decline |
stats$vrisk(cluster->var, 0.95) |
Value at Risk at 95% |
stats$cvrisk(cluster->var, 0.95) |
Expected shortfall at 95% |
Skewness & Kurtosis (Shape of Distribution) | |
stats$skew(cluster->var) |
Distribution symmetry |
stats$kurtosis(cluster->var) |
Tail heaviness of distribution |
Alternative Measures of Central Tendency | |
stats$gmean(cluster->var) |
Geometric mean of a column |
stats$hmean(cluster->var) |
Harmonic mean of a column |
Statistical Tests & Inference | |
stats$ttest(var1, var2) |
Compare means of two columns |
stats$chi2(var1, var2) |
Goodness-of-fit test |
stats$ftest(var1, var2) |
Variance comparison |
Regression & Correlation | |
stats$linreg(var1, var2) |
Linear regression slope |
stats$pcorr(var1, var2) |
Pearson correlation |
stats$spearman(var1, var2) |
Rank correlation |
stats$cov(var1, var2) |
Covariance between columns |
Finance & Risk Analysis | |
stats$sharpe(cluster->var, 0) |
Risk-adjusted return |
stats$cagr(cluster->var, 2) |
Compound annual growth rate |
When to Use: Finding the peak value in a dataset, such as the highest daily stock price in a month.
sample->value1
to identify its maximum
value over 100 days.sample->value1
).When to Use: Identifying the lowest point, such as the minimum temperature in a weather dataset.
sample->value1
to find the lowest
moisture level over a season.sample->value1
).When to Use: Measuring the spread between highest and lowest values, such as temperature fluctuation.
sample->value1
.sample->value1
).When to Use: Aggregating totals, such as total sales revenue over a period.
sample->value1
across 100 stores.sample->value1
).When to Use: Measuring central tendency, such as average customer wait time in a call center.
sample->value1
.sample->value1
).When to Use: Robust central tendency, such as median home prices.
sample->value1
.sample->value1
).When to Use: Most common value, such as popular product size.
sample->value1
.sample->value1
, first unique).When to Use: Simple central estimate, like average of extremes in test scores.
sample->value1
.sample->value1
).When to Use: Assessing variability, such as consistency of product weights in manufacturing.
sample->value1
.sample->value1
).When to Use: Quantifying spread, such as volatility of investment returns.
sample->value1
.sample->value1
).When to Use: Measuring middle 50% spread, such as income variability.
sample->value1
.sample->value1
).When to Use: Relative variability, such as comparing consistency across datasets.
sample->value1
.sample->value1
).When to Use: Finding a specific quantile, such as median income in a survey.
sample->value1
.sample->value1
).When to Use: Standardizing data, such as identifying outliers in test scores.
sample->value1
(100 results).sample->value1
).When to Use: Measuring peak-to-trough decline, such as maximum loss in a trading account.
sample->value1
over 100 days.sample->value1
since it’s increasing).When to Use: Risk assessment, such as Value at Risk for portfolio losses.
sample->value1
.sample->value1
).When to Use: Expected shortfall, such as average loss beyond VaR in finance.
sample->value1
.sample->value1
).When to Use: Checking distribution symmetry, such as customer purchase amounts.
sample->value1
.sample->value1
, uniform).When to Use: Evaluating tail heaviness, such as risk of extreme events in insurance claims.
sample->value1
.sample->value1
).When to Use: Growth rates, such as average investment return over time.
sample->value1
.sample->value1
).When to Use: Rates or averages with inverses, such as average speed over distances.
sample->value1
.sample->value1
).When to Use: Comparing means, such as A/B testing website conversion rates.
sample->value1
vs.
sample->value2
, 100 clicks each).sample->value1
vs. sample->value2
,
equal means).When to Use: Goodness-of-fit, such as testing observed vs. expected frequencies.
sample->value1
vs.
expected in sample->value2
.When to Use: Variance comparison, such as consistency of two machines.
sample->value1
vs.
sample->value2
, 100 units each).sample->value1
vs. sample->value2
).When to Use: Linear relationship, such as sales vs. ad spend.
sample->value1
) and sales
(sample->value2
).sample->value1
vs. sample->value2
).When to Use: Normalized correlation, such as temperature vs. ice cream sales.
sample->value1
vs.
sample->value2
).sample->value1
vs.
sample->value2
).When to Use: Rank correlation, such as student test scores vs. study hours.
sample->value1
vs.
sample->value2
.sample->value1
vs.
sample->value2
).When to Use: Relationship strength, such as stock price correlations.
sample->value1
vs.
sample->value2
).sample->value1
vs.
sample->value2
).When to Use: Risk-adjusted return, such as portfolio performance.
sample->value1
.sample->value1
).When to Use: Compound growth, such as investment growth rate.
sample->value1
(100
periods).sample->value1
with num_years = 2).With sample->value1
and
sample->value2
as example data sources, the
Sheerpower Statistics Package analyzes trends, variability,
and relationships in contexts like finance, manufacturing,
and science. Decisions pivot on whether values meet targets
(e.g., stats$mean
, stats$cagr
),
indicate risk (stats$vrisk
,
stats$sharpe
), or suggest relationships
(stats$cov
, stats$pcorr
)—guiding
actions from resource allocation to strategic shifts.
stats$mean
, stats$median
) to
advanced metrics (stats$kurtosis
,
stats$spearman
).The Sheerpower Statistics Package empowers users to turn raw data into actionable insights with ease and precision.
Hide Description
|
|
Enter or modify the code below, and then click on RUN |
Looking for the full power of Sheerpower?
Check out the Sheerpower website. Free to download. Free to use. |