fast_search_histogram

Version 1.0.0 (3.6 MB) by Bryce Henson

fast 1d histogramming algorithms based on binary search. (faster than histcounts)

https://github.com/brycehenson/fast_search_histogram

4.0

(1)

80 Downloads

Updated 2 Dec 2021

View License on GitHub

This project demonstrates superior speed to Matlab's inbuilt histcounts function and provides an adaptive function (hist_adaptive_method) that automatically picks the fastest method.

The brute force approach to histogramming is to compare each bin to each data value (or *count*) and gives a complexity **O(n·m)** where *n* is the number of data values and *m* is the number of bins. This can be improved by two algorithms.

1. **Bin Search, O(n·log(m))**: For each count do a binary search for the histogram bin that it should go into and then increment that bin. Because the bins are already ordered then there is no sorting needed. Best when m>>n (sparse histogramming).
to use:
bin_counts=hist_bin_search(data,edges)

2. **Count Search, O(m·log(n))**: For each bin edge do a binary search to find the nearest data index. Use the difference in this data index between bins to give the number of counts. Must have ordered data for the search to work, sorting first would cost **O(n·log(n))** and would make this method slower unless repeated histogramming was needed. Best when n>>m (dense histogramming) which is the more common use case. (this is the method shown in the logo)
to use:
bin_counts=hist_count_search(data,edges) (WARNING SORTED DATA REQUIRED)

I observe empirically (see /figs/scaling_comparison.png & hist_scaling_test) that there is a fairly complex dependence of which algorithm is best on the value of n and m. I have implemented a function that does a good job of picking the fastest method.

Cite As

Bryce Henson (2024). fast_search_histogram (https://github.com/brycehenson/fast_search_histogram), GitHub. Retrieved July 27, 2024.

MATLAB Release Compatibility

Created with R2019a

Compatible with any release

Platform Compatibility

Windows macOS Linux

Tags Add Tags

Acknowledgements

Inspired by: M-file Header Template, Binary search for closest value in an array, "Smart"/Silent Figure

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

bin/Colormaps

bin/RegularizeData3D

bin/closest_value

closest_value

bin/sfigure

sfigure

bin/template_header

Versions that use the GitHub default branch cannot be downloaded

Version	Published	Release Notes
1.0.0	2 Dec 2021

To view or report issues in this GitHub add-on, visit the GitHub Repository.

fast_search_histogram

Cite As

MATLAB Release Compatibility

Platform Compatibility

Categories

Tags Add Tags

Acknowledgements

Community Treasure Hunt

Discover Live Editor

bin/Colormaps

bin/RegularizeData3D

bin/closest_value

bin/sfigure

bin/template_header

fast_search_histogr​am

Cite As

MATLAB Release Compatibility

Platform Compatibility

Categories

Tags Add Tags

Acknowledgements

Community Treasure Hunt

Discover Live Editor

bin/Colormaps

bin/RegularizeData3D

bin/closest_value

bin/sfigure

bin/template_header

fast_search_histogram