トップ差分一覧 Farm ソース検索ヘルプ PDF RSS ログイン

Diary/2010-1-13

各種ベンチマーク

Parsec

The PARSEC benchmark suite: characterization and architectural implications
Christian Bienia, et al, 2008

Unlike Woo et al. who measured actual concurrency on an abstract machine,
we therefore decided to analyze inherent concurrency and its limitations.
Our approach is based on the number of executed instructions in parallel
and serial regions of the code.
We neglect any delays due to blocking on contended locks and load imbalance.
This methodology is feasible because we do not study performance,
our interest is in fundamental program characteristics.

blackscholes
- calculates the prices for a portfolio of European options analytically with the Black-Scholes partial differential equation(PDE)
bodytrack
- tracks a human body with multiple cameras through an image sequence
canneal
- uses cache-aware simulated annealing(SA) to minimize the routing cost of a chip design
- uses fine-grained parallelism with a lock-free algorithm and a very aggressive synchronization strategy that is based on data race recovery instead of avoidance
dedup
- compresses a data stream with a combination of global and local compression that is called 'deduplication'
- the kernel uses a pipelined programming model to mimic real-world implementations
facesim
- computes a visually realistic animation of the modeled face by simulating the underlying physics
ferret
- is based on the Ferret toolkit which is used for content-based similarity search.
- represents emerging next-generation search engines for non-text document data types
- is parallelized using the pipeline model
fluidanimate
- uses an an extension of the Smoothed Particle Hydrodynamics(SPH) method to simulate an incompressible fluid for interactive animation purposes
freqmine
- employs an array-based version of the FP-growth(Frequent Pattern-growth) method for Frequent Itemset Mining(FIMI)
streamcluster
- solves the online clustering problem
- the prevalence of problems with streaming characteristics
swaptions
- uses the Health-Jarrow-Morton(HJM) framework to price a portfolio of swapoptions
- employs Monte Carlo(MC) simulation to compute the prices
vips
- is based on the VASARI Image Processing System(VIPS)
- includes fundamental image operations such as an affine transformation and a convolution
x264
is an H.264/AVC(Advanced Video Coding) video encoder

ALPBench

The ALPBench Benchmark Suite for Complex Multimedia Applications
Man-lap Li, et al, 2005

MPEG-2 Encoder(MPGenc)
MPEG-2 Decoder(MPGdec)
RayTracing(RayTrace)
Speech Recognition(SpeechRec)
Face Recognition (FaceRec)

MediaBench II Video

MediaBench II video: Expediting the next generation of video systems research
Jason E. Fritts, et al, 2009

H.263
- A video coder(h263enc) and decoder(h263dec) based on the ITU H.263 standard targeting video compression for transmission over ISDN networks.
- Source code produced by Telenor R&D.
H.264
- A video coder(h264enc) and decoder(h264dec) based on the forthcoming joint ISO/ITU H.264 standard(also known as MPEG-4 part 10) for very low bit-rate video coding.
- Source code is the test model produced by the H.264 working group.
Motion-JPEG
- A video coder(jpegenc) and decoder(jpegdec) based on the ISO JPEG standard for image compression.** Source code produced by the Independent JPEG Group.
Motion-JPEG2000
- A video coder(jp2Kenc) and decoder(jp2Kdec) based on the recent ISO JPEG standard for wavelet-based image compression.
- Source code is the JasPer library for JPEG-2000.
MPEG-2
- A video coder(mpeg2enc) and decoder(mpeg2dec) based on the ISO MPEG-2 standard for high-quality video coding.
- Source code produced by the MPEG Software Simulations Groups(MSSG)
MPEG-4
- A video coder(mpeg4enc) and decoder(mpeg4dec) based on the recent ISO MPEG-4 standard for object-based and very-low bit-rate video coding.
- Source code is the ffmpeg library for audio/video coding.

NU-MineBench

NU-MineBench 2.0
J. Pisharath, et al, 2005.

k-Means(clustering)
- is a partition-based method and is arguably the most commonly used clustering technique.
- "hard" clusters, since any data object either is or is not a member of a particular cluster
Fuzzy k-Means(clustering)
- K-means algorithm relaxes the condition.
BIRCH(clustering)
- is one of the hierarchical clustering methods that employ a hierarchical tree to represent the closeness of data objects.
HOP(clustering)
- Density-based methods grow clusters according to the density of neighboring objects ore according to some other density function.
- highly scalable when applied to large databases
Naive Bayesian(classification)
- a simple statistical classifier uses an input training data-set to build a predictive model.
- based on Bayes' Theorem
ScalParC(classification)
- a scalable decision tree based classifier
- builds the decision tree by recursively splitting the training data-set
Apriori(Association Rule Mining)
- explores the level-wise mining using the Apriori property
Eclat(Association Rule Mining)
- uses a vertical database format instead of the hash trees(horizontal format) as in apriori
- enables breaking the search space into small, independent, and manageable chunks
Utility(Association Rule Mining)
- based data mining technique where higher "utility" item-sets are identified from a database by considering different values of individual items as utilities.
SNP(Bayesian network)
- Single nucleotide polymorphisms(SNP) are DNA sequence variations that occur when a single nucleotide is altered in the genome sequence.
GeneNet(Bayesian network)
- uses a similar hill climbing algorithm as in SNP
- the input data is the micro-array data
SEMPHY(Expectation Maximization)
- is a structure learning algorithm that is based on phylogenetic trees.
- uses Structural Expectation Maximization(probability estimation) algorithm to address this complication
Rsearch(Pattern Recognition)
- uses a grammar based approach to achieve this goal.
- uses SCFG(Stochastic Context-Free Grammar)
SVM-RFE(Support Vector Machine)
- is a feature selection method that uses SVM techniques to refine and identify the optimum feature set in the feature data.
PLSA(Dynamic Programming)
- uses a dynamic programming approach to identify the similar and diverged regions between two sequences(eg. biological DNA/protein sequence, or text strings)
- since this method is dependent on the sequence length, it is computationally very intense.

SPLASH-2

The SPLASH-2 programs: characterization and methodological considerations
Steven Cameron Woo, et al, 1995

Barns
- the interaction of a system bodies(galaxies or particles, etc) in 3D over a number of time-steps, using the Barnes-Hut hierarchical N-body method
Cholesky
- The blocked sparse Cholesky factorization kernel factors a sparse matrix into the product of a lower triangular matrix and its transpose
- LUとの違い:sparse matrices, not globally synchronized between steps
FFT
- a complex 1-D version of the radix-√n six-step FFT algorithms
FMM
- simulates a system of bodies over a number of time-steps
- in 2-D using a different hierarchical N-body method(the adaptive Fast Multipole Method)
LU
- factors a dense matrix into the product of a lower triangular and an upper triangular matrix
- n*nの行列AはB*BブロックのN*Nのarrayに分割される．(n = N*B)
Ocean
- studies large-scale ocean movements based on eddy and boundary currents
Radiosity
- computes the equilibrium distribution of light in a scene using the iterative hierarchical diffuse radiosity method
Radix
- The integer radix sort kernel
- the permutation step in the final step requires all-to-all communication.
Raytrace
- renders a 3-D scene using ray tracing.
Volrend
- renders a 3-D volume using a ray casting technique.
- data access are input-dependent and irregular, and no attempt is made at intelligent data distribution.

BioBench

BioBench: A Benchmark Suite of Bioinformatics Applications
K. Albayraktaroglu, et al, 2005

blastn
- DNA sequence searching
blastp
- Protein sequence searching
clustalw
- Multiple sequence alignment
fasta_dna
- DNA sequence searching
fasta_prot
- Protein sequence searching
hmmer
- Sequence profile searching
mummer
- Genome-level alignment
protpars
- Phylogenetic analysis
tigr
- Sequence assembly

BioParallel(in C++)

Last level cache (LLC) performance of data mining workloads on a CMP - a case study of parallel bioinformatics workloads
A. Jaleel, et al, 2006

GeneNet
- is used to measure the regulatory relationship between genes.
- hill-climbing as its main search algorithm
SNP
- is used to measure and understand the patterns of Single Nucleotide Plymorphisms.
SEMPHY
- is a tool for constructing phylogenetic trees.
Support Vector Machines Recursive Feature Elimination
- is used to eliminate gene redundancy from a given input data set in order to provide compact gene subset.
PLSA
- is used to identify the similarities or differences between two genetic sequences.

一言メモ

たんたんめん食べた (Wed Jan 13 13:37:25 2010 +0900)

Diary/2010-1-13

各種ベンチマーク

一言メモ

検索

趣味の工作

コンピュータ

イベント

リンク