jm + optimization 13
Measuring & Optimizing I/O Performance
8 days ago by jm
Another good writeup on iostat and EBS, from Ilya Grigorik
io
optimization
sysadmin
performance
iostat
ebs
aws
ops
Jetty-9 goes fast with Mechanical Sympathy
february 2013 by jm
This is very cool! Applying Mechanical Sympathy optimization techniques to Jetty, specifically: "False sharing" on the BlockingArrayQueue data structure resolved; a new ArrayTernaryTrie data structure to improve header field storage, making it faster to build and look up, efficient on RAM, cheap to GC, and more cache-friendly than a traditional trie; and a branchless hex-to-byte conversion statement. The results are a 30%-faster microbenchmark on amd64, with 50% fewer Young Gen garbage collections. Lovely to see low-level infrastructure libs like Jetty getting this kind of optimization.
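To make the "branchless hex-to-byte" idea concrete, here is a minimal sketch of one branch-free formulation, written in Python for brevity. It assumes the input is always a valid ASCII hex digit, the function names are hypothetical, and it is not claimed to be the exact statement Jetty uses.

```python
def hex_digit_to_value(c: int) -> int:
    """Convert the ASCII code of one hex digit to its value (0-15) without branching.

    Assumes c is a valid hex digit: '0'-'9', 'a'-'f' or 'A'-'F'.
    """
    # (c & 0x1f) maps '0'-'9' to 16-25 and letters of either case to 1-6.
    # (c >> 6) is 0 for digits and 1 for letters, so letters get an extra 25.
    # Subtracting 16 then yields 0-9 for digits and 10-15 for letters.
    return (c & 0x1F) + ((c >> 6) * 0x19) - 0x10


def hex_pair_to_byte(hi: int, lo: int) -> int:
    """Combine two ASCII hex digits (high nibble first) into one byte value."""
    return (hex_digit_to_value(hi) << 4) | hex_digit_to_value(lo)
```

In an interpreted language this will not show the speedup; the point is only the arithmetic, which replaces the usual chain of range checks with a few bit operations.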
jetty
java
mechanical-sympathy
optimization
coding
tries
Implementing strcmp, strlen, and strstr using SSE 4.2 instructions - strchr.com
january 2013 by jm
Using new Intel Core i7 instructions to speed up string manipulation. Fascinating stuff. SSE ftw
sse
optimization
simd
assembly
intel
i7
intel-core
strstr
strings
string-matching
strchr
strlen
coding
AnandTech - The Intel SSD DC S3700: Intel's 3rd Generation Controller Analyzed
november 2012 by jm
Interesting trend; Intel moved from a btree to an array-based data structure for their logical-block address indirection map, in order to reduce worst-case latencies (via Martin Thompson)
latency
intel
via:martin-thompson
optimization
speed
p99
data-structures
arrays
btrees
ssd
hardware
PCRE Performance Project
september 2012 by jm
Excellent stuff. Using "sljit", a stackless platform-independent JIT compiler, this compiles Perl-compatible regular expressions to machine code on ARM, x86, MIPS and PowerPC platforms, resulting in 'similar matching speed to DFA based engines (like re2) on common patterns' with Perl compatibility. 'This work has been released as part of PCRE 8.20 and above. Now (PCRE 8.31), nearly all PCRE features are supported including UTF-8/16 and partial matching.'
pcre
regexps
regex
performance
optimization
jit
compilation
dfa
re2
via:akohli
Avoiding Hash Lookups in a Ruby Implementation
september 2012 by jm
'If I were to sum up the past 6 years I've spent optimizing JRuby it would be with the following phrase: Get Rid Of Hash Lookups.'
This has been a particular theme of some recent optimization hacks I've been working on. Hashes may be O(1) to read, on average, but that doesn't necessarily mean they're the right tool for performance...
(via Declan McGrath)
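A rough illustration of the idea, in Python rather than Ruby and with hypothetical names throughout: resolve a name to a slot index once, up front, so the hot path does a plain positional index instead of hashing a string on every access. This is a sketch of the general technique, not something taken from JRuby.

```python
class SlotLayout:
    """Maps field names to fixed list positions, resolved once up front."""

    def __init__(self, names):
        self._index = {name: i for i, name in enumerate(names)}

    def slot(self, name):
        # The only hash lookup: done once, outside the hot loop.
        return self._index[name]


def sum_field_by_name(records, name):
    # Baseline: one dict (hash) lookup per record.
    return sum(rec[name] for rec in records)


def sum_field_by_slot(rows, slot):
    # Hot path: plain positional indexing, no hashing.
    return sum(row[slot] for row in rows)


fields = ["id", "size", "flags"]
layout = SlotLayout(fields)
rows = [[1, 10, 0], [2, 20, 1], [3, 30, 0]]
records = [dict(zip(fields, row)) for row in rows]

size_slot = layout.slot("size")
total_by_slot = sum_field_by_slot(rows, size_slot)
total_by_name = sum_field_by_name(records, "size")
```

Whether this actually wins depends on the runtime and the access pattern, so it is worth measuring before restructuring anything around it.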
via:declanmcgrath
hash
optimization
ruby
performance
jruby
hashing
data-structures
big-o
optimisation
Expensive lessons in Python performance tuning
july 2012 by jm
some good advice for large-scale Python performance: prun and guppy for profiling, namedtuples for memory efficiency, and picloud for trivial EC2-based scale-out. (via Nelson)
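A quick way to see the namedtuple memory point for yourself, as a minimal sketch assuming CPython (where `sys.getsizeof` is available). The `Point` type and its field names are made up for the example, and the exact byte counts vary by Python version.

```python
import sys
from collections import namedtuple

# Hypothetical record type with three fields.
Point = namedtuple("Point", ["x", "y", "z"])

as_dict = {"x": 1.0, "y": 2.0, "z": 3.0}
as_tuple = Point(1.0, 2.0, 3.0)

# sys.getsizeof reports the shallow size of the container only,
# not the floats it references, which is the part that differs here.
dict_bytes = sys.getsizeof(as_dict)
tuple_bytes = sys.getsizeof(as_tuple)

print(f"dict: {dict_bytes} bytes, namedtuple: {tuple_bytes} bytes")
```

The saving per record is small, but it adds up when millions of records are held in memory at once, and field access by name (`as_tuple.y`) still works.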
picloud
prun
guppy
namedtuples
python
optimization
performance
tuning
profiling
Fastest sort of fixed length 6 int array - Stack Overflow
august 2011 by jm
huh, I'd never heard of sorting networks before
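For anyone else new to them: a sorting network is a fixed sequence of compare-and-swap steps that does not depend on the data. Below is a small sketch of a 12-comparator network for 6 inputs, in Python for readability (the Stack Overflow question is about C). The names `NETWORK_6` and `sort6` are just for this example.

```python
# Comparator pairs for a 12-comparator, 6-input sorting network:
# sort each group of three, then merge the two sorted groups.
NETWORK_6 = [
    (1, 2), (0, 2), (0, 1),
    (4, 5), (3, 5), (3, 4),
    (0, 3), (1, 4), (2, 5),
    (2, 4), (1, 3), (2, 3),
]


def sort6(values):
    """Return a sorted copy of a 6-element sequence using a fixed sorting network."""
    if len(values) != 6:
        raise ValueError("sort6 expects exactly 6 elements")
    a = list(values)
    for i, j in NETWORK_6:
        # Compare-exchange: which positions get compared never depends on the data.
        if a[i] > a[j]:
            a[i], a[j] = a[j], a[i]
    return a
```

Because the comparison sequence is fixed, a compiled version can be fully unrolled with no loops, and each compare-exchange can be expressed as a min/max pair, which is what makes the approach attractive for tiny fixed-size arrays.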
sorting-networks
c
algorithms
sorting
optimization
sort
stack-overflow
Linux Profiling tools and techniques
april 2011 by jm
great tips for system-level and app-level profiling on Linux from Padraig
profiling
optimization
linux
cache
valgrind
Blosc
october 2010 by jm
A high-performance compressor optimized for binary data -- 'designed to transmit data to the processor cache faster than a traditional, non-compressed, direct memory fetch via memcpy()' (via Bill de hOra)
via:dehora
compression
memcpy
caching
l1
software
memory
optimization
performance
python
pytables
from delicious
The MySQL “swap insanity” problem and the effects of the NUMA architecture
september 2010 by jm
very interesting; modern multicore x86 architectures use a NUMA memory architecture, which can cause a dip into swap, even when there appears to be plenty of free RAM available
linux
memory
mysql
optimization
performance
swap
tuning
vm
numa
swap-insanity
swapping
from delicious
spamtune.pl
april 2010 by jm
'a Perl script that generates an OpenOffice.org spreadsheet which loads up SpamAssassin configuration and known spam and ham messages. Once loaded, you can tweak individual SpamAssassin scores in the spreadsheet itself and see their effect on spam/ham classification in real-time. The script also shows you the number of false positives and negatives for a set of scores in real-time.' by Raj Mathur <raju at linux-delhi.org>
spamtune
spamassassin
rules
scores
optimization
tweaking
openoffice
from delicious
"Source Code Optimisation", Felix von Leitner, Linux Kongress 2009 [PDF]
november 2009 by jm
Good presentation on C compiler optimization, via Cal Henderson. 'People often write less readable code because they think it will produce faster code. Unfortunately, in most cases, the code will not be faster.' I particularly like 'Fancy-Schmancy Algorithms': 'If you have 10-100 elements, use a list, not a red-black tree; Fancy data structures help on paper, but rarely in reality. (More space overhead in the data structure, less L2 cache left for actual data.)'
via:iamcal
compilers
c
c++
optimization
coding
assembly
speed
from delicious