Sometimes when programming we need to tune a small portion of code that is critical to an implementation. For example, an inner loop may involve a pixel-bashing operation which dominates the program’s overall performance. If this operation uses a comparison, and that results in the compiled code branching, it can hurt performance on pipelined CPUs. It may be better to find a branch-free alternative, even if it appears to make the code slightly more complex.
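A minimal sketch of the idea, clamping a pixel value to [0, 255] without branches (the function names are made up; the branch-free version assumes arithmetic right shift of a negative int, i.e. two's complement, which is the common case and guaranteed since C++20):

#include <cstdint>

// Branchy clamp: the comparisons may compile to conditional jumps.
inline uint8_t clamp_branchy(int v) {
    if (v < 0)   return 0;
    if (v > 255) return 255;
    return static_cast<uint8_t>(v);
}

// Branch-free clamp: masks derived from sign bits replace the jumps.
// Assumes arithmetic right shift of negative int (two's complement).
inline uint8_t clamp_branchless(int v) {
    v &= ~(v >> 31);        // v < 0   -> mask is all ones, v becomes 0
    v |= (255 - v) >> 31;   // v > 255 -> low byte becomes 0xFF
    return static_cast<uint8_t>(v);
}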
Virtual dispatch can be relatively expensive in terms of clock cycles due to multiple levels of indirection, including an indirect branch as well as a possible this-pointer adjustment. Wise programmers do not use virtual dispatch without a good reason, but it is often required, either by design or when creating non-template reusable components/libraries where the final implementation of some parts of the program is not known.
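A rough sketch of the trade-off, with made-up Filter/Invert names: the first loop pays for an indirect call per pixel, while the template version lets the compiler inline the call:

// Each call through the Filter interface goes through the vtable:
// load the vtable pointer, then take an indirect branch.
struct Filter {
    virtual ~Filter() = default;
    virtual int apply(int pixel) const = 0;
};

struct Invert : Filter {
    int apply(int pixel) const override { return 255 - pixel; }
};

// Hot loop paying for one virtual dispatch per pixel.
void run_virtual(const Filter& f, int* pixels, int n) {
    for (int i = 0; i < n; ++i)
        pixels[i] = f.apply(pixels[i]);
}

// Non-virtual functor plus a template: the concrete type is known at
// compile time, so the call can be inlined and the indirect branch goes away.
struct InvertFn {
    int operator()(int pixel) const { return 255 - pixel; }
};

template <typename F>
void run_static(F f, int* pixels, int n) {
    for (int i = 0; i < n; ++i)
        pixels[i] = f(pixels[i]);
}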
Memoization is a pretty well-known optimization technique which consists in “remembering” (i.e.: caching) the results of previous calls to a function, so that repeated calls with the same parameters are resolved without repeating the original computation.
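A minimal illustration using the classic Fibonacci example (not tied to any particular library):

#include <cstdint>
#include <unordered_map>

// Memoized Fibonacci: each result is cached on first computation, so
// repeated calls with the same argument are answered from the cache.
uint64_t fib(unsigned n) {
    static std::unordered_map<unsigned, uint64_t> cache;
    if (n < 2) return n;
    auto it = cache.find(n);
    if (it != cache.end()) return it->second;
    uint64_t result = fib(n - 1) + fib(n - 2);
    cache.emplace(n, result);
    return result;
}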
Hardware performance monitoring counters have recently received a lot of attention. They have been used by diverse communities to understand and improve the quality of computing systems: for example, architects use them to extract application characteristics and propose new hardware mechanisms; compiler writers study how generated code behaves on particular hardware; software developers identify critical regions of their applications and evaluate design choices to select the best performing implementation. We propose that counters be used by all categories of users, in particular non-experts, and we advocate that a few simple metrics derived from these counters are relevant and useful. For example, a low IPC (number of executed instructions per cycle) indicates that the hardware is not performing at its best; a high cache miss ratio can suggest several causes, such as conflicts between processes in a multicore environment.
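As an example, perf stat can report the raw counters these metrics are derived from (exact event names vary by CPU and kernel; "program" stands for whatever you are measuring):

perf stat -e cycles,instructions,cache-references,cache-misses ./program
(IPC = instructions / cycles; cache miss ratio = cache-misses / cache-references)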
This article is an attempt to sum up a small number of generic rules that appear to be useful rules of thumb when creating high-performing programs. It first establishes some fundamental causes of performance hits and then extends them.
When optimizing memory access, and memory cache misses in particular, there are surprisingly few tools to help you. valgrind’s cachegrind tool is the closest one I’ve found. It gives you a lot of information on cache misses, but not necessarily in the form you need.
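For reference, a typical run looks roughly like this (cachegrind writes its results to a cachegrind.out.<pid> file, which cg_annotate then summarizes per function):

valgrind --tool=cachegrind ./program
cg_annotate cachegrind.out.<pid>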
This is really black magic!
Also see: http://www.agner.org/optimize/microarchitecture.pdf
How to use SSE vector instructions
Also: http://felix.abecassis.me/2011/09/cpp-getting-started-with-sse/
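A small sketch using SSE intrinsics; it assumes n is a multiple of 4 and the pointers are 16-byte aligned (otherwise use _mm_loadu_ps/_mm_storeu_ps):

#include <immintrin.h>

// Add two float arrays four elements at a time with SSE.
void add_sse(const float* a, const float* b, float* out, int n) {
    for (int i = 0; i < n; i += 4) {
        __m128 va = _mm_load_ps(a + i);   // load 4 packed, aligned floats
        __m128 vb = _mm_load_ps(b + i);
        _mm_store_ps(out + i, _mm_add_ps(va, vb));  // add and store 4 floats
    }
}

Compile with -msse (already implied on x86-64, where SSE is always available).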
perf is a performance tool like gprof, but it's more user-friendly :)
Use it like this:
perf record -p $(pidof program)
(time passes, press Ctrl-C)
perf report -i perf.data