[prev in list] [next in list] [prev in thread] [next in thread]
List: john-dev
Subject: Re: [john-dev] Hello, interested on slow hash and fast hash on GPU
From: Solar Designer <solar () openwall ! com>
Date: 2012-03-27 9:22:09
Message-ID: 20120327092209.GC14695 () openwall ! com
[Download RAW message or body]
On Mon, Mar 26, 2012 at 04:48:26AM +0800, myrice wrote:
> 1) I read papers you recently posted. And I emailed author of qhasm-cudasm
> for requesting the tool.
Please let us know if/what you hear back. (BTW, I think that multiple
authors contributed to it.)
> 2) The bitslice implementations of SHA-256 or SHA-512 on GPUs are worth
> discussing. I read your bitslice implementation of MD5. It takes advantages
> of sse2 to compute MD5 hashes at one time. Now raw-sha256-cuda
> implementation already use SIMD and/or bitslice.
Just where did you find any "SIMD and/or bitslice" in the
raw-sha256-cuda implementation? While the compiler can auto-vectorize
(at least theoretically), I don't see that in the source code.
We only have non-bitslice implementations of these currently (except for
my experiment with MD5 on CPUs that you saw), and it is not known
whether non-bitslice SIMD or bitslice will be faster (it may vary by
target architecture, etc.) This is why I suggest that whoever works on
this should try both approaches. My guess is that non-bitslice (with
SIMD where appropriate) will be faster on currently common CPUs/GPUs
(speaking of SHA-256 and SHA-512), but it does not hurt to try bitslice
as well.
> I tried use more
> threads(than now just 1) to compute one hash. However, the data dependence
> make it hard to implement. I am looking forward ideas of optimization.
You should be computing many hashes, not one.
> 3) As you suggested, I am starting write DES format on cuda. I will make
> another post to track my progress and clarify questions on it.
OK. This was not exactly my suggestion to you - I was merely answering
questions on what remains to be done in terms of JtR/GPU, and the DES
stuff is among tasks that haven't been approached yet (as it relates to
JtR/GPU only).
Alexander
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic