[prev in list] [next in list] [prev in thread] [next in thread] 

List:       pypy-dev
Subject:    Re: [pypy-dev] Great experience with PyPy
From:       Gelin Yan <dynamicgl () gmail ! com>
Date:       2013-02-07 15:12:53
Message-ID: CABkOF6THh6K-+6jAzuPzS0SY1K7g2XKmwRPoXaUQnRx_2cRzDQ () mail ! gmail ! com
[Download RAW message or body]

[Attachment #2 (multipart/alternative)]


On Thu, Feb 7, 2013 at 10:11 PM, Phyo Arkar <phyo.arkarlwin@gmail.com>wrote:

> Pypy should have a page for "Success Stories!"
>
> Now with this and Quora proving Power of PyPy , i am beginning to start
> converting my projects into PyPy soon!
> I am only withholding right now because my projects uses a lot of C
> Libraries and Numpy/Matplotlib/scilit-learn.
>
> Thanks
>
> Phyo.
>
> On Thursday, February 7, 2013, Maciej Fijalkowski wrote:
>
>> On Thu, Feb 7, 2013 at 1:55 PM, Marko Tasic <mtasic85@gmail.com> wrote:
>> > Hi,
>> >
>> > I would like to share short story with you and share what we have
>> > accomplished with PyPy and its friends so far.
>> >
>> > Company that I have worked for last 7 months (intentionally unnamed)
>> > gave me absolute permission to pick up technologies on which we based
>> > our solution. What we do is: crawl for PDFs and newspapers articles,
>> > download, translate them if needed, OCR if needed, do extensive
>> > analysis of downloaded PDFs and articles, store them in more organized
>> > structures for faster querying, search for them and generate bunch of
>> > complex reports.
>> >
>> > From very beginning I decided to go with PyPy no matter what. What we
>> > picked is following:
>> > * Flask for web framework, and few of its extensions such as
>> > Flask-Login, Flask-Principal, Flask-WTF, Flask-Mail, etc.
>> > * Cassandra as database because of its features and great experience
>> > with it. PyCassa is used as client to talk to Cassandra server.
>> > * ElasticSearch as distributed search engine, and its client library
>> pyes.
>> > * Whoosh as search engine, but with some modifications to support
>> > Cassandra as storage and distributed locking.
>> > * Redis, and its client library redis-py, for caching and to speed up
>> > common auto-completion patterns.
>> > * ZooKeeper, and its client library Kazoo, for distributed locking
>> > which plays essential role in system for transaction-like behavior
>> > over many services at once.
>> > * Celery in conjunction with RabbitMQ for task distribution.
>> > * Sentry for error logging.
>> >
>> > What we have developed on our own are wrappers and clients for:
>> > * Moses which is language translator
>> > * Tesseract which is OCR engine
>> > * Cassandra store for Whoosh
>> > * wkhtmltopdf and wkhtmltoimage which are used for conversion of HTML
>> > to PDF/Image
>> > * etc
>> >
>> > Now when product is finished and in final testing phase, I can say
>> > that we did not regret because we used PyPy and stack around it.
>> > Typical speed improvement is 2x-3x over CPython in our case, but
>> > anyway we are mostly IO and memory bound, expect for Celery workers
>> > where we do analysis which are again many small CPU intensive tasks
>> > that are exchanged via RabbitMQ. Another reason why we don't see
>> > speedup us is that we are dependent on external software (servers)
>> > written in Erlang and Java.
>> >
>> > I'm already planing to do Cassandra (distributed key/value only
>> > database without index features), ZooKeeper, Redis and ElasticSearch
>> > ports in Python for next projects, and hopefully opensource them.
>> >
>> > Regards,
>> > Marko Tasic
>> > _______________________________________________
>> > pypy-dev mailing list
>> > pypy-dev@python.org
>> > http://mail.python.org/mailman/listinfo/pypy-dev
>>
>> Awesome!
>>
>> I'm glad people can make pypy work for non-trivial tasks which require
>> a lot of dependencies. We're trying to lower the bar, however it takes
>> time.
>>
>> Cheers,
>> fijal
>> _______________________________________________
>> pypy-dev mailing list
>> pypy-dev@python.org
>> http://mail.python.org/mailman/listinfo/pypy-dev
>>
>
> _______________________________________________
> pypy-dev mailing list
> pypy-dev@python.org
> http://mail.python.org/mailman/listinfo/pypy-dev
>
>

Hi, It might be off topic. I want to know whether pypy support postgres.
The last time I noticed ctypes based psycopg2 was still beta. I mainly use
twisted & postgres. pypy supports twisted well but not good for psycopg2.

Regards

gelin yan

[Attachment #5 (text/html)]

<br><br><div class="gmail_quote">On Thu, Feb 7, 2013 at 10:11 PM, Phyo Arkar <span \
dir="ltr">&lt;<a href="mailto:phyo.arkarlwin@gmail.com" \
target="_blank">phyo.arkarlwin@gmail.com</a>&gt;</span> wrote:<br><blockquote \
class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc \
solid;padding-left:1ex"> Pypy should have a page for &quot;Success \
Stories!&quot;<div><br></div><div>Now with this and Quora proving Power of PyPy , i \
am beginning to start converting my projects into PyPy soon!</div><div>I am only \
withholding right now because my projects uses a lot of C Libraries and \
Numpy/Matplotlib/scilit-learn.</div>

<div><br></div><div>Thanks</div><span class="HOEnZb"><font \
color="#888888"><div><br></div><div>Phyo.</div></font></span><div class="HOEnZb"><div \
class="h5"><div><br>On Thursday, February 7, 2013, Maciej Fijalkowski  wrote:<br> \
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc \
solid;padding-left:1ex"> On Thu, Feb 7, 2013 at 1:55 PM, Marko Tasic \
&lt;<a>mtasic85@gmail.com</a>&gt; wrote:<br> &gt; Hi,<br>
&gt;<br>
&gt; I would like to share short story with you and share what we have<br>
&gt; accomplished with PyPy and its friends so far.<br>
&gt;<br>
&gt; Company that I have worked for last 7 months (intentionally unnamed)<br>
&gt; gave me absolute permission to pick up technologies on which we based<br>
&gt; our solution. What we do is: crawl for PDFs and newspapers articles,<br>
&gt; download, translate them if needed, OCR if needed, do extensive<br>
&gt; analysis of downloaded PDFs and articles, store them in more organized<br>
&gt; structures for faster querying, search for them and generate bunch of<br>
&gt; complex reports.<br>
&gt;<br>
&gt; From very beginning I decided to go with PyPy no matter what. What we<br>
&gt; picked is following:<br>
&gt; * Flask for web framework, and few of its extensions such as<br>
&gt; Flask-Login, Flask-Principal, Flask-WTF, Flask-Mail, etc.<br>
&gt; * Cassandra as database because of its features and great experience<br>
&gt; with it. PyCassa is used as client to talk to Cassandra server.<br>
&gt; * ElasticSearch as distributed search engine, and its client library pyes.<br>
&gt; * Whoosh as search engine, but with some modifications to support<br>
&gt; Cassandra as storage and distributed locking.<br>
&gt; * Redis, and its client library redis-py, for caching and to speed up<br>
&gt; common auto-completion patterns.<br>
&gt; * ZooKeeper, and its client library Kazoo, for distributed locking<br>
&gt; which plays essential role in system for transaction-like behavior<br>
&gt; over many services at once.<br>
&gt; * Celery in conjunction with RabbitMQ for task distribution.<br>
&gt; * Sentry for error logging.<br>
&gt;<br>
&gt; What we have developed on our own are wrappers and clients for:<br>
&gt; * Moses which is language translator<br>
&gt; * Tesseract which is OCR engine<br>
&gt; * Cassandra store for Whoosh<br>
&gt; * wkhtmltopdf and wkhtmltoimage which are used for conversion of HTML<br>
&gt; to PDF/Image<br>
&gt; * etc<br>
&gt;<br>
&gt; Now when product is finished and in final testing phase, I can say<br>
&gt; that we did not regret because we used PyPy and stack around it.<br>
&gt; Typical speed improvement is 2x-3x over CPython in our case, but<br>
&gt; anyway we are mostly IO and memory bound, expect for Celery workers<br>
&gt; where we do analysis which are again many small CPU intensive tasks<br>
&gt; that are exchanged via RabbitMQ. Another reason why we don&#39;t see<br>
&gt; speedup us is that we are dependent on external software (servers)<br>
&gt; written in Erlang and Java.<br>
&gt;<br>
&gt; I&#39;m already planing to do Cassandra (distributed key/value only<br>
&gt; database without index features), ZooKeeper, Redis and ElasticSearch<br>
&gt; ports in Python for next projects, and hopefully opensource them.<br>
&gt;<br>
&gt; Regards,<br>
&gt; Marko Tasic<br>
&gt; _______________________________________________<br>
&gt; pypy-dev mailing list<br>
&gt; <a>pypy-dev@python.org</a><br>
&gt; <a href="http://mail.python.org/mailman/listinfo/pypy-dev" \
target="_blank">http://mail.python.org/mailman/listinfo/pypy-dev</a><br> <br>
Awesome!<br>
<br>
I&#39;m glad people can make pypy work for non-trivial tasks which require<br>
a lot of dependencies. We&#39;re trying to lower the bar, however it takes<br>
time.<br>
<br>
Cheers,<br>
fijal<br>
_______________________________________________<br>
pypy-dev mailing list<br>
<a>pypy-dev@python.org</a><br>
<a href="http://mail.python.org/mailman/listinfo/pypy-dev" \
target="_blank">http://mail.python.org/mailman/listinfo/pypy-dev</a><br> \
</blockquote></div> </div></div><br>_______________________________________________<br>
 pypy-dev mailing list<br>
<a href="mailto:pypy-dev@python.org">pypy-dev@python.org</a><br>
<a href="http://mail.python.org/mailman/listinfo/pypy-dev" \
target="_blank">http://mail.python.org/mailman/listinfo/pypy-dev</a><br> \
<br></blockquote></div><br><div><br></div><div>Hi, It might be off topic. I want to \
know whether pypy support postgres. The last time I noticed ctypes based psycopg2 was \
still beta. I mainly use twisted &amp; postgres. pypy supports twisted well but not \
good for psycopg2.</div> <div><br></div><div>Regards</div><div><br></div><div>gelin \
yan</div>



_______________________________________________
pypy-dev mailing list
pypy-dev@python.org
http://mail.python.org/mailman/listinfo/pypy-dev


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic