<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPU Software Blog</title>
	<atom:link href="http://blog.accelereyes.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.accelereyes.com/blog</link>
	<description>Helpful posts about GPU computing. Discussion of Jacket and ArrayFire. Real speedups on real code!</description>
	<lastBuildDate>Mon, 23 Jan 2012 13:29:08 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>AccelerEyes Webinar Video &#8211; Medical Image Segmentation</title>
		<link>http://blog.accelereyes.com/blog/2012/01/19/accelereyes-webinar-video-medica/</link>
		<comments>http://blog.accelereyes.com/blog/2012/01/19/accelereyes-webinar-video-medica/#comments</comments>
		<pubDate>Thu, 19 Jan 2012 21:59:37 +0000</pubDate>
		<dc:creator>vishy</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[Videos]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=2173</guid>
		<description><![CDATA[In case you missed it, we recently held a webinar on how to accelerate common medical imaging applications using an easy, powerful programming library with Jacket for MATLAB®. This webinar was part of an ongoing series of webinars that will help you learn more about the many applications of Jacket and ArrayFire, while interacting with AccelerEyes [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p>In case you missed it, we recently held a webinar on how to accelerate common medical imaging applications using an easy, powerful programming library with Jacket for MATLAB®.</p>
<p>This webinar was part of an ongoing <a href="http://blog.accelereyes.com/blog/2012/01/12/accelereyes_webinars_2012q1/">series of webinars</a> that will help you learn more about the many applications of Jacket and ArrayFire, while interacting with AccelerEyes GPU computing experts.  Gallagher Pryor, CTO of AccelerEyes, used the Bayesian Image Segmentation algorithm as a simple use-case to show how easy it is to convert CPU code to GPU code with Jacket (only 4 lines of CPU code needed to be changed!).</p>
<p>For those of you who missed it, we uploaded the webinar on Youtube. We hope to see you at <a href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes">the next one</a>!<br />
<center><iframe align="center" src="http://www.youtube.com/embed/yWaibjgdOEg" frameborder="0" width="420" height="315"></iframe></center></p>
<div class="shr-publisher-2173"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2012%2F01%2F19%2Faccelereyes-webinar-video-medica%2F' data-shr_title='AccelerEyes+Webinar+Video+-+Medical+Image+Segmentation'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2012/01/19/accelereyes-webinar-video-medica/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Jacket over Remote Desktop for Tesla and Quadro GPUs</title>
		<link>http://blog.accelereyes.com/blog/2012/01/17/jacket-over-remote-desktop-for-tesla-and-quadro-gpus/</link>
		<comments>http://blog.accelereyes.com/blog/2012/01/17/jacket-over-remote-desktop-for-tesla-and-quadro-gpus/#comments</comments>
		<pubDate>Tue, 17 Jan 2012 22:19:30 +0000</pubDate>
		<dc:creator>vishy</dc:creator>
				<category><![CDATA[CUDA]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[Parallel computing]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=2166</guid>
		<description><![CDATA[We recently reported that Jacket could be used over Windows Remote Desktop connections as long as you had an NVIDIA Tesla device in TCC mode. With the latest NVIDIA driver updates, Tesla and Quadro devices can be put into TCC mode, making it possible to use Jacket over Remote Desktop with both Tesla and Quadro [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p style="text-align: justify;">We recently reported that Jacket <a title="CUDA over Remote Desktop now available for Tesla GPUs" href="http://blog.accelereyes.com/blog/2011/02/10/cuda_remote_desktop_for_tesla_gpus/">could be used over</a> Windows Remote Desktop connections as long as you had an NVIDIA Tesla device in TCC mode. With the latest NVIDIA driver updates, Tesla <em>and</em> Quadro devices can be put into TCC mode, making it possible to use Jacket over Remote Desktop with both Tesla <em>and</em> Quadro devices.</p>
<p style="text-align: justify;">We have tested this out with the NVIDIA Quadro 4000 as well as Quadro 6000 GPUs. The system had a Tesla C2050 connected to the display, and the Quadro in TCC mode. Here&#8217;s the ginfo output:</p>
<pre>&gt;&gt; ginfo
Jacket v2.0 (build 80c7ba4) by AccelerEyes (64-bit Windows)
License Type: Designated Computer ([JACKET_ROOT]\jacket\engine\jlicense.dat)
Addons: MGL4, JMC, SDK, DLA, SLA
CUDA toolkit 4.0, driver 285.62
GPU1 Quadro 4000, 2048 MB, Compute 2.0 (single,double)
Memory Usage: 1977 MB free (2048 MB total)</pre>
<p style="text-align: justify;">Jacket over Remote Desktop is documented extensively on the <a title="AccelerEyes Wiki - Jacket Over Remote Connections" href="http://wiki.accelereyes.com/wiki/index.php/Jacket_Over_Remote_Connections">AccelerEyes Wiki</a>. Please check that page for more information.</p>
<p style="text-align: justify;">
<div class="shr-publisher-2166"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2012%2F01%2F17%2Fjacket-over-remote-desktop-for-tesla-and-quadro-gpus%2F' data-shr_title='Jacket+over+Remote+Desktop+for+Tesla+and+Quadro+GPUs'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2012/01/17/jacket-over-remote-desktop-for-tesla-and-quadro-gpus/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AccelerEyes Webinar Series</title>
		<link>http://blog.accelereyes.com/blog/2012/01/12/accelereyes_webinars_2012q1/</link>
		<comments>http://blog.accelereyes.com/blog/2012/01/12/accelereyes_webinars_2012q1/#comments</comments>
		<pubDate>Thu, 12 Jan 2012 15:51:10 +0000</pubDate>
		<dc:creator>scott</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[announcement]]></category>
		<category><![CDATA[ArrayFire]]></category>
		<category><![CDATA[cuda]]></category>
		<category><![CDATA[Jacket]]></category>
		<category><![CDATA[matlab]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=2133</guid>
		<description><![CDATA[AccelerEyes invites you to participate in series of webinars designed to help you learn more about Jacket for MATLAB® and ArrayFire for C/C++/Fortran/Python, a comprehensive library of GPU-accelerated functions. GPU Programming for Medical Image Segmentation: January 18, 2012 at 3:00 p.m. EST There&#8217;s a huge volume of data generated using acquisition modalities like computer tomography (CT), magnetic resonance imaging (MRI), positron emission tomography or [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p>AccelerEyes invites you to participate in series of <a title="Register for Webinar Series" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?nomenu=true&amp;siteurl=accelereyes&amp;service=6&amp;rnd=0.4385461764274333&amp;main_url=https%3A%2F%2Faccelereyes.webex.com%2Fec0605ld%2Feventcenter%2Fprogram%2FprogramDetail.do%3FtheAction%3Ddetail%26siteurl%3Daccelereyes%26cProgViewID%3D0" target="_blank">webinars</a> designed to help you learn more about <a title="Learn about Jacket for MATLAB" href="http://www.accelereyes.com/products/jacket" target="_blank">Jacket</a> for MATLAB® and <a title="Learn about ArrayFire" href="http://www.accelereyes.com/products/arrayfire" target="_blank">ArrayFire</a> for C/C++/Fortran/Python, a comprehensive library of GPU-accelerated functions.</p>
<p><strong>GPU Programming for Medical Image Segmentation: <a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">January </a></strong><strong><a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">18, 2012</a> </strong><strong>at 3:00 p.m. EST</strong></p>
<p style="text-align: justify;"><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2012/01/brainimagesm.jpg"><img class="alignright  wp-image-2054" title="brainimagesm" src="http://blog.accelereyes.com/blog/wp-content/uploads/2012/01/brainimagesm-225x300.jpg" alt="" width="203" height="270" /></a>There&#8217;s a huge volume of data generated using acquisition modalities like computer tomography (CT), magnetic resonance imaging (MRI), positron emission tomography or nuclear medicine. A common need is to manipulate and transmit this data using compression techniques in as little time as possible. During this webinar we will show Jacket’s superior speed and handling volumes from subscripting to convolutions.  Come and learn how to accelerate common medical imaging applications using an easy, powerful programming library with Jacket for MATLAB®.</p>
<p><strong>OpenCL and CUDA Trade-Offs and Comparison: <a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">February 15</a></strong><strong><a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">, 2012</a></strong><strong> at 3:00 p.m. EST</strong></p>
<p style="text-align: justify;"><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/raindrop.png"><img class="alignleft size-medium wp-image-2040" title="raindrop" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/raindrop-300x140.png" alt="" width="300" height="140" /></a>The OpenCL standard continues to mature and is now (or soon will be) supported by a variety of GPUs and manycore processors. At AccelerEyes, we remain at the forefront of OpenCL development. ArrayFire OpenCL is a fast software library for GPU computing with a simple API.  In this informative webinar, our team of GPU experts will discuss OpenCL and CUDA trade-offs and comparisons.  In addition, you&#8217;ll get to see ArrayFire OpenCL in action with real code.</p>
<p><strong>GPU Programming for Financial Computing:  <a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">March 15</a><a title="Register for Webinar" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">, 2012</a> at 3:00 p.m. EST</strong></p>
<p style="text-align: justify;">Quantitative analysts are discovering the benefits of leveraging GPUs in tackling complex financial computing models. Using Jacket&#8217;s computational horsepower, analysts can employ a variety of functions to achieve speedups in trade signal generation, complex derivative pricing, evaluating risk scenarios, and more. In this webinar, we&#8217;ll discuss the latest developments in GPU programming for financial computing.</p>
<p><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2012/01/financial_modeling-300x225.jpg"><img class="aligncenter size-full wp-image-2069" title="financial_modeling-300x225" src="http://blog.accelereyes.com/blog/wp-content/uploads/2012/01/financial_modeling-300x225.jpg" alt="" width="300" height="225" /></a></p>
<p style="text-align: justify;">Each webinar will be conducted by AccelerEyes’ team of GPU computing experts and will include live demos of Jacket and ArrayFire.   We hope you will <a title="Register for Webinar" href="http://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">join us</a> as we discuss exciting developments in GPU computing software!</p>
<div class="shr-publisher-2133"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2012%2F01%2F12%2Faccelereyes_webinars_2012q1%2F' data-shr_title='AccelerEyes+Webinar+Series'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2012/01/12/accelereyes_webinars_2012q1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GPU Computing with Python</title>
		<link>http://blog.accelereyes.com/blog/2011/12/15/gpu-computing-with-python/</link>
		<comments>http://blog.accelereyes.com/blog/2011/12/15/gpu-computing-with-python/#comments</comments>
		<pubDate>Thu, 15 Dec 2011 23:11:28 +0000</pubDate>
		<dc:creator>pavan</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[Python]]></category>
		<category><![CDATA[ArrayFire]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=2009</guid>
		<description><![CDATA[One of the biggest areas where GPUs are providing benefit is with scientific computing. With libraries like Sage and SciPy providing a huge collection of functions and algorithms for free, Python has become one of the favorite tools for developers around the world. Even though these libraries have C/C++ back-ends, performance on large problems quickly [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p>One of the biggest areas where GPUs are providing benefit is with scientific computing. With libraries like <a title="Sage" href="http://www.sagemath.org/">Sage</a> and <a title="SciPy" href="http://www.scipy.org/">SciPy</a> providing a huge collection of functions and algorithms for free, Python has become one of the favorite tools for developers around the world. Even though these libraries have C/C++ back-ends, performance on large problems quickly becomes an issue and can kill productivity.</p>
<p>On the heals of our free release of <a title="ArrayFire" href="http://www.accelereyes.com/products/arrayfire">Arrayfire C/C++</a>, we&#8217;re excited to release <a href="http://www.accelereyes.com/arrayfire_cuda/afpy.html">ArrayFire Python</a>. All of this is <strong>FREE</strong> for most users (see below for clarification)!</p>
<p>The structure of ArrayFire/Python is loosely based on <a title="NumPy" href="http://numpy.scipy.org/">NumPy</a> in that it uses a single <tt>array</tt> object that can contain multiple data types. You can convert NumPy arrays to ArrayFire arrays and vice versa. If you already have your application using NumPy arrays, this is a quick way to jump in and tweak critical sections.</p>
<pre lang="python">import numpy as np
import arrayfire as af
a = np.random.rand(5,5)
b = af.array(a)
c = b.host() # c is the same as a</pre>
<p>Alternatively, you can generate data on the device:</p>
<pre lang="python">r = af.randu(5, 5)
o = af.ones(5,5)
z = af.zeros(5,5)</pre>
<p>Once you have the data you need, you can utilize <a href="http://www.accelereyes.com/arrayfire_cuda/afpy.html">hundreds of functions</a> to convert your code entirely onto the GPU. You&#8217;ll find much of the API follows directly from <a href="http://www.scipy.org/Numpy_Functions_by_Category">NumPy</a> itself.</p>
<div class="wp-caption aligncenter" style="width: 600px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/raindrop1.png"><img title="Raindrop Example" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/raindrop1-1024x478.png" alt="" width="600" height="300" /></a>
	<p class="wp-caption-text">Screenshot taken while running raindrop example</p>
</div>
<p>Jumping right in, here is an example showing the Monte-Carlo calculation of pi</p>
<pre lang="python">from arrayfire import *
def pi(samples=20000000):
    x = randu(samples, 1)
    y = randu(samples, 1)
    return 4 * sum(mul(x, x) + mul(y, y) &lt; 1) / samples</pre>
<p>You can visit our website to <a href="http://www.accelereyes.com/products/arrayfire">download</a> the latest version of ArrayFire. You can find the Python wrapper in <tt>arrayfire/python</tt> directory. Installation instructions are in <tt>README</tt>, and a few examples are included that show off both compute and visualizations.</p>
<p>Our <a href="http://forums.accelereyes.com/forums/viewforum.php?f=17">Forums</a> are the best place to get the latest info and help.</p>
<p>AccelerEyes provides this software for free in the hope that some of you might be interested in hiring us to port your code to the GPU.  If that is interesting, <a href="https://www.accelereyes.com/company/contact_us">let us know</a>!</p>
<p>* ArrayFire is free for use on a single GPU.  To run ArrayFire on larger hardware systems, contact <a href="mailto:sales@accelereyes.com">sales@accelereyes.com</a>.</p>
<div class="shr-publisher-2009"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F12%2F15%2Fgpu-computing-with-python%2F' data-shr_title='GPU+Computing+with+Python'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/12/15/gpu-computing-with-python/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Jacket v2.0 Now Available</title>
		<link>http://blog.accelereyes.com/blog/2011/12/08/jacket-version-2-0/</link>
		<comments>http://blog.accelereyes.com/blog/2011/12/08/jacket-version-2-0/#comments</comments>
		<pubDate>Thu, 08 Dec 2011 19:03:23 +0000</pubDate>
		<dc:creator>scott</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Jacket]]></category>
		<category><![CDATA[matlab]]></category>
		<category><![CDATA[parallel computing toolbox]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1984</guid>
		<description><![CDATA[New Multi-GPU functionality , added support for OpenCL devices, and much more&#8230; AccelerEyes announces the release of Jacket version 2.0, adding GPU computing capabilities for use with MATLAB®.  Version 2.0 delivers even more speed through a host of new improvements, maximizing GPU device performance and utilization. Notable new features include a multi-GPU interface and support [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p style="text-align: left;" align="center"><strong><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/jacket_logo.png"><img class="alignright" title="jacket_logo" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/jacket_logo-300x235.png" alt="" width="240" height="188" /></a></strong></p>
<p style="text-align: left;" align="center"><strong>New Multi-GPU functionality <strong>, added support for OpenCL devices, and much more&#8230;</strong></strong></p>
<p style="text-align: left;" align="center"><strong><strong></strong></strong>AccelerEyes announces the release of Jacket version 2.0, adding GPU computing capabilities for use with MATLAB®.  Version 2.0 delivers even more speed through a host of new improvements, maximizing GPU device performance and utilization.</p>
<p>Notable new features include a multi-GPU interface and support for OpenCL devices. With Jacket v2.0, your M-code is now portable across all major GPU devices, including AMD/ATI, Intel, and NVIDIA chips.</p>
<p><a title="About Jacket" href="http://www.accelereyes.com/products/jacket">Jacket</a> is the premier GPU software plugin for MATLAB®, <a title="Compare Jacket" href="http://www.accelereyes.com/products/compare">better</a> than alternative solutions.  It is relied upon by thousands of organizations for rapid prototyping and problem solving across a range of government, manufacturing, energy, media, biomedical, financial, and scientific research applications.</p>
<p><strong>Multi-GPU Details:</strong></p>
<ul>
<li>Control over all GPUs in your program through simple, fast GPU selection functions.  Jacket automatically handles communication between the GPU devices, without the need to launch bulky parallel computing workers</li>
<li>GINFO, GSELECT, GSYNC all extended to handle multiple devices</li>
</ul>
<p><strong>OpenCL Details:</strong></p>
<ul>
<li>Supports single precision, floating point, real, and complex types</li>
<li>Supports array math, FFTs, element-wise operations, and more</li>
<li>Selection of any OpenCL compliant device listed in GINFO</li>
<li>Currently available as a FREE beta feature, <a title="Download Jacket" href="https://accelereyes.com/licenses_jacket">download now</a></li>
</ul>
<p><strong>Other Notable Improvements:</strong></p>
<ul>
<li>New Base Jacket Functions, such as MEDIAN and PROD</li>
<li>Additional Image Processing Library functions</li>
<li>Additional Statistics Library functions</li>
<li>Additional Signal Processing Library functions</li>
<li>Support for binary to decimal conversion with BI2DE, DE2BI</li>
<li>New Demos (included in every download):
<ul>
<li>Defense Optical Flow Tracking example, Music Visualizer example, and new Jacket CPU v. GPU demo</li>
</ul>
</li>
<li>Financial example of Black-Scholes with GCOMPILE is 35X faster than CPU<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/12/jacket_logo.png"><br />
</a></li>
</ul>
<p>Visit our <a href="http://www.accelereyes.com/">company website</a> and see the <a href="http://wiki.accelereyes.com/wiki/index.php/Release_Notes">v2.0 release notes</a> for the full list of enhancements.</p>
<p><strong>Priciing and Availability</strong></p>
<p>Jacket v2.0 is available for download on the AccelerEyes website.  Pricing for a Jacket base license with support for a single GPU is $999.00 USD for commercial and $350.00 USD for academic customers.  AccelerEyes provides 12 months of software maintenance and updates with each software license.  Volume packages and development bundles are also <a href="http://www.accelereyes.com/purchase/special_offers"><span style="color: #0000ff;">now available</span></a> at special price points.</p>
<p><strong>Try our Professional Services</strong></p>
<p>AccelerEyes provides professional GPU consulting services.  Our team of engineers guarantees great results from GPU computing.  Equipped with Jacket and years of experience, our experts deliver results in fewer hours than any other consulting firms.  Set up a <a href="mailto:support@accelereyes.com?subject=FREE%20GPU%20Computing%20Consultation&amp;body=I%20would%20like%20to%20request%20a%20FREE%20GPU%20computing%20consultation%20session%20with%20one%20of%20your%20GPU%20experts.%20%20I%20am%20available%20for%20a%20phone%20call%20during%20the%20following%20times%3A%0A%0A%3Clist%20available%20times%3E%0A">free GPU consultation</a> today.</p>
<div class="shr-publisher-1984"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F12%2F08%2Fjacket-version-2-0%2F' data-shr_title='Jacket+v2.0+Now+Available'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/12/08/jacket-version-2-0/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Jacket on Lenovo Systems</title>
		<link>http://blog.accelereyes.com/blog/2011/11/23/jacket-on-lenovo-systems/</link>
		<comments>http://blog.accelereyes.com/blog/2011/11/23/jacket-on-lenovo-systems/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 21:47:52 +0000</pubDate>
		<dc:creator>scott</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[benchmarks]]></category>
		<category><![CDATA[Jacket]]></category>
		<category><![CDATA[Lenovo]]></category>
		<category><![CDATA[matlab]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1891</guid>
		<description><![CDATA[Lenovo and AccelerEyes have a joint solution for optimizing M code on Lenovo workstations.  The combined HPC solution combines high Intel Xeon CPU performance for daily productivity with unprecedented NVIDIA graphics (GPU) performance for parallel computing with Jacket. Jacket’s comprehensive benchmark suite, when run on Lenovo ThinkStation systems, shows tremendous amounts of speedups for a [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p style="text-align: justify;">Lenovo and AccelerEyes have a joint solution for optimizing M code on Lenovo workstations.  The combined HPC solution combines high Intel Xeon CPU performance for daily productivity with unprecedented NVIDIA graphics (GPU) performance for parallel computing with Jacket. Jacket’s comprehensive benchmark suite, when run on Lenovo ThinkStation systems, shows tremendous amounts of speedups for a wide variety of computationally-intensive applications.</p>
<div class="mceTemp">
<div class="mceTemp">
<p><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/Lenovo-ThinkStations.png"><img class="aligncenter size-full wp-image-1901" title="Lenovo ThinkStations" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/Lenovo-ThinkStations.png" alt="" width="682" height="312" /></a></p>
<p style="text-align: justify;">Jacket is the world’s fastest and broadest GPU software accelerating the M-language commonly found in MATLAB®.  Thousands of customers around the world have used Jacket to accelerate their MATLAB code.</p>
<p style="text-align: justify;">Lenovo ThinkStation systems are ideally suited for running real-world high-performance applications using Jacket. While the high-end CPUs are ideal for daily productivity tasks, Jacket and the Quadro GPUs perform HPC operations with ease.</p>
<p style="text-align: justify;">To demonstrate the value gained by upgrading to a ThinkStation with an NVIDIA Quadro, benchmarks were run on the E20, S20 and D20 systems with Jacket and a variety of GPUs. We combined each of the three systems with three different GPUs in a good-better-best configuration, to create 9 different hardware test environments for the Jacket benchmark suite.</p>
<p style="text-align: justify;">The resulting speed-ups achieved over the baseline system show tremendous speed advantages that get wider as the configuration gets better.  It is worth noting that with the Jacket MGL add-on, you can run code on multiple GPUs on the same machine. We observed a performance boost of up to 90% with each additional GPU added to the system.</p>
<p style="text-align: justify;"><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/Lenovo-Measured-Speedups-e1320954198284.png"><img class="aligncenter size-full wp-image-1905" title="Lenovo Measured Speedups" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/Lenovo-Measured-Speedups-e1320954198284.png" alt="" width="600" height="348" /></a></p>
<p style="text-align: justify;">Jacket has a wide range of domain-specific library functions available for free. Functions for Image, Signal and Video Processing, statistics and graphics are included with the Jacket package. This allows domain professionals to get going right away without the added hassle of choosing which packages to buy.  Jacket combines high-level programmability in M-code with the ability to control the nuts and bolts. Using the Jacket SDK, you can create customized computational kernels for your domain-specific algorithms using the same code that many of Jacket’s functions are written in. Functions that use Jacket SDK plug in effortlessly to Jacket’s core and benefit from Jacket’s automated optimizations.  Jacket code is deployable to machines without a MATLAB or Jacket license. Using the Jacket JMC add-on, your code can be compiled either into an executable package or a library that can be linked into other programs.</p>
<p style="text-align: justify;">The Lenovo ThinkStation with Jacket is a high-performance, power-efficient advanced workstation HPC platform solution that brings supercomputing power to MATLAB users for a fraction of the cost. With its demonstrated ability to achieve high speedups across a variety of applications, Jacket for MATLAB will help you harness the ThinkStation’s full computing potential.</p>
<p style="text-align: justify;">
</div>
</div>
<div class="shr-publisher-1891"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F11%2F23%2Fjacket-on-lenovo-systems%2F' data-shr_title='Jacket+on+Lenovo+Systems'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/11/23/jacket-on-lenovo-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AccelerEyes Releases ArrayFire GPU Software</title>
		<link>http://blog.accelereyes.com/blog/2011/11/21/accelereyes-releases-arrayfire-gpu-software/</link>
		<comments>http://blog.accelereyes.com/blog/2011/11/21/accelereyes-releases-arrayfire-gpu-software/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 18:39:22 +0000</pubDate>
		<dc:creator>scott</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[ArrayFire]]></category>
		<category><![CDATA[cuda]]></category>
		<category><![CDATA[Free GPU Software]]></category>
		<category><![CDATA[parallel computing]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1933</guid>
		<description><![CDATA[A free, fast, and simple GPU library for CUDA and OpenCL devices. AccelerEyes announces the launch of ArrayFire, a freely-available GPU software library supporting CUDA and OpenCL devices. ArrayFire supports C, C++, Fortran, and Python languages on AMD, Intel, and NVIDIA hardware.  Learn more by visiting the ArrayFire product page. &#8220;ArrayFire is our best software [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/array_fire_mini_logo_krunal1.png"><img class="aligncenter size-medium wp-image-1940" title="array_fire_mini_logo_krunal" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/11/array_fire_mini_logo_krunal1-300x64.png" alt="" width="300" height="64" /></a><strong><strong>A free, fast, and simple GPU library for CUDA and OpenCL devices.</strong></strong></p>
<p style="text-align: justify;">AccelerEyes announces the launch of ArrayFire, a freely-available GPU software library supporting CUDA and OpenCL devices.</p>
<p style="text-align: justify;">ArrayFire supports C, C++, Fortran, and Python languages on AMD, Intel, and NVIDIA hardware.  Learn more by visiting the ArrayFire <a title="ArrayFire Product Info" href="http://www.accelereyes.com/products/ArrayFire">product page</a>.</p>
<p style="text-align: justify;">&#8220;ArrayFire is our best software yet and anyone considering GPU computing can benefit,&#8221; says James Malcolm, VP Engineering at AccelerEyes.  &#8220;It is fast, simple, GPU-vendor neutral, full of functions, and free for most users.&#8221;</p>
<p style="text-align: justify;">Thousands of paying customers currently enjoy AccelerEyes’ GPU software products.  With ArrayFire, everyone developing software for GPUs has an opportunity to enjoy these benefits without the upfront expense of a developer license.</p>
<h4>Reasons to use ArrayFire:</h4>
<ul>
<li><strong>Fast.</strong>  It beats other CPU and GPU acceleration software.  Benchmark it yourself!</li>
<li><strong>Friendly.</strong>  You can learn it in minutes.  It is super easy to use.</li>
<li><strong>Useful.</strong>  It will benefit your code.  It contains the largest set of GPU software functions in the world.</li>
<li><strong>HW-neutral.</strong>  Run on your favorite hardware.  ArrayFire code runs on any CUDA or OpenCL device.</li>
<li><strong>Proven.</strong>  AccelerEyes’ software is relied upon by thousands of active users.  You can tap great support on the ArrayFire <a href="http://forums.accelereyes.com/" target="_blank">forums</a>.</li>
<li><strong>GFOR.</strong>  You get the powerful and only GPU FOR-loop in the world.</li>
<li><strong>Multi-GPU Scalable.</strong>  You can scale from one to multiple GPUs in minutes with one simple function call.</li>
<li><strong>Graphics.</strong>  Beautiful OpenGL visualizations, adding eye-candy to your acceleration.</li>
</ul>
<p style="text-align: justify;">&#8220;We are excited to make ArrayFire free to most customers,&#8221; says John Melonakos, CEO of AccelerEyes.  &#8220;We see too many organizations frustrated by the difficulty of programming GPUs today.  ArrayFire removes that frustration, enabling GPU tire-kickers to realize the true benefits of the powerful GPU hardware.&#8221;</p>
<p style="text-align: justify;">It is well-supported commercial software at open-source prices.  Visit <a title="AccelerEyes" href="http://www.accelereyes.com/">our website</a> to download the new software today!</p>
<p style="text-align: justify;"><strong>Pricing and Availability</strong><strong> </strong></p>
<p style="text-align: justify;">ArrayFire is free for most users.  To learn more about ArrayFire licensing, visit the ArrayFire <a title="ArrayFire Licensing" href="http://www.accelereyes.com/products/arrayfire_licensing">licensing page</a>.</p>
<p style="text-align: justify;"><strong>Try our Professional Services</strong></p>
<p style="text-align: justify;">AccelerEyes provides professional GPU consulting services.  Our team of engineers guarantees great results from GPU computing.  Equipped with ArrayFire and years of experience, our experts deliver results in fewer hours than any other consulting firms.  Set up a free GPU consultation today by emailing us at <a href="mailto:sales@accelereyes.com">sales@accelereyes.com</a>.</p>
<p style="text-align: justify;"><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2010/08/accelereyes_logo_small_nobg_w180px.jpg"><img class="aligncenter size-full wp-image-619" title="accelereyes_logo_small_nobg_w180px" src="http://blog.accelereyes.com/blog/wp-content/uploads/2010/08/accelereyes_logo_small_nobg_w180px.jpg" alt="AccelerEyes Logo" width="180" height="70" /></a></p>
<div class="shr-publisher-1933"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F11%2F21%2Faccelereyes-releases-arrayfire-gpu-software%2F' data-shr_title='AccelerEyes+Releases+ArrayFire+GPU+Software'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/11/21/accelereyes-releases-arrayfire-gpu-software/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AccelerEyes Webinar Series</title>
		<link>http://blog.accelereyes.com/blog/2011/10/27/accelereyes-webinar-series/</link>
		<comments>http://blog.accelereyes.com/blog/2011/10/27/accelereyes-webinar-series/#comments</comments>
		<pubDate>Thu, 27 Oct 2011 20:24:58 +0000</pubDate>
		<dc:creator>scott</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[MATLAB®]]></category>
		<category><![CDATA[accelereyes]]></category>
		<category><![CDATA[event]]></category>
		<category><![CDATA[Jacket]]></category>
		<category><![CDATA[libjacket]]></category>
		<category><![CDATA[Webinar]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1791</guid>
		<description><![CDATA[AccelerEyes invites you to participate in series of webinars designed to help you learn more about Jacket for MATLAB® and LibJacket for C/C++/Fortran/Python, a comprehensive library of GPU-accelerated functions. Joint Webinar With NVIDIA: LibJacket CUDA Library On October 20th we co-hosted a joint webinar with NVIDIA.  During this well-attended event, our GPU computing experts provided a general product overview and [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p>AccelerEyes invites you to participate in series of <a title="Register for Webinar Series" href="https://accelereyes.webex.com/mw0306ld/mywebex/default.do?nomenu=true&amp;siteurl=accelereyes&amp;service=6&amp;rnd=0.4385461764274333&amp;main_url=https%3A%2F%2Faccelereyes.webex.com%2Fec0605ld%2Feventcenter%2Fprogram%2FprogramDetail.do%3FtheAction%3Ddetail%26siteurl%3Daccelereyes%26cProgViewID%3D0" target="_blank"><span style="color: #0000ff;">webinars</span></a> designed to help you learn more about <a title="Information on Jacket for MATLAB" href="http://www.accelereyes.com/products/jacket" target="_blank">Jacket</a> for MATLAB® and <a title="Information on LibJacket" href="http://www.accelereyes.com/products/libjacket" target="_blank">LibJacke</a>t for C/C++/Fortran/Python, a comprehensive library of GPU-accelerated functions.</p>
<p><strong>Joint Webinar With NVIDIA: LibJacket CUDA Library</strong></p>
<p>On October 20th we co-hosted a<a title="Recording of NVIDIA Joint Webinar" href="http://developer.download.nvidia.com/CUDA/training/LibJacket_Oct2011.mp4" target="_blank"> joint webinar with NVIDIA</a>.  During this well-attended event, our GPU computing experts provided a general product overview and usage of the LibJacket CUDA library.  Several impressive <a title="demos" href="http://blog.accelereyes.com/blog/2011/09/01/jacket_demo/" target="_blank">demos</a> of LibJacket in action were provided as well.  LibJacket supports hundreds of <a title="LibJacket Functions" href="http://wiki.accelereyes.com/wiki/libjacket/modules.htm" target="_blank"><span style="color: #0000ff;">GPU computing functions</span></a> and programmers in numerous industries have been able to speedup applications.  Be sure to check out the Q&amp;A session included in the <a title="Go to Recorded Webinar" href="http://developer.download.nvidia.com/CUDA/training/LibJacket_Oct2011.mp4" target="_blank"><span style="color: #0000ff;">recorded webinar</span></a> posted on NVIDIA’s Developer Zone. Thanks again to NVIDIA for co-hosting this informative webinar!</p>
<div class="mceTemp">
<div id="attachment_1805" class="wp-caption aligncenter" style="width: 350px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/10/Graphics-Lib-1.png"><img class="size-full wp-image-1805" title="Graphics Library 1" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/10/Graphics-Lib-1.png" alt="" width="350" height="205" /></a>
	<p class="wp-caption-text">Graphics library: Tweaked FDTD (example in LibJacket package)</p>
</div>
</div>
<p><strong>GPU Programming for Defense/Intelligence Apps: </strong><strong><span style="color: #0000ff;"><a title="Register for Webinar" href="http://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">November 15, 2011</a> </span></strong><strong>at 3:00 p.m. EST</strong></p>
<p>Major defense and intelligence institutions are discovering just how effective GPU computing can be in enabling unique solutions using Jacket and LibJacket.  Come and learn how to accelerate common defense and intelligence algorithms using easy, powerful programming libraries, with Jacket for MATLAB® and LibJacket for C/C++/Fortran.</p>
<p><strong>LibJacket CUDA Library for Maximus Applications:  </strong><strong><span style="color: #0000ff;"><a title="Register for Webinar" href="http://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank">December 15, 2011</a></span></strong><strong> at 3:00 p.m. EST</strong></p>
<p>Learn how to integrate computations with visualizations in a CUDA-based application through simple visualization functions for plotting, image and volume rendering, and more.</p>
<div class="mceTemp">
<div id="attachment_1807" class="wp-caption aligncenter" style="width: 350px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/10/Graphics-Lib-2.png"><img class="size-full wp-image-1807" title="Graphics Library 2" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/10/Graphics-Lib-2.png" alt="" width="350" height="205" /></a>
	<p class="wp-caption-text">Graphics library: simulating shallow water equations with reflective</p>
</div>
</div>
<p>The series will be conducted by AccelerEyes’ team of GPU computing experts and will include live demos of Jacket and LibJacket.   We hope you will <a title="Register for Webinar" href="http://accelereyes.webex.com/mw0306ld/mywebex/default.do?siteurl=accelereyes" target="_blank"><span style="color: #0000ff;">join us</span></a> as we discuss exciting developments in GPU computing software!</p>
<div class="shr-publisher-1791"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F10%2F27%2Faccelereyes-webinar-series%2F' data-shr_title='AccelerEyes+Webinar+Series'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/10/27/accelereyes-webinar-series/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Filtering Benchmarks &#8211; OpenCV GPU vs LibJacket</title>
		<link>http://blog.accelereyes.com/blog/2011/09/26/opencv/</link>
		<comments>http://blog.accelereyes.com/blog/2011/09/26/opencv/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 22:30:13 +0000</pubDate>
		<dc:creator>James Malcolm</dc:creator>
				<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[libjacket]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1756</guid>
		<description><![CDATA[OpenCV is one of the most popular computer vision toolkits, and over the last year they&#8217;ve been integrating more GPU processing into the core. One of the most common image processing tasks is convolution. Since LibJacket and OpenCV both support this, one of my coworkers rolled up his sleeves and benchmarked the latest versions from [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p><a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/09/OpenCV_Logo.png"><img class="alignright size-full wp-image-1777" title="OpenCV_Logo" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/09/OpenCV_Logo.png" alt="" width="239" height="295" /></a><a title="OpenCV" href="http://opencv.willowgarage.com/wiki/">OpenCV</a> is one of the most popular computer vision toolkits, and over the last year they&#8217;ve been integrating more <a href="http://opencv.willowgarage.com/wiki/OpenCV_GPU">GPU processing</a> into the core.</p>
<p>One of the most common image processing tasks is convolution. Since <a href="http://wiki.accelereyes.com/wiki/libjacket/group__Convolutions.htm">LibJacket</a> and <a href="http://opencv.itseez.com/modules/imgproc/doc/filtering.html">OpenCV</a> both support this, one of my coworkers rolled up his sleeves and benchmarked the latest versions from both libraries: OpenCV/CPU, OpenCV/GPU, LibJacket.</p>
<p>Jump over to his personal website for the <a href="http://mcclanahoochie.com/blog/2011/09/opencv-vs-libjacket-gpu-sobel-filtering/">full benchmark results and source code</a>.  From the graphs, the GPU implementations from OpenCV and LibJacket both easily outperform the default CPU version in OpenCV, but notice that LibJacket pushes performance even further and dominates OpenCV&#8217;s GPU implementation, especially when using separable filters.</p>
<p>We&#8217;ve worked really hard the last few years to produce a reliable, high-performance commercial library and run-time.  If you&#8217;re interested in the best software for GPU computing, you might consider buying a license to save yourself time and to boost the end-to-end performance of your code.</p>
<p>To learn more about LibJacket, <a href="http://accelereyes.com/libjacket_tour">take the tour</a>.</p>
<div class="shr-publisher-1756"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F09%2F26%2Fopencv%2F' data-shr_title='Filtering+Benchmarks+-+OpenCV+GPU+vs+LibJacket'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/09/26/opencv/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Optimization methods for deep learning</title>
		<link>http://blog.accelereyes.com/blog/2011/09/20/optimization-methods-for-deep-learning/</link>
		<comments>http://blog.accelereyes.com/blog/2011/09/20/optimization-methods-for-deep-learning/#comments</comments>
		<pubDate>Tue, 20 Sep 2011 17:41:58 +0000</pubDate>
		<dc:creator>pavan</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[GPU Comparison]]></category>
		<category><![CDATA[Parallel computing]]></category>
		<category><![CDATA[cluster]]></category>
		<category><![CDATA[deep learning]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[optimization]]></category>

		<guid isPermaLink="false">http://blog.accelereyes.com/blog/?p=1531</guid>
		<description><![CDATA[Researchers at SAIL (Stanford Artificial Intelligence Laborator), have done it again. They have successfully used Jacket to speed up the training part of Deep Learning algorithms. In their paper titled “On Optimization Methods for Deep Learning”, they experiment with some of the well known training algorithms and demostrate their scalability across parallel architectures (GPUs as [...]]]></description>
			<content:encoded><![CDATA[<p></p><!-- Start Shareaholic LikeButtonSetTop Automatic --><!-- End Shareaholic LikeButtonSetTop Automatic --><p>Researchers at SAIL (Stanford Artificial Intelligence Laborator), have done it again. They have successfully used Jacket to speed up the training part of Deep Learning algorithms. In their paper titled “<a href="http://ai.stanford.edu/~quocle/LeNgiCoaLahProNg11.pdf">On Optimization Methods for Deep Learning</a>”, they experiment with some of the well known training algorithms and demostrate their scalability across parallel architectures (GPUs as well as multi-machine networks). The algorithms include SGDs (Stochastic Gradient Descent) L-BFGS (Limited BFGS used for solving non-linear problems), CG (Conjugate Gradient).</p>
<p>While SGDs are easy to implement, they require manual tuning. Add to that their sequential nature, they are hard to tune, scale and parallelize making them difficult to use with Deep Learning algorithms.  L-BFGS and CG algorithms can be harder to implement and are more computationally intensive. For speed benefits, they use approximated second order information for L-BFGS, and use the conjugacy information during the optimization step for CG. To overcome the scalability problems of L-BFGS and CG, which require gradient of the entire data set, they utilized mini-batch training which results in a faster algorithm for larger data sizes.</p>
<p>Following are few of the results they achieved using these algorithms for various problems. Each machine is equipped with 4 Intel CPU cores (at 2.67GHz) and a GeForce GTX 285 GPU.</p>
<div id="attachment_1532" class="wp-caption aligncenter" style="width: 545px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/Autoencoder_Training.png"><img class="size-full wp-image-1532" title="Autoencoder_Training" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/Autoencoder_Training.png" alt="" width="545" height="421" /></a>
	<p class="wp-caption-text">Autoencoder training on the CPU</p>
</div>
<p>&nbsp;</p>
<div id="attachment_1533" class="wp-caption aligncenter" style="width: 527px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/Autoencoder_Training_GPU.png"><img class="size-full wp-image-1533" title="Autoencoder_Training_GPU" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/Autoencoder_Training_GPU.png" alt="" width="527" height="396" /></a>
	<p class="wp-caption-text">Autoencoder training on the GPU vs CPU</p>
</div>
<p>It can be seen that LBFGS and CG algorithms converge nearly twice as fast on the GPU compared to the CPU. SGD methods though neither improve nor degrade the performance. This behavior can be explained by the large number mini batch sizes preferred by LBFGS / CG compared to SGDs, making them more parallelized for the GPUs.</p>
<p align="LEFT"> For the next experiment, they used the algorithms for training supervised Convoluted neural networks. They then split the workload of calculating the gradient across multiple machines with GPUs.</p>
<div id="attachment_1534" class="wp-caption aligncenter" style="width: 452px">
	<a href="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/CNN_Multimachine_Training.png"><img class="size-full wp-image-1534" title="CNN_Multimachine_Training" src="http://blog.accelereyes.com/blog/wp-content/uploads/2011/07/CNN_Multimachine_Training.png" alt="" width="452" height="345" /></a>
	<p class="wp-caption-text">Training Supervised CNNs on a cluster of computers (with GPUs)</p>
</div>
<p align="LEFT">As it can be seen, the rate of convergence for LBFGS scales well all the way up to 8 machines!</p>
<p align="LEFT">They also talks about other optimization techniques, usage of these algorithms for sparse auto encoding and locally connected networks, and uitility / accuracy of the the LBFGS algorithm for Deep learning.</p>
<p align="LEFT">We would like to thank <a href="http://ai.stanford.edu/~quocle/">Quoc V. Le</a>, <a href="http://cs.stanford.edu/~jngiam/">Jiquan Ngiam</a>, <a href="http://www.stanford.edu/~acoates/">Adam Coates</a>, Abhik Lahiri, Bobby Prochnow,<a href="http://www.cs.stanford.edu/people/ang/"> Andrew Y. Ng</a> and everyone at their group for letting us use and share their work.</p>
<div class="shr-publisher-1531"></div><!-- Start Shareaholic LikeButtonSetBottom Automatic --><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><div class='shareaholic-like-buttonset' style='float:none;height:30px;'><a class='shareaholic-googleplusone' data-shr_size='medium' data-shr_count='false' data-shr_href='http%3A%2F%2Fblog.accelereyes.com%2Fblog%2F2011%2F09%2F20%2Foptimization-methods-for-deep-learning%2F' data-shr_title='Optimization+methods+for+deep+learning'></a></div><div style="clear: both; min-height: 1px; height: 3px; width: 100%;"></div><!-- End Shareaholic LikeButtonSetBottom Automatic -->]]></content:encoded>
			<wfw:commentRss>http://blog.accelereyes.com/blog/2011/09/20/optimization-methods-for-deep-learning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

