Query job status via our API


Written by: on April 9, 2013

I wanted to share a few tips on how you can leverage the NACC API to query for your job status. Both job_name and job_number are optional, but neither, either or both can be specified for your query. Should you not supply either job_name or job_number, then the API will simply return the status of your last five jobs.

To try this out, post your requests to the following URL https://api.nimbix.net:4081/nimbix/nacc_jstatus.

Just in case you forgot your API key, you can always grab this via the NACC portal.

Job query example

{ "username" : "your_username", "api-key" : "yourAPIkey", "jobname" : "139043483194-job-name", "jobnumber" : 12345 "request" : "status" }

There’s a number of interesting possibilities you can do here once you have this information, you can integrate with an existing dashboard / DevOps tool-set, leverage this info for existing notification / eventing systems you have, or for internal billing and accounting systems.

What are the ways you’re leveraging this information, we’d love to hear from you?

NACC UI and Coming Features


Written by: on March 25, 2013

I’m back in the blogging seat today and while I’m here, I wanted to share some our recent UI changes, along with a preview into some new features coming to NACC.

Let’s dive right in and take look at some of these new UI features we recently launched.

Dashboard

The “dashboard” is your single window view into some of following areas, job status, notifications, key job stats, and the ability to quickly launch your most frequent application workflows. You can see on my dashboard that BWA is my most used app, mostly from test cases. BWA really shows the power of NACC in terms of flexibility, in creating multi-step workflows, along with the compute power of our accelerated co-processors.

Task

Task Builder

Our task builder allows for easy to use workflow creation. In my example below, I’m creating a multi-step paired-end read for BWA. Here I can quickly and easily provide my reference database, either provided by Nimbix (already pre-indexed) and or a number of other third party sources. Once I’ve selected my reference database, which could be plant, animal, or human (martian or gremlin anyone?), I’m then able to provide my read sequence inputs and any optional parameters associated with BWA. Lastly as part of our task builder I can provide details on where my output needs to go, again this could be a Nimbix location or any one of our third party locations.

Task

Data Mover

Data movement is a critical part of workflow and data processing. Our v2.1 release allows you to seamlessly move your inputs and outputs to the location of your choosing for both pre and post processing. Currently we support automated data movement for S3, Dropbox, Globus, and any SFTP location. Our automated data movers give our users significant flexibility when it comes to building, and submitting jobs because they’re not having to wait for data movement to occur and or jungle any dependancies within a workflow due to missing inputs / outputs.

Data

Don’t forget that all of the above features can also be leveraged using our API!

More and More Features

You’re asking… and we’re listening and building! We’re heads down on building amazing new features for you. Some of these new features will include true integrated remote visualization. Just think… from a post-processing stand point, you won’t have to move your data back out of the cloud. You’ll simply be able to interact in real-time with your outputs, models, etc. within the Nimbix cloud. We’re also heads down on offering our users the ability to select their desired interconnect at run-time. This means you’ll be able to select either gigE, 10gigE, or Infiniband for your given application. You won’t have to worry about the plumbing of the fabric, just tell our task builder and we’ll make it happen, in real-time. Lastly, we’re working on building and delivering a true DCIM product for our private cloud / colocation users. This will allow for a ton of flexibility around space, power, and cooling management.

The old blog seat is getting a bit uncomfortable now. I’ve shown you a few things about v2.1 and shared a bit of our product roadmap. I’d like to leave you with a small video of NACC in action. If a picture is worth a 1000 words, this video should be worth at least 10′s of thousands of words.

Hybrid Public/Private Clouds for Processing Intensive Applications


Written by: on February 25, 2013

Most organizations that have managed large batch processing resources, or HPC clusters understand the economics of computing pretty well.  The difficulty has always been in capacity planning, software application management/deployment, and trying to keep up with changing compute, storage, and networking technologies.

When you add public cloud computing to the mix of possible resource options, the reaction from traditional infrastructure managers is mixed.  On the one hand, the idea of an elastic, pay-as-you-go compute model is very attractive, but on the other, the pain of migration,  perceived data risks, and uncertain economics may eliminate the public cloud option altogether.

The other challenge with the mainstream cloud deployment model for  processing intensive applications is that all of the software tools typical in an HPC environment have to be built and tested all over again in the cloud environment.  Add to this issues and costs associated with moving large data sets in and out of that cloud and it is easy to see why cloud elation can quickly fade.

Theses realities should not deter organizations from evaluating options.  As time progresses, it is becoming increasingly clear that even the largest of  organizations will have trouble swallowing the costs of building and scaling their own dedicated computing environments and keeping pace with changing hardware and software technologies.   Ironically, there is a reciprocal challenge for younger companies, too, that have come of age using pure cloud resources for infrastructure.  These organizations have learned that while public clouds have their place, as their processing environments have grown in scale, so has their cloud bill.  They’ve reached a point where the economics start to tip in favor of deploying dedicated resources for portions of their environment.

To address these unique “hybrid” requirements in medium to large scale processing environments, Nimbix offers a blend of do-it-yourself and cloud.   Nimbix enables users to scale both their own computing assets in Nimbix datacenters right alongside managed “private” clouds and the Nimbix Accelerated Compute Cloud.  This flexibility helps solve for some of the technology and economics challenges mentioned above.  By hybridizing in Nimbix HPC datacenters, infrastructure managers can operate traditional in-house clusters while taking comfort in knowing there is more floor space and compute scale available from an infrastructure provider that understands HPC and processing intensive application environments.

Additionally, the ability to leverage Nimbix resources that are deployed as a scalable batch processing cluster simplifies many of the challenges associated with turning up a processing cluster in a commodity cloud utility service.  Users can integrate workflows between collocated, managed, and public cloud resources with a simple API call while only paying for actual processing time billed in minutes on the cloud side.

Web service enabled infrastructures will continue to evolve over time enabling smoother migration between public and private clouds.  Thinking about hybrid environments might just be the best solution in terms of flexibility, productivity and economics.

 

 

Nimbix and HUGEdata


Written by: on January 8, 2013

Keeping with the theme from my last posting I wanted to share a little information about another one of our amazing customers.  HUGEdata has been a Nimbix customer since September of 2012.  They provide a cloud-based or onsite database specially designed to handle the demanding workloads from the ever-increasing need to query, analyze, and derive value from large amounts of data in seconds.

It is specifically aimed at companies with the need to analyze huge (hundreds of millions or billions of rows) amounts of data along with complex queries in seconds instead of minutes and hours.  The system utilizes existing tools and languages (SQL) already in use at most companies, speeding implementation and reducing disruption to existing data and systems, and all at a lower total cost of ownership.

Learn more about HUGEdata and why they choose Nimbix by clicking here to read the case study.

HPC Cloud Observations in 2012 and Implications for 2013


Written by: on December 31, 2012

As we prepare for a new year in the world of HPC cloud, I find it is always good to spend some time reflecting on progress from the prior year and the implications for the next.

I think many who have spent time using or experimenting with HPC applications or workflows on public cloud resources would agree that steady progress has been made in 2012. While many challenges remain around data movement, security, software licensing and ease of use, we’ve all learned more about what it takes to be successful getting processing work done in the more efficient ways. For this post, I summarize a few of my own observations from 2012 and then make some predictions for 2013.

To keep things simple, I’ll just list them out with some commentary:

Observations for HPC Cloud in 2012:

  1. Early large scale HPC cloud deployments with open source software applications – Open source software is still dominating cloud-use cases, although many commercial software organizations will deploy more formal cloud strategies in 2013 (see below).
  2. Data challenges – There are really two big issues associated with HPC data and public clouds.  One is the inherent challenge of transferring and storing (even if temporary) large data sets and the other is data security.  It’s no surprise that the early trail blazers in HPC cloud use cases are in segments where data sets are public or have less restrictive security requirements.
  3. Cloud costs still lack commercial-grade clarity – What I mean here is that most users still don’t have a clear picture of how much their cloud-utility bill will be on a monthly basis.  There are a few cloud expense management platforms emerging, but the picture is still fuzzy for enterprise HPC computing.
  4. Cloud standards maturing – While standards are still shaping the cloud infrastructure industry as a whole, much of the standards debate is centered around the machine stack and provisioning versus workloads and applications.  I expect we will see more drift into applications in the future.

Predictions for HPC Cloud in 2013:

  1. Users and Cloud Providers will add more network bandwidth and data transport acceleration (such as Aspera) to reduce the time to move large data between compute resources
  2. There will be increased use and deployment of data encryption technology which will continue to reduce barriers to cloud adoption
  3. Cloud provider offerings will center more around workloads, applications and processing pipelines versus pure infrastructure
  4. Mid-size and large organizations will migrate toward hybrid private/public infrastructure to optimize economics and monthly spend
  5. Leading HPC ISVs will provide more options and licensing flexibility for cloud enablement
  6. Accelerated platforms and larger memory machines will continue to gain traction in public clouds
  7. We will begin to see more sophisticated tools for cloud processing and workflow automation

So while there is probably nothing earth shattering in the above observations I think it’s important to understand the themes that emerge.   Those themes help shape our collective focus for solving problems in the next year and years to follow.  They help us discern the best standards and cloud deployment models, and finally those observations and themes can help make smarter business decisions.