NASA SBIR 2015 Solicitation


PROPOSAL NUMBER: 15-1 S5.01-8794
SUBTOPIC TITLE: Technologies for Large-Scale Numerical Simulation
PROPOSAL TITLE: A Scheduling-Based Framework for Efficient Massively Parallel Execution

SMALL BUSINESS CONCERN (Firm Name, Mail Address, City/State/Zip, Phone)
EM Photonics
51 East Main Street, Suite 203
Newark, DE 19711 - 4685
(302) 456-9003

PRINCIPAL INVESTIGATOR/PROJECT MANAGER (Name, E-mail, Mail Address, City/State/Zip, Phone)
Daniel Hertenstein
51 East Main Street
Newark, DE 19711 - 4685
(302) 456-9003

CORPORATE/BUSINESS OFFICIAL (Name, E-mail, Mail Address, City/State/Zip, Phone)
Eric Kelmelis
51 East Main Street, Suite 203
Newark, DE 19711 - 4685
(302) 456-9003

Estimated Technology Readiness Level (TRL) at beginning and end of contract:
Begin: 3
End: 4

Technology Available (TAV) Subtopics
Technologies for Large-Scale Numerical Simulation is a Technology Available (TAV) subtopic that includes NASA Intellectual Property (IP). Do you plan to use the NASA IP under the award?

TECHNICAL ABSTRACT (Limit 2000 characters, approximately 200 words)
The barrier to entry creating efficient, scalable applications for heterogeneous supercomputing environments is too high. EM Photonics has found that the majority of the coding and debugging time is not spent defining the problem physics but instead on balancing computation between multiple heterogeneous devices, handling communication of data, and managing distributed memory systems. The time spent improving, modifying, or debugging device specific code paths and common code sections could be better spent improving kernel performance or adding new features.

To address the problem of separating physical science from computing science, we have been developing a solution that decouples the problem definition from the platform-specific implementation details by expressing algorithms as a series of tasks and data dependencies and handing it off to a managed runtime that efficiently partitions and schedules the problem tasks for execution. We have proven this technique in the field of linear algebra, and in this project we will bring these benefits to mission critical NASA solvers. In this SBIR, we will construct a powerful system that, by virtue of decoupling algorithms from dispatch and execution, will be suited for both current and upcoming computer architectures.

Writing a new application will require only an understanding of the algorithm to be implemented, and abstracts away details of heterogeneous resource management and scheduling, thereby removing this responsibility from the scientists that develop this software. Our solution will provide future compatibility, as going to a new version of the same hardware involves no changes and adding new hardware types will require only writing specialized computational kernels. Higher performance is attained because the scheduler will adjust the software's execution based on factors such as the hardware availability and its current performance, as well as the run-time characteristics of the program's execution.

POTENTIAL NASA COMMERCIAL APPLICATIONS (Limit 1500 characters, approximately 150 words)
Our proposed technologies will directly improve the performance, resource utilization, productivity, and fault tolerance of many high performance NASA solver codes. Codes such as GEOS, ModelE, Snowflake, OVERFLOW, and FUN3D can achieve both short and long term benefits by adopting our proposed task-based methodology and scheduling tools. Our proposed technology is truly broad-base, and has no domain-specific requirements. It can benefit every NASA application that utilizes this task-based framework.

In the short term, these tools will be able improve current hardware utilization and performance. Additionally, productivity will increase as the main focus of the codes will become the underlying science rather than the parallelization and communication needed to implement a high performance code base. Another short term benefit includes improved fault tolerance as a scheduled task-based approach will be able to recover from a scenario in which communication with a node is lost.

Long term benefits focus on future-proof algorithms and increased scalability for many of the aforementioned applications. By decoupling the algorithms from the underlying hardware implementation, we will provide a framework that allows for rapid adaptation of new architectures. Scalability will also improve as the proposed scheduler will automatically distribute work to any additional nodes that were added to the cluster.

POTENTIAL NON-NASA COMMERCIAL APPLICATIONS (Limit 1500 characters, approximately 150 words)
Our scheduled task-based methodology has applications outside of NASA. Since our proposed technology does not have any domain-based restrictions, it can be utilized by any software system wishing to increase utilization and performance.

Math libraries for heterogeneous architectures such as CULA, Arrayfire, NMath, and others will be easily scale and work across platforms. They will also be able to easily scale performance to multiple GPUs in both workstation and cluster environments. Additionally, we will be able to solve problems where there is insufficient memory on the GPU to hold the entire problem.

Other non-NASA domains where our technology is immediate applicable include, but is not limited to: molecular dynamics, financial analysis, and graphical ray tracing. All of these domains are easily expressible as interconnected tasks and therefor can prosper for all the aforementioned benefits.

TECHNOLOGY TAXONOMY MAPPING (NASA's technology taxonomy has been developed by the SBIR-STTR program to disseminate awareness of proposed and awarded R/R&D in the agency. It is a listing of over 100 technologies, sorted into broad categories, of interest to NASA.)
Computer System Architectures
Development Environments
Models & Simulations (see also Testing & Evaluation)
Programming Languages
Simulation & Modeling
Software Tools (Analysis, Design)

Form Generated on 04-23-15 15:37