SDSC Taps Plants for Fuel

By Cassie Ferguson

October 14, 2005

Imagine mowing your lawn and then dumping the grass clippings into the gas tank of your car. Inside your tank, the grasses are digested and converted into ethanol — a high-performance, clean-burning, renewable fuel. You avoid the astronomical cost of filling up with old-fashioned petroleum, and the U.S. avoids the costly environmental, climate, and security issues of depending on nonrenewable fossil fuel. While tapping yard clippings as a source of gas might still be something found only in movies, the use of plant material as a major energy source has attracted nationwide attention, with ethanol blends already being offered at the pump.

But the process of producing ethanol remains slow and expensive, and researchers are trying to formulate more efficient, economical methods. That challenge hinges on speeding up a key molecular reaction, which is being investigated in a Strategic Applications Collaboration (SAC) between researchers at the San Diego Supercomputer Center (SDSC) at UC San Diego, the Department of Energy's National Renewable Energy Laboratory (NREL), Cornell University, The Scripps Research Institute (TSRI) and the Colorado School of Mines.

The interest in ethanol, commonly known as grain alcohol, is being driven by the combination of rising petroleum prices and government subsidies for so-called biofuel, a mixture of 15 percent (by volume) gasoline and 85 percent ethanol, known as E85, which sells for an average of 45 cents less per gallon than gasoline. Efforts to mitigate climate change are also spurring the growth of such renewable fuels, which add far less net greenhouse gas to the atmosphere than burning fossil fuels because the step of growing plant material removes carbon dioxide. In August 2005, President George W. Bush signed a comprehensive energy bill that included a requirement to increase the production of biofuels including ethanol and biodiesel from 4 billion to 7.5 billion gallons within the next 10 years.

While most people are familiar with the process used to turn plant material, such as malted grain, into ethanol for beverages like beer, that process is slow and expensive, and the end product is too impure for energy use. To produce ethanol for energy use on a massive scale, researchers are trying to perfect the conversion of biomass (plant matter such as trees, grasses, byproducts from agricultural crops, and other biological material) via industrial processing in “biorefineries.”

“Cellulose is the most abundant plant material on earth and a largely-untapped source of renewable energy,” said project manager Mike Cleary, who is coordinating SDSC's role in the project. “So this collaboration is addressing not just a significant problem in enzymology but a problem of huge potential benefit to society.”

The central bottleneck in making the biomass-to-fuel conversion process more efficient is the slow rate at which cellulose, the woody part of plants, is broken down by the enzyme cellulase, which also happens to be expensive to produce. The cellulase enzyme complex, made up of proteins, acts as a catalyst to mediate and speed this chemical reaction, turning cellulose into sugars.

Scientists want to understand this process at the molecular level so that they can learn how to enhance the reaction. Using molecular dynamics simulations, which model the movement of the enzyme at the atomic scale, the researchers want to determine if the kinetics of the enzyme agree with models based on biochemical and genetic studies. By probing in minute detail how the enzyme makes contact with cellulose at the molecular level, the researchers hope to speed up the process and make it more cost effective by discovering ways the enzyme can be altered through genetic engineering.

The cellulase enzyme complex is actually a collection of enzymes, each of which plays a specific role in breaking down cellulose into smaller molecules of sugar called beta-glucose. The smaller sugar molecules are then fermented with microbes, typically yeast, to make the fuel, ethanol. One of the parts of the enzyme complex, cellobiohydrolase (CBH I), acts as a “molecular machine” that attaches to bundles of cellulose, pulls up a single strand of the sugar, and puts it onto a molecular conveyor belt where it is chopped into the smaller pieces. In order to make this process more efficient through bioengineering, researchers will need a detailed molecular-level understanding of how the cellulase enzyme functions. But the system has been difficult to study because it is too small to be observed directly under a microscope, yet too large for traditional molecular mechanics modeling.

To explore the intricate molecular dynamics of cellulase, researchers at NREL have turned to CHARMM (Chemistry at HARvard Molecular Mechanics), a suite of modeling software for macromolecular simulations, including energy minimization, molecular dynamics, and Monte Carlo simulations. The widely-used community code, originally developed in 1983 in the laboratory of Martin Karplus at Harvard University, models how atoms interact.
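For readers unfamiliar with molecular dynamics, the core idea of advancing atoms in small time steps can be sketched in a few lines. The example below is a generic illustration only, assuming a single particle on a spring and the standard velocity Verlet integration scheme; the function names and parameters are hypothetical and are not taken from CHARMM, whose force fields and integrators are far more elaborate.

```python
def velocity_verlet(x, v, force, mass, dt, n_steps):
    """Advance position x and velocity v through n_steps steps of size dt."""
    f = force(x)
    for _ in range(n_steps):
        v_half = v + 0.5 * dt * f / mass   # half-step velocity update
        x = x + dt * v_half                # full-step position update
        f = force(x)                       # recompute force at the new position
        v = v_half + 0.5 * dt * f / mass   # finish the velocity update
    return x, v


SPRING_K = 1.0  # toy spring constant (illustrative units)


def spring_force(x):
    """Restoring force of a harmonic spring: F = -k x."""
    return -SPRING_K * x


def total_energy(x, v, mass=1.0):
    """Kinetic plus potential energy of the toy system."""
    return 0.5 * mass * v * v + 0.5 * SPRING_K * x * x


x0, v0 = 1.0, 0.0
x1, v1 = velocity_verlet(x0, v0, spring_force, mass=1.0, dt=0.01, n_steps=1000)
print("energy drift:", abs(total_energy(x1, v1) - total_energy(x0, v0)))
```

A production simulation repeats the same kind of loop for every atom in the system, with forces computed from all interacting neighbors, which is why the step count and atom count together drive the computational cost.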

In the cellulase modeling, CHARMM is used to explore the ensemble configurations and protein structure, the interactions of the protein with the cellulose substrate, and the interactions of water with both. Not only are the NREL simulations the first to simultaneously model the cellulase enzyme, cellulose substrate, and surrounding water, they are among the largest molecular systems ever modeled. In particular, the researchers are interested in how cellulase aligns and attaches itself to cellulose, how the separate parts of cellulase — called protein domains — work with one another, and the effect of water on the overall system. And they are also investigating which of the over 500 amino acids that make up the cellulase protein are central to the overall workings of the “machine” as it chews up cellulose.

To the biochemists in the collaboration, the simulation is like a stop-motion film of a baseball pitcher throwing a curveball. In real life, the process occurs too quickly to evaluate visually, but by breaking down the throw into a step-by-step process, observers can find out the precise role of velocity, trajectory, movement, and arm angle. In simulations on SDSC's DataStar supercomputer, the researchers have modeled a portion of the enzyme, the type 1 cellulose binding domain, on a surface of crystalline cellulose in a box of water. The modeling revealed how the amino acids of the domain orient themselves when they interact with crystalline cellulose as well as how the interaction disrupts the layer of water molecules that lie on top of the cellulose, providing a detailed glimpse of this intricate molecular dance.

The NREL cellulose model includes more than 800,000 atoms, counting the surrounding water, the cellulose, and the enzyme, an enormous structure to model computationally. According to the researchers, an accurate understanding of what is happening will require scaling up their simulation to cover 50 nanoseconds of the reaction, an extremely long time in molecular terms and highly demanding in computational terms (there are one billion nanoseconds in one second). To reach 50 nanoseconds, the researchers must calculate 25 million time steps at two femtoseconds per time step (one femtosecond is one quadrillionth of a second).
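The step count follows directly from the unit conversion: one nanosecond is one million femtoseconds. A minimal sketch of that arithmetic, with a hypothetical helper name chosen for illustration:

```python
FS_PER_NS = 1_000_000  # one nanosecond is one million femtoseconds


def steps_needed(simulated_ns, timestep_fs):
    """Number of time steps needed to cover simulated_ns at timestep_fs per step."""
    return round(simulated_ns * FS_PER_NS / timestep_fs)


print(steps_needed(50, 2))  # 25000000
```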

However, the sheer size of the model is beyond the limit of the current capabilities of the CHARMM simulation code, which has been difficult to scale as the number of computer processors grows larger, since the code was originally written to model thousands, not hundreds of thousands, of atoms. The SAC partners have worked to enhance CHARMM to scale to larger numbers of atoms and to run on some of the largest resources available to academic scientists in the U.S., including DataStar (recently expanded to 15.6 teraflops), TeraGrid (4.4 teraflops), and BlueGene (5.7 teraflops).

To determine how much time the large-scale CHARMM simulations will require, benchmark calculations were run on DataStar. A series of 500-step simulations of a 711,887-atom system, each covering one picosecond (one thousandth of a nanosecond), required 12 minutes on 64 processors and 9 minutes on 128 processors. Because of scaling issues, a full nanosecond run will take 1,000 times as long as these benchmarking runs, so that full-scale simulations are expected to require nearly one million CPU hours.
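The CPU-hour estimate can be reproduced from the benchmark figures. The sketch below assumes, as the article does, that run time grows in proportion to the number of picoseconds simulated, and it uses the 50-nanosecond target mentioned earlier; the function names are hypothetical.

```python
PS_PER_NS = 1000  # one nanosecond is one thousand picoseconds


def cpu_hours(wall_minutes_per_ps, processors, simulated_ns):
    """Total CPU hours, assuming wall time scales linearly with simulated time."""
    wall_hours = wall_minutes_per_ps * PS_PER_NS * simulated_ns / 60
    return wall_hours * processors


def relative_efficiency(t_small, p_small, t_large, p_large):
    """Parallel efficiency of the larger run measured against the smaller run."""
    speedup = t_small / t_large
    return speedup / (p_large / p_small)


print(cpu_hours(12, 64, 50))    # 640000.0
print(cpu_hours(9, 128, 50))    # 960000.0
print(round(relative_efficiency(12, 64, 9, 128), 3))  # 0.667
```

Under these assumptions, the 128-processor figure of 960,000 CPU hours matches the article's "nearly one million." Doubling the processor count cuts the wall-clock time by only a quarter, so the run finishes sooner but consumes half again as many CPU hours.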

To extend the capabilities of the CHARMM simulation code to this unprecedented scale, SDSC's Giri Chukkapalli, a computational scientist, and Scripps' Michael Crowley, a software developer in Charles Brooks' lab, have reengineered parts of CHARMM to run more efficiently as a parallel, rather than serial, application. In particular, the researchers in the SAC collaboration have targeted a number of subroutines in the code, which are being altered to speed up its performance on 256 and 512 processors.
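Why serial sections matter at 256 and 512 processors can be illustrated with a simple Amdahl-style model, in which run time is a fixed serial part plus a part that divides evenly across processors. The sketch below fits that model to the two DataStar benchmark timings quoted in this article (12 minutes on 64 processors, 9 minutes on 128). The model, the function names, and the extrapolated values are assumptions for illustration only; they are not measurements or projections from the researchers.

```python
def fit_serial_plus_parallel(p1, t1, p2, t2):
    """Fit T(p) = serial + work / p to two (processors, minutes) data points."""
    work = (t1 - t2) / (1.0 / p1 - 1.0 / p2)
    serial = t1 - work / p1
    return serial, work


def predicted_minutes(serial, work, processors):
    """Run time under the fitted model for a given processor count."""
    return serial + work / processors


serial, work = fit_serial_plus_parallel(64, 12.0, 128, 9.0)
print(serial, work)  # 6.0 384.0
for p in (256, 512):
    print(p, predicted_minutes(serial, work, p))
```

Under this toy model, half of the 12-minute run on 64 processors is time that does not shrink as processors are added, so adding processors alone yields diminishing returns. That is the kind of behavior that reworking serial subroutines into parallel ones is meant to address.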

Outreach on the part of SDSC resulted in this large cross-agency collaboration based on a team approach, with interdisciplinary participation by biochemists from NREL, enzymologists and carbohydrate chemists from Cornell, software developers from TSRI, and computational scientists at SDSC. To validate and gauge the accuracy of the CHARMM simulations, the models are studied by James Matthews and John Brady of Cornell, and Linghao Zhong at Penn State, who compare the simulated version of the overall action of the cellulase complex with experimental results. Similarly, the chemists at NREL, including Mark Nimlos, Mike Himmel, and Xianghong Qian at the Colorado School of Mines, interpret the biochemical findings. In addition to assisting with the software development and scaling to be able to run larger simulations, SDSC is also the key site for computation since the center houses compute resources such as DataStar, with capabilities far beyond those available at the other collaborators.

“We were looking for opportunities for collaboration with other agencies,” said Cleary. “SDSC has unique expertise to offer in improving community codes like CHARMM and other molecular dynamics tools like AMBER.” It turned out that Cleary, along with other SDSC staff, knew some of the researchers who had been working on the cellulase problem at NREL and the other sites. Their work was an ideal fit for an SDSC SAC collaboration, with each group lending its expertise to the project.

The collaboration, funded by the U.S. Department of Energy's Office of Energy Efficiency and Renewable Energy, and the Office of the Biomass Program, fit the mission of SDSC's SAC program: to enhance the effectiveness of computational science and engineering research conducted by nationwide academic users. The goal of these collaborations is to develop a synergy between the academic researchers and SDSC staff that accelerates the researchers' efforts by using SDSC resources most effectively and enabling new science on relatively short timescales of three to 12 months. And beyond the project results, the hope is to discover and develop general solutions that will benefit not only the selected researchers but also their entire academic communities and other high-performance computing users. In this case, beyond being able to model cellulase digesting cellulose to improve the production of ethanol, the improvements to CHARMM are opening the door so that the software, running on cutting-edge hardware systems, can simulate many other large-scale biological systems. In turn, that will allow scientists to pose entirely new questions, opening novel avenues for research, said Cleary.

According to Chukkapalli, “We're excited about the advent of new architectures that provide massive amounts of computing power. The questions from biophysics, structural biology, and biochemistry that have been only dreams in the minds of computational chemists are now on the verge of being studied in realistic simulations.”

Sectors: Academia & Research, Government, Life Sciences

HPCwire is a registered trademark of Tabor Communications, Inc. Use of this site is governed by our Terms of Use and Privacy Policy.

Reproduction in whole or in part in any form or medium without express written permission of Tabor Communications, Inc. is prohibited.
