Social Icons

Saturday, July 7, 2012

Mass Effect 2: A Study in Dynamic Music



NOTE: This article is adapted from work originally published on Eternal Sonate, 02/29/2012.

A few years ago, if I told you about video game music as an area of academic study, you'd likely scoff. In most people's minds, video game music was no more than a series of bleeps and bloops, hardly compelling enough for any sort of scholarly endeavor. However, recent years have proven the contrary, with a number of groups and academic institutions among those to pursue this area of knowledge. With the video game market as a multi-billion dollar industry and continuing to expand, it's perhaps no wonder that video game music is becoming an area of academic interest.

In order to focus on a specific example, I decided to center my discussion of video game audio around one of my favorite video games in recent history: Mass Effect 2. Composed by a sound team led by the illustrious Jack Wall (Unreal 2, Myst IV, Jade Empire, etc.), the soundtrack is a respectable collection of music on its own, its diverse and energetic music setting it apart in a world of space operas. However, in-game, amid the ever-evolving action and drama, is where this soundtrack shines most. The Mass Effect 2 soundtrack enhances the gaming experience in many ways, but most interesting to me is its stellar application of one of the most popular and exciting topics in current video game audio research: the concept of dynamic music.

"Dynamic music," generally speaking, refers to music that changes in real-time due to interaction with the audience. Thus far, dynamic music is mostly confined to the world of video games; unlike other forms of media, such as film and radio, where there is very little or no interaction with the audience, the "audience participation" element inherent to video games makes this medium a perfect vessel for this form of interactive audio. Speaking from an engineer's point of view, this so-called "nonlinear" audio can be modeled as a sort of feedback loop (see below diagram). Whereas most traditional forms of audio in media have an uninterrupted stream from the transmitter (the musical composition) to the receiver (the audience), interactive audio introduces an additional projection from the receiver back to the transmitter, granting the audio the ability to evolve in real-time based on the audience's behavior.



While dynamic music in video games might sound like a recent innovation, it has in fact been present for much of video game history. The PSG-, FM- and MIDI-based sound cards of older consoles all generated tones in real-time, like an electric keyboard. While difficult to compose for, these systems made it very simple to manipulate the music dynamically. One might even argue that dynamic music was introduced in the very first game with a true soundtrack, 1978's Space Invaders, whose famous descending tetrachord speeds up as more enemies are eliminated. A clearer example is the 1982 game Dig Dug, whose musical content progresses only when the player is moving. Perhaps a more recognizable example can be seen in the Genesis-era Sonic the Hedgehog games; whenever the player collects a set of speed shoes, the tempo of the music would increase for a certain amount of time. In fact, even if the background music itself changed during this time interval, the tempo nonetheless remained elevated until the speed shoes' tenure expired.

An important turning point in video game music history came in the mid-1990s with the switch to Redbook audio, better known as CD audio. In all conventional respects, Redbook was a big step forward for video game music; no longer were video game composers restricted by the sounds that the consoles' sound cards could generate, and this led to the capability for live orchestral soundtracks such as that of 1997's Total Annihilation. However, Redbook audio was space-intensive, and because it was pre-recorded, it could not be manipulated in real-time in the same manner as the older audio technology. Thus, while compositional styles no doubt flourished, the dynamic music found in many previous-generation games effectively ground to a halt.

In more recent generations, however, increasing computing power has made it possible for dynamic music to reemerge. Certainly, the obvious examples are video games in which interactive music is the focus, including such games as Electroplankton and the Guitar Hero series. Even for non-audio-focused games, however, sound teams have devised clever ways in which to introduce dynamic elements, ranging from simple techniques (fading, "stinger" chords/passages) to more compositionally challenging ones (cue-to-cue transitions, layering). While newer approaches to dynamic music have yet to rival the level of interactivity in the 8- and 16-bit generations, it is certainly headed in a promising direction.

This leads us back to Mass Effect 2, whose soundtrack contains one of the most effective, yet simplest, implementations of dynamic music I have encountered. The dynamic aspects of Mass Effect 2's music can be most readily appreciated in the battle missions, constantly-changing environments where the amount of action at any time is largely controlled at the player's own pace. Because of the wide range of activity in each of these levels, from quiet strolls to explosion-filled firefights, it seems essential to have music that can reflect these frequent mood shifts.

Mass Effect 2 accomplishes this dynamic behavior by adjusting the music to reflect the amount of onscreen activity. (Some speculation follows, I admit, but I think it's a correct simplification of what's going on.) At any point in time, the game examines the total amount of activity currently present and assigns that value to one of several discrete "activity levels." In turn, each "activity level" corresponds to a particular musical texture, which is switched in real-time as necessary.

At the most fundamental level, this technique is equivalent to layering--increasing activity levels correspond to an increase in textural complexity. However, rather than limit themselves to simply adding and removing voices, the Mass Effect 2 composers wrote individual pieces of music corresponding to each activity level. Below are some examples from one particular battle, arranged in increasing activity level:

Tali -- Mass Effect 2





If not composed carefully, using a separate piece of music for each activity level can lead to some awkward transitions. However, the cohesive compositional style of the entire set prevents this from happening. You may have noticed that the bass pattern stays constant in all four tracks, providing a powerful foundation upon which the remaining texture is built. The same is true of the harmony (or implied harmony, as the case may be). The other voices may change substantially between activity levels, but by holding strong stylistic and structural elements constant, the composers achieve a remarkable sense of consistency. No wonder the in-game transitions sound so flawless.

The best way to appreciate any concept is to see it in action, so here's a clip of me playing one of the combat missions in Mass Effect 2. I've labeled each of the activity level changes encountered in this clip; try to listen for the musical changes during each transition. Also, please forgive my terrible playing; I admit that my gaming skills haven't been the best of late.



For those interested in exploring these activity levels some more, I've created an application where you can manually switch between the four activity levels and listen to how the music adjusts dynamically. Unfortunately, I couldn't get it working as an Internet applet, but below is a download link if you're interested in poking around.


I think we stand at a very exciting point in the story of interactive audio. Modern gaming technology such as the Kinect offer novel methods of feedback to interact with game audio, and a number of existing compositional techniques, such as granular synthesis, hold exciting possibilities for dynamic music. For the moment, however, dynamic music is still emerging, and I'll be pleased to see any game make a valiant effort to introduce some dynamic elements in its soundtrack. If the result is as effective and eloquent as in Mass Effect 2, all the better.

Questions? Critiques? Something I didn't get right? Something you'd like me to elaborate on? Please drop a comment below!

For more information on video game music research and interactive music, please refer to the archive of video game music publications at GamesSound.com, particularly this article on nonlinear video game audio.

~David

9 comments:

  1. As an aspiring sound designer and composer for video games, I found this analysis to be really eye-opening and interesting. I'm still figuring out how to carefully compose tracks to make them blend seamlessly between various intensity levels. Not that great at it yet, but I'd like to think that I'm improving.

    Unfortunately, the link to the Dynamic Layering Test .exe seems to be broken; would you mind re-uploading it? It'd be awesome to get my hands on it again.

    ...also I can see you're a fan of Tali. XD

    ReplyDelete
    Replies
    1. Thanks for the comment! I've just updated the links to the Dynamic Layering Test (they finally disabled my university account, it looks like T_T); so let me know if they work for you.

      It's great to meet someone else interested in writing music for video games. As someone who shares your interest, it's been really neat to observe the types of strategies video game composers are employing to make use of video games' inherent interactivity. It's definitely a challenging problem, but I'm confident that video game composers will find exciting and creative ways to solve it.

      And haha, yes, I think Tali is awesome. XD

      Delete
  2. Thanks! Just wanted to confirm that the download link is in fact working now. My antivirus software keeps complaining that it's malware when I try to open it, but besides that it's fine.

    Also, here's my own input on dynamic music: back in the N64 days when all the music was done with MIDI, transitions were a lot smoother since each instrument could be controlled separately in real time. It kinda surprises me that in this specific ME2 example, the four intensity levels are just four different tracks, with all instruments glued together. To make it cleaner in this particular track, I'd have the bass synth as it's own separate loop since it stays consistent through all intensity levels. This way as the more orchestrated parts fade in, the bass synth still stays at a consistent level, rather than fading itself in and out with the rest.

    I'll quit rambling now, but overall I feel like exporting a track with individual instruments or groups of instruments would offer a lot more flexibility and consistency when it comes to dynamic music. A lot more memory intensive, but I think it could work. Just my own 2 cents.

    ReplyDelete
    Replies
    1. I think the strange fadeouts in that .exe is an artifact of how I manipulated the fadein/fadeout, not how the composers handled it. I only had access to the final tracks from the soundtrack after everything was put together, so it's very possible that the bass is its own track. :P

      I definitely agree that handling individual instruments would afford much more flexibility. To my knowledge, however, most modern games, even those with a good deal of dynamic music, don't handle instruments that way, probably because of the cost associated with manipulating so many tracks on the fly. I definitely know of several older games that operated in the manner you were describing, though. As computing power improves, I predict more composers will start adopting these sort of strategies, however.

      Then again, the cynic in me asks: will the players be able to tell/care about the difference between two completely different tracks and one composed of many different layers that's being manipulated dynamically?

      Delete
  3. Sadly, I don't think most players would notice/care about the difference. I only really care since I'm actively interested in video game music in general.

    I don't think it's necessarily a matter of whether players notice or not; I think the more important aspect is how such a system would impact the way composers approach video game music. I've always composed with the older mindset that I'd be able to dynamically control each track. It's not like I've ever implemented my music in a real game setting (so it probably wouldn't work all that well), but I still feel that the mindset is more freeing, and has a lot more potential to create interesting and varied experiences. Whether players notice or not I feel is more a matter of personal interest of the player.

    ReplyDelete
    Replies
    1. Agreed, and I have no doubt that your mindset is the more interesting from a compositional standpoint. But admittedly, with the way that modern audio is handled, it is easier just to compose several full tracks and switch between them. If players don't notice, I don't particularly fault composers for sticking to what they know.

      Still, it takes only a few forward-thinking video game composers like you to change the status quo :D

      Delete
  4. Heh, well thanks. XD I've still got a long way to go and a lot to learn, but I'd love to help expand and innovate dynamic music systems, or at least make some kind of contribution to them. Audio in general seems so under-appreciated when it comes to games, so I'm excited that one day I might get to work in the field and push it forward.

    ReplyDelete
    Replies
    1. I just put up a new post you might be interested in. It's a YouTube video I found that shows a really neat example of dynamic music.

      I dropped by your SoundCloud account today and really enjoyed what I heard. :) What do you typically use to make your music? While I've been writing music for a while, I've only recently begun to delve into the software and production.

      Delete
  5. Thanks for checking it out. :D I use Cubase 5 as a DAW (looking to get Cubase 7 soon, it looks awesome), and for software instruments I only have Halion Symphonic Orchestra and a bunch of demos from Native Instruments. I'm looking to get Komplete 8 or 9 at some point in the near future, as I find my current VST instruments (especially HSO) to be really limiting and irritating to work with, not to mention that their sounds aren't very focused nor versatile.

    I've messed a bit with the Massive and FM8 demos, and those synths have a huge amount of potential. I get a lot of inspiration from hearing new timbres and textures, since I'm not all that great when it comes to raw musical creativity. :P . What kind of setup are you using?

    ReplyDelete