US 20090077578 A1
Media monitoring and measurement systems and methods are disclosed. Some embodiments of the present invention provide a media measurement system and method that utilizes audience data to enhance content identifications. Some embodiments analyze media player log data to enhance content identification. Other embodiments of the present invention analyze sample sequence data to enhance content identifications. Other embodiments analyze sequence data to enhance content identification and/or to establish channel identification. Yet other embodiments provide a system and method in which sample construction and selection parameters are adjusted based upon identification results. Yet other embodiments provide a method in which play-altering activity of an audience member is deduced from content offset values of identifications corresponding to captured samples. Yet other embodiments provide a monitoring and measurement system in which a media monitoring device is adapted to receive a wireless or non-wireless audio signal from a media player, the audio signal also being received wirelessly by headphones of a user of the monitoring device.
1. A media measurement system comprising:
a plurality of monitoring devices; and
a system adapted to receive and process audio data received from the monitoring devices, the system being adapted to measure media usage by users of the monitoring devices wherein:
a device of the plurality of monitoring devices is adapted to receive a signal that is received wirelessly by headphones being used by an audience member monitored by the monitoring device.
2. The system of
3. The system of
4. The system of
5. The system of
6. The system of
This application is a divisional of U.S. application Ser. No. 11/139,330, filed May 26, 2005, which also claims the benefit of U.S. Provisional Application No. 60/574,836, filed May 27, 2004, both hereby incorporated by reference.
Traditional media measurement systems have focused on directly monitoring channels being utilized by audience members. However, as media consumption patterns have become more complex, channel-centric media measurement is inadequate for many purposes. It may be desirable to track usage of particular media content independent of channel. Furthermore, although “channel” identification in a traditional media measurement system may sometimes be limited to radio or television broadcast station, it is increasingly desirable to track usage of media across several types of media delivery vehicles including radio, television, CD, DVD, computer download, portable media players (e.g. MP3 players, iPod), and other vehicles. Furthermore, with respect to tracking consumption of advertisements, it may be inadequate to simply track channel tuning, because, for example, an audience member may mute a broadcast during commercial periods. Thus simply identifying a broadcast channel does not adequately track whether the audience member listened to a particular advertisement.
Some media measurement systems have used codes to “tag” and track particular content. However, such systems are limited in that they can only track content that has been properly encoded.
With the development of more robust content recognition technologies, some content recognition systems have recently been deployed which do not rely on codes. For example, Philips, Shazam Entertainment, and others have marketed systems for identifying songs played into a mobile phone. Although such systems can be efficiently deployed in the context of song recognition, deploying such systems in the context of media measurement systems poses particular challenges. Continuous searching against a large database of media content can be computationally intensive. Furthermore, such systems, while increasingly robust, still return some erroneous results, particularly in high-noise environments.
At the same time, the media measurement context provides opportunities to utilize data exogenous to a particular audio or video data sample. Such opportunities have thus far been insufficiently exploited for the purpose of efficiently applying existing content recognition technologies in the media measurement context. Thus, an improved media measurement system and method is needed.
Some embodiments of the present invention provide a media measurement system and method that enhances recognition (e.g. in terms of accuracy or efficiency) of the content of a media sample by analyzing information exogenous to the sample. Some embodiments of the present invention provide a media measurement system and method that utilizes audience data to enhance content identifications. Some embodiments analyze media player log data to enhance content identification. Other embodiments of the present invention analyze sample sequence data to enhance content identifications. Other embodiments analyze sequence data to enhance content identification and/or to establish channel identification. Yet other embodiments provide a system and method in which sample construction and selection parameters are adjusted based upon identification results. Yet other embodiments provide a method in which play-altering activity of an audience member is deduced from content offset values of identifications corresponding to captured samples. Yet other embodiments provide a monitoring and measurement system in which a media monitoring device is adapted to receive a wireless or non-wireless audio signal from a media player, the audio signal also being received wirelessly by headphones of a user of the monitoring device.
The novel features of the invention are set forth in the appended claims. However, for purpose of explanation, several aspects of particular embodiments of the invention are described by reference to the following figures.
The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of particular applications and their requirements. Various modifications to the exemplary embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
As illustrated, monitoring device 101 a includes a microphone 105, a media player port 102, a headphone port 103, and a data upload port 104. A monitored audience member wears monitor 101 a during a monitoring period. When the audience member is consuming media from media player 170 a without the use of headphones 160 a (i.e. if media player 170 a has a speaker), monitor 101 captures audio energy acoustically through microphone 105. However, when the audience member desires to use headphones 160 a to receive media content from media player 170 a (e.g. an MP3 player, iPod, CD player, DVD player, television, radio, computer, or other media player) he or she can plug headphones 160 a into headphone port 103 and plug media player 170 a into media player port 102. In that case, monitor 101 a captures audio energy through media player port 102. Microphone 105 includes a microphone and associated microphone port. However, in alternatives, a monitor may simply have a microphone port into which a microphone may be plugged or linked via wireless connection. Thus the term “microphone port” will herein refer to any electronics capable of receiving energy captured through a microphone, whether or not a microphone is built into the monitoring device itself.
As another example, headphones 160 b are adapted to receive a content signal 171 via wireless transmission 171 from media player 170 b. Monitor 101 b is adapted to receive the same content signal 171 from media player 170 b. In some embodiments, a syncing process allows monitor 101 b to adapt along with headphones 160 b and media player 170 b to changes in signal 171 so that monitor 161 b continues to receive the same signal as headphones 160 b. In some embodiments, syncing may be accomplished via a wireless and/or automatic process, such as providing a monitor that responsively changes tuning based on tuning changes between the media player and the headphones.
Audio data captured by monitors 101, along with associated monitor data (e.g. device ID) (collectively, “audio data/monitor data” 144) may be uploaded to system 100 through upload port 104, directly, via memory device transfer (e.g. flash memory card, floppy disk, CD, etc.) or through a network such as network 150. The audio data uploaded from a monitor 101 may be raw audio data, or may be audio data that has undergone one of a variety of levels of processing prior to upload. For example, the uploaded audio data may include parameters useable to calculate audio signatures (sometimes referred to as audio fingerprints) and landmarks or other values useful in the content recognition process. Alternatively, the audio data may comprise pre-calculated signatures, landmarks, or other data useful for content recognition. On the other hand, in some embodiments, the uploaded audio data may simply represent a raw signal corresponding to audio energy received at monitor 101.
System 100 is adapted to receive some audience data 143 externally. For example, audience data collected by systems and associated monitors other than those of system 1000 may be utilized by system 100; data based on externally collected and/or externally analyzed demographic, psychographic, or other audience related data might also be received as part of external audience data portion 143. Other audience data may be generated internally by audience data processing module 123 alone or in combination with measurement analysis module 129 and/or other system components.
Also, system 100 is adapted to receive media player log data 146. Media player log data 146 may be generated by a media player such as media player 170 a or 170 b and sent to system 100 through direct connection, memory device transfer or via network 150. Also, a third party might collect such log data which then may be made available through direct connection, memory device transfer, or via network 150. Media player log data 146 includes data logging content played on the media player from which the log data was generated. Although log data 146 does not necessarily reflect what content was actually heard, to the extent the media player generating the log data is used primarily by the audience member associated with the corresponding monitor 101, it can provide a useful basis for initial testing of captured content as will be described further below.
System 100 is also adapted to receive known content 141 and content meta data 142. Content 141 may include media files associated with a known piece of media content (e.g. a song, movie, commercial, television show, video game, etc.). However, it will be understood by those skilled in the art that “content” in the sense of the known content 141 received by system 100 may, in some embodiments, simply refer to data about signals representing the content and does not necessarily refer to a stored version of the media content itself; e.g. “content” may be stored signatures and landmarks derivable from a song's audio, but not necessarily an audio file useable to play the song itself. On the other hand, in other embodiments, a system such as system 100 may be adapted to receive a useable audio file and then derive data from that file such as signatures, fingerprints, landmarks, etc. that may be readily searched during the content recognition process. Content meta data 142 includes various data relating to content 141. For example, a portion of content 141 might include a particular song as recorded by a particular artist. Corresponding content meta data 142 for that song might include the various albums or other collections in which that song appears; various radio broadcasts (including station and broadcast times) playing the song; movies, advertisements or video games including the song; other songs that have sampled the song; and various other related information. In the context of a television program, meta data also might include, for example, segment identifications and corresponding time lengths. By distinguishing content data from content meta data in this manner, it is possible to store content data for a single content ID only once, and then add various pieces of meta data as appropriate. For example, a TV program, or program segment, might initially have meta data relating to particular broadcasts. However, at a later date, meta data might be added relating to, for example, a DVD version of the program, without needing to store the program content itself more than once.
Turning to the details of system 100, audio data and monitor data (e.g. device ID) is received at sample control processing module 121. Module 121 provides monitor data, portions of which might be audience data, to processing module 123. Module 121 also divides the audio data into samples and determines which samples should be submitted for further content recognition testing. In alternative embodiments, the audio data may already have been divided into samples prior to receipt by system 100 (e.g. at monitor 101). In such alternatives, a module such as module 121 might either use the provided divisions or combine the data and then re-divide it into samples as determined by the system. Module 121 may be used either to submit a sample to initial test and filtered search module 122 or to content recognition system 124.
Initial test and filtered search module 122 conducts one or more initial identification attempts. Several different types of initial identification attempts might be performed. Module 122 may test a captured sample against content associated with the most recently identified content. As another possible initial test, module 122 may test the sample against content identified from a media player log file. Another initial identification test may be carried out in a variety of ways. For example, filtered search module 122 might use audience data from audience data processing module 123 to form parameters and then pass those parameters to content recognition system 124 which then uses the parameters to conduct a filtered search of its content. Alternatively, module 122 might be adapted to utilize an instance of content recognition system 124 that includes content pre-selected based upon relevant audience data. Some examples of relevant audience data that may be used either to form search parameters or to construct targeted content include:
After completing the initial identification attempt, filtered search module 122 passes the identification result to ID result module 125. If an identification has been made, the result is passed to play stream generator 126 and to sample control processing module 121. If no identification has been made, then content recognition system 124 performs a search using a larger portion of its content to identify the data sample. The result is passed to ID result module 125 which in turn passes the result to play stream generator 126 and to sample control module 121. Content recognition system 124 is adapted to implement one or more known content recognition methods that identify content by extracting parameters from a media signal and applying an algorithm to those parameters to search for a content match. Those skilled in the art will recognize that many such methods and algorithms exist. One such example is described in U.S. patent application Ser. No. 09/839,476 entitled “System and Methods for Recognizing Sound and Music Signals in High Noise and Distortion” by Wang et al. and published Jun. 27, 2002 with publication number US2002/0083060. Aspects of other such examples are described in the following published PCT applications by Konin-Klijke Philips Electronics N.V.: “Fingerprinting Multimedia Contents” (WO2004/044820, published May 27, 2004), “Fingerprint Extraction” (WO2004/030341 published Apr. 8, 2004), and “Improvements in and Relating to Fingerprint Searching” (WO2004/040475 published May 13, 2004). Such distinct methods and algorithms may have varying degrees of efficiency, accuracy, and applicability to certain types of content (or different settings of a particular algorithm might have different efficiency depending on content type). Thus, in some embodiments, a system such as system 100 may intelligently select among multiple such algorithms (or multiple settings of a single algorithm) based on expected content characteristics. For example, algorithm A might be faster and more accurate at identifying silence than algorithm B, and thus a system such as system 100 of
Play stream generator 126 generates a raw play stream relating a sequence of samples to content identification results (for an example of a portion of a raw play stream, see
In alternative embodiments, many of the system components and corresponding functions performed within a system such as system 100 might instead be implemented on the monitoring devices such as monitoring device 101. For example, many of the functions performed by sample control processing module 121 might instead be performed at a monitoring device. To cite but one example in more detail, if a monitoring device maintains an ongoing reciprocal connection to obtain sample identification results of a media measurement system, then the monitoring device can select which samples to send to the system for recognition analysis. The monitoring device might then also adjust parameters such as sample resolution, the length of a sample time window, and the recognition algorithm selected. This is just one example of how the illustrated embodiment might be modified to shift system components from a central system to a monitoring device. Many other such variations will be apparent to those skilled in the art.
Also, it will be understood that although
At step 210, characteristics of the audio data are analyzed to attempt to identify the source of transmission or storage of the media played by an individual based on audio characteristics of that signal. For example, the process attempts to recognize particular types of encoding or compression, or identifies sound associated with screen refresh on a television or computer monitor, or recognizes compression or frequency range of FM or AM radio, or CD or DVD. If the storage or transmission medium is identified, the results of step 210 can be added to a play list and/or utilized in step 211 to identify channel (see below for further description of step 211).
At step 202, identification of a sample in a series of samples is attempted via initial test methods. In particular, step 202 determines whether the audio data in the tested sample matches data associated with the content ID corresponding to the most recently identified sample. Step 202 may also or instead test the sample against content IDs obtained using data from a media player log file associated with the audience member using the corresponding media monitor. Step 203 determines whether successful identification occurred at step 202. If yes, then the identification result is provided to step 206 and 207. If no, then step 204 searches for a content match against targeted content selected based at least in part upon additional audience data. Step 205 determines whether step 204 obtained an identification. If yes, then the identification result is provided to steps 206 and 207. If no, then step 208 searches for a content match against a larger portion of the content. With respect to steps 202, 204, and 208, there are two possible results: an identification has been made or no identification has been made. No identification at steps 202 and 204 leads to the further identification attempt at step 208. Whatever the result is at step 208 (identification or no identification) that result is passed to steps 207 and 206.
At step 206, an identification result is used to adjust sampling control and selection parameters for one or more subsequent samples. Parameters that might be adjusted include resolution, backtrack selection, sample time window, or recognition algorithm selected. For example, if a sample has been identified successfully, and the identification indicates a match with the prior sample, the sampling resolution might be decreased from ⅕ to 1/10, meaning that, in a sequence of samples, nine samples rather than four are skipped before selecting the next sample to analyze for identification. As another example, if an identification result does not match the result of the prior sample identified, then step 206 might initiate a back track analysis and select an intervening sample that has not yet been analyzed. Step 201 would then pass the sample selected from backtracking to step 202 for attempted identification. As another example, resolution might be increased when recent identification results suggest an extended period of silence or absence of known content.
As yet another example, once media content for a limited series of samples has been identified, the time window size for each sample might be decreased from, for example, 5 seconds to 3 seconds. This would mean that samples submitted for attempted identification would be shorter in length. This can create efficiencies because once the content is identified, the process needs only to detect a change in content, and this can, in some cases, be accomplished utilizing a smaller amount of audio data, thereby preserving system resources.
Adjustment of sampling parameters is described further below in the context of
Continuing with the description of
Referring to process 300, depending on available information, either a play stream is made available to step 301 or a play list is made available to step 302. Step 301 prepares a play stream for further processing necessary for channel matching by scrubbing (i.e. reviewing for missing or incorrect data) the play stream using non-channel data. Step 302 prepares a play list for further processing necessary for channel matching by scrubbing (i.e. reviewing for missing or incorrect data) the play list using non-channel data. With respect to steps 301 and 302, the non-channel data selected might be selected particularly for the purpose of better preparing the play stream or play list for a channel matching process. Step 303 receives either a raw (i.e. “not scrubbed”) play stream or play list directly or receives a scrubbed play stream or play list from step 301 or 302. If necessary, step 303 converts received data to a format useful in matching to a channel (e.g. converts a play stream to a play list where a play list format is useful for channel matching, or converts a play stream or play list to a mathematical representation that is useful for channel matching).
Step 304 attempts to match the play stream or play list (or corresponding data format representing the play stream or play list) against known channel data to identify a channel (e.g., an album, samples of an album presented for marketing—e.g. on a web page—, radio broadcast, television broadcast, theater version of a movie, DVD version of a movie, etc.) associated with the elements of the play list or play stream. Step 306 determines whether a channel match was achieved. If yes, then step 305 uses channel data and non-channel data to scrub or further scrub the play stream or play list for purposes of creating a clean play list. If the play list or play stream has already been scrubbed with non-channel data in either step 301 and 302, then the play stream or play list may not need to be further scrubbed with non-channel data at step 305. In that case, step 305 just uses channel data to further scrub the play stream or play list. However, it is possible that scrubbing the play list or play stream with non-channel data for purposes of creating data useful for channel matching will be somewhat different than doing so for purposes of creating a clean play list, thus non-channel data is referenced again in the illustration of claim 305.
If step 306 determines that a channel match was not achieved at step 304, then step 307 scrubs the play stream or play list with non-channel data to create a clean play list. Step 309 determines whether including deduced user actions on the clean play list is desired. If yes, then step 308 creates a clean play list including deduced user actions. If no, then step 310 creates the clean play list without including deduced user actions.
Sample sequence data may be used to “scrub” Raw Play Stream 410. By analyzing the sequence of data on Raw Play Stream, it is possible to locate and correct any identifications that are likely to be incorrect or to supply missing identifications where the system has not otherwise been able to provide an identification.
Sample sequence data can be used to derive portions of Clean Play Stream 450 from raw play list 410 in the following manner: Referring to Raw Play Stream 410, the content IDs associated with samples 1, 3, 4, 6, and 7 all indicate the same media content referenced as “song51.” The content ID value associate with sample 2 indicates that no content has yet been successfully associated with that sample and the content ID value associated with sample 5 indicates “movie81.” Given this pattern of sample sequence data, it is reasonable to assume that the person using the corresponding monitor listened to song51 through the time period associated with the audio data of samples 1-7, and thus, the “scrubbed” version of the play stream, i.e. Clean Play Stream 450, indicates song51 for each of samples 1-7.
Information about the sequence of content in a play stream can be used to generate a play list and identify a channel, and the identified channel data can also potentially be used to help further “scrub” a play stream. For example, the sequence of consumed content represented on Play Stream 450 is searched against known channel data (e.g. particular CD albums, broadcasts, or other particular storage or transmission media carrying a particular sequence of content). In this example, simplified for illustrative purposes, known channel data includes channel data 470, associated with channel ID “radio 35,” and channel data 480, associated with channel ID “album12.” Referring to channel data 470, each row relates a content ID to a start time and end time. In this example, start and end times are referenced with respect to an arbitrary start time value “C(0).” For some channels (e.g. CDs, DVDs, video games) only such arbitrary reference times will be available. For other channels (e.g. broadcasts, concerts, etc.), absolute time might be both available and useful. However, for the purpose of the analysis illustrated in this example, times that are relative only and not necessarily absolute are sufficient.
With respect to further “scrubbing” the Raw Play Stream 410, in this example, channel data may be used to correct the apparent misidentification of sample 29. Note that the sample sequence data alone in this example does not provide a clear basis for correcting sample 29. Sample 28 is identified as “ad49,” sample 29 as “song81,” and sample 30 as “song 35.” However, given the sequence “song21, ad49, and song35” is associated with a known sequence of the radio broadcast referenced as “radio 35,” there is a basis in channel data 470 for believing that the proper content ID for sample 29 is “song35” rather than “song81.” Thus, Clean Play Stream 450 indicates a change relative to Raw Play Stream 410 with respect to the content ID for sample 29. Note that a clean play stream and clean play list are not necessarily generated sequentially. For example, to the extent a channel identification and associated channel data are necessary to “scrub” a raw play stream, portions of a clean play list may be determined as part of obtaining data necessary to make channel selection. In the present example, “radio 35” is identified based on a particular content sequence, and that information helps populate Clean Play Stream 450 with a corrected content ID value for sample 29 relative to the sample 29 content ID value on Raw Play Stream 410.
Audience data may also be used in the “scrubbing” process. For example, if “song81” is radically different from the user's known consumption habits and “song35” is within the user's known consumption habits, another basis might be provided for suspecting that “song35” rather than “song81” is the correct content ID for sample 29 of Play Stream 410.
It will be understood by those skilled in the art that in the context of data sets that might, in alternative examples, make up a “raw play stream,” “a clean play stream,” or a “clean play list,” the particular organization, division, and content of the data may vary considerably from that illustrated in
Continuing with the description of
Play List 560 contains a row for each internally continuous media consumption event. The start log time and end log time for each such event are listed in each row. If an entire piece of content (e.g. a TV program segment, or an entire TV ad) is listened to without interruption, that would correspond to a row of Play List 560. On the other hand, if the data from Play Stream 510 suggests a discontinuity within consumption of a particular piece of content, such discontinuities form boundaries between consumption events that form rows of List 560. The meaning of these concepts may be clarified by discussing further details of the illustrated example.
Discontinuities in consumption of the identified media may be identified by comparing the progression of sample log times to progression of corresponding content offset times in Play Stream 510. For example, referring to rows 7 and 8 of play stream 510, log times progress from t0+6 to t0+7; however, the corresponding content offsets progress, with respect to the start of TV program 27, from (0)+3 to (0)+9. Had the media consumption from one sample to the next been continuous, one would expect that the second offset value (in row 8) would have been (0)+4 rather than (0)+9. Therefore, a boundary between media consumption events can be deduced. Thus, rows 3 and 4 of List 560 indicate two consumption events with respect to the same piece of media content (TV program 7). By contrast, row 1 and row 2 of Play Stream 510 are, on Play List 560, collapsed into a single row (row 1) because the content offset progression from row 1 to row 2 of Play Stream 510 suggests continuous consumption of the same piece of media content.
Referring to Play List 560, the “end log time” of one event also defines the “start log time” of the next event. In other examples, particularly if continuous time is not assumed, a play list might be constructed using start log times that are different than the end log time of the previous consumption event.
Referring to Clean Play List With Deduced Actions 590, it can be seen that a play list can be constructed that supplements the information in Play List 560 with information about the apparent actions of the audience member. Rows 2, 4, 6, 8, and 11 of List 590 include such information about deduced actions.
To the extent such play-altering actions occur within the same piece of content, audience member actions may be deduced without content sequence data for a particular channel. For example, referring to rows 2, 3, 4, 5, and 6, of Play Stream 510, the content offset values of rows 2 and 6 indicate advancing for one time unit within TV program 27 while no content ID (or corresponding offset value) is available for rows 3-5. From this information, and the fact that log times of rows 3-5 together correspond to progression through three time units, it is reasonable to deduce that the audience member paused a media player for three time units from log time t0+2 to log time t0+5. That deduced action is recorded in row 2 of Play List 590.
On the other hand, in other contexts, it may be necessary to refer to content sequence data for a particular channel to be sufficiently confident in deducing the details of an audience member's play-altering activity. For example, referring to Play Stream 510, data in rows 15-17 indicate one or more consumption discontinuities in which a play-altering action appears to have crossed a content boundary. Row 15 indicates consumption of the beginning of ad23; row 16 indicates no identified content, and row 17 indicates consumption of ad24 from a point two time units after the beginning of that ad. From this information alone, it is difficult or impossible to determine how much play-altering activity occurred and whether any content pieces were skipped altogether. However, content sequence data for an identified channel may be used to supplement the information gap. Data 570 lists content sequence data for a channel identified as “TV C4.” “TV C4” may be identified during a method portion such as step 211 of process 200 of
Sequence Data 570 for channel TV C4 indicates that ad23 as broadcast lasted three time units (see row 2), ad25 as broadcast lasted three time units (see row 4), and between ad23 and ad25, another ad, ad24, was broadcast and lasted three time units. From this information, and the information in either Play Stream 510 or Play list 560, it can be deduced that the audience member heard the very beginning of ad23, but then fast forwarded seven time units (during the span of 1 time unit based on the log time data) to just past the beginning of ad25. This deduced action is recorded in row 11 of List 590.
It should be noted that, to the extent actions take less than a single time unit, rows may be added to List 590 relative to List 560 to record such actions. For example, referring to row 4 of List 590, because no log time elapsed between the end of the event in row 3 and the beginning of the event in row 5, row 4 has been added showing the “skip forward 5 units.” That the user “skipped” forward rather than “fast” forwarded can be deduced from the fact that while there is no content ID gap between rows 3 and 4 of List 560, there is a gap in the content offset times for those rows. Thus, though the log times and content ID alone suggest continuous consumption, the gap in offset times suggests a “skip” forward that took the same or less time than the length of a single time unit.
Those skilled in the art will recognize that the method aspects just described in the context of generating and using Stream 510, List 560, Data 570, and List With Deduced Actions 590 reflect just one example of how play-altering actions might be deduced using systems and methods consistent with those of the present invention. To cite but one example, rather than explicitly generate a play list, mathematical representations of play stream data and content sequence data may be used to identify discontinuities and deduce play-altering activities. To cite but one other example, play-altering might be deduced from a pre-generated play list including content offset data without needing to analyzing play stream data from which the play list was generated.
As illustrated, sample set 600 reflects a division of audio data into sixteen samples, three of which are selected for analysis. The samples selected for recognition analysis are referenced generally as the “n0 th, n1 th,” and “n2 th” samples. Samples between the selected samples are not separately numbered, but are marked off between vertical lines along the horizontal length of timelines structure 600 as illustrated. Each sample has a sample time window length “i.” Furthermore, the sample set has varying “resolutions” including r0=⅕ between the n0 th and n1 th samples and r1= 1/10 between the n1 th and the n2 th samples. In other words, “resolution” here refers to the portion of samples being selected during a period of time. In the present example, a process portion such as steps 201 and 206 of process 200 of
Although not illustrated in
The code or a copy of the code contained in computer program product 760 may be stored in memory 710 for execution by processor 720. Computer system 700 also includes I/O subsystem 730 and peripheral devices 740. I/O subsystem 730, peripheral devices 740, processor 720, and memory 710 are coupled via bus 750.
Those skilled in the art will appreciate computer system 700 illustrates just one example of a system in which a computer program product in accordance with an embodiment of the present invention may be implemented. To cite but one example of an alternative embodiment, execution of instructions contained in a computer program product in accordance with an embodiment of the present invention may be distributed over multiple computers, such as, for example, over the computers of a distributed computing network.
Although particular embodiments have been described in detail and certain variants have been noted, various other modifications to the embodiments described herein may be made without departing from the spirit and scope of the present invention. Thus, the invention is limited only by the appended claims.