Added iXML track list parsing

Update README.md
Update wave_ixml_reader.py
2025-12-31 17:00:41 +00:00 · 2019-08-19 11:39:13 -07:00 · 2019-06-29 21:55:19 -07:00 · 2019-06-29 21:50:20 -07:00 · 2019-06-25 20:28:40 -07:00 · 2019-06-25 20:24:29 -07:00
33 changed files with 460 additions and 201 deletions
--- a/.coveragerc
+++ b/.coveragerc
@@ -0,0 +1,13 @@
 [run]
 branch = True
 source = wavinfo 
 [report]
 exclude_lines =
    if self.debug:
    pragma: no cover
    raise NotImplementedError
    if __name__ == .__main__.:
 ignore_errors = True
 omit =
    tests/*
--- a/.travis.yml
+++ b/.travis.yml
@@ -1,20 +1,23 @@
 dist: xenial
 language: python
 python:
 #    - "2.7"
    - "3.6"
    - "3.5"
    - "3.7"
 script:
    - "gunzip tests/test_files/rf64/*.gz"
    - "python setup.py test"
-    - "py.test tests/ -v --cov wavinfo --cov-report term-missing"
+#    - "py.test tests/ -v --cov wavinfo --cov-report term-missing"
 before_install:
    - "sudo apt-get update"
    - "sudo add-apt-repository universe"
    - "sudo apt-get install -y ffmpeg"
    - "pip install coverage"
-#    - "pip install coverage==4.4"
+    - "pip install codecov"
    - "pip install pytest-cov==2.5.0"
-    - "pip install python-coveralls"
+#    - "pip install coverage==4.4"
 install:
    - "pip install setuptools"
 after_success:
-    - coveralls
+    - "codecov"
--- a/README.md
+++ b/README.md
@@ -1,13 +1,12 @@
 [![Build Status](https://travis-ci.com/iluvcapra/wavinfo.svg?branch=master)](https://travis-ci.com/iluvcapra/wavinfo)
-[![Coverage Status](https://coveralls.io/repos/github/iluvcapra/wavinfo/badge.svg?branch=master)](https://coveralls.io/github/iluvcapra/wavinfo?branch=master)
+[![codecov](https://codecov.io/gh/iluvcapra/wavinfo/branch/master/graph/badge.svg)](https://codecov.io/gh/iluvcapra/wavinfo)
 [![Documentation Status](https://readthedocs.org/projects/wavinfo/badge/?version=latest)](https://wavinfo.readthedocs.io/en/latest/?badge=latest) ![](https://img.shields.io/github/license/iluvcapra/wavinfo.svg) ![](https://img.shields.io/pypi/pyversions/wavinfo.svg) [![](https://img.shields.io/pypi/v/wavinfo.svg)](https://pypi.org/project/wavinfo/) ![](https://img.shields.io/pypi/wheel/wavinfo.svg)
 # wavinfo
-The `wavinfo` package allows you to probe WAVE files and extract extended metadata, with an emphasis on 
+The `wavinfo` package allows you to probe WAVE and [RF64/WAVE files][eburf64] and extract extended metadata, with an emphasis on film, video and professional music production metadata. 
 production metadata. 
 `wavinfo` reads:
@@ -27,6 +26,7 @@ In progress:
 [ebu]:https://tech.ebu.ch/docs/tech/tech3285.pdf
 [smpte_330m2011]:http://standards.smpte.org/content/978-1-61482-678-1/st-330-2011/SEC1.abstract
 [ixml]:http://www.ixml.info
 [eburf64]:https://tech.ebu.ch/docs/tech/tech3306v1_1.pdf
@@ -53,104 +53,10 @@ The length of the file in frames (interleaved samples) and bytes is available, a
 >>> (48000, 2, 6, 24)
 ```
-### Broadcast WAV Extension
+## Other Resources
-A WAV file produced to Broadcast-WAV specifications will have the broadcast metadata extension,
+* For other file formats and ID3 decoding, look at [audio-metadata](https://github.com/thebigmunch/audio-metadata).
 which includes a 256-character free text descrption, creating entity identifier (usually the 
 recording application or equipment), the date and time of recording and a time reference for 
 timecode synchronization.
 The `coding_history` is designed to contain a record of every conversion performed on the audio
 file.
 In this example (from a Sound Devices 702T) the bext metadata contains scene/take slating
 information in the `description`. Here also the `originator_ref` is a serial number conforming
 to EBU Rec 99.
 If the bext metadata conforms to EBU 3285 v1, it will contain the WAV's 32 or 64 byte SMPTE 
 330M UMID. The 32-byte version of the UMID is usually just a random number, while the 64-byte 
 UMID will also have information on the recording date and time, recording equipment and entity, 
 and geolocation data.
 If the bext metadata conforms to EBU 3285 v2, it will hold precomputed program loudness values
 as described by EBU Rec 128.
 ```python
 print(info.bext.description)
 print("----------")
 print("Originator:", info.bext.originator)
 print("Originator Ref:", info.bext.originator_ref)
 print("Originator Date:", info.bext.originator_date)
 print("Originator Time:", info.bext.originator_time)
 print("Time Reference:", info.bext.time_reference)
 print(info.bext.coding_history)
 ```
    sSPEED=023.976-ND
    sTAKE=1
    sUBITS=$12311801
    sSWVER=2.67
    sPROJECT=BMH
    sSCENE=A101
    sFILENAME=A101_1.WAV
    sTAPE=18Y12M31
    sTRK1=MKH516 A
    sTRK2=Boom
    sNOTE=
    ----------
    Originator: Sound Dev: 702T S#GR1112089007
    Originator Ref: USSDVGR1112089007124001008206301
    Originator Date: 2018-12-31
    Originator Time: 12:40:00
    Time Reference: 2190940753
    A=PCM,F=48000,W=24,M=stereo,R=48000,T=2 Ch
 ### iXML Production Recorder Metadata
 iXML allows an XML document to be embedded in a WAV file.
 The iXML website recommends a schema for recorder information but 
 there is no official DTD and vendors mostly do their own thing, apart from 
 hitting a few key xpaths. iXML is used by most location/production recorders 
 to save slating information, timecode and sync points in a reliable way.
 iXML is also used to link "families" of WAV files together, so WAV files
 recorded simultaneously or contiguously can be related by a receiving client.
 ```python
 print("iXML Project:", info.ixml.project)
 print("iXML Scene:", info.ixml.scene)
 print("iXML Take:", info.ixml.take)
 print("iXML Tape:", info.ixml.tape)
 print("iXML File Family Name:", info.ixml.family_name)
 print("iXML File Family UID:", info.ixml.family_uid)
 ```
    iXML Project: BMH
    iXML Scene: A101
    iXML Take: 1
    iXML Tape: 18Y12M31
    iXML File Family Name: None
    iXML File Family UID: USSDVGR1112089007124001008206300
 ### INFO Metadata
 INFO Metadata is a standard method for saving tagged text data in a WAV or AVI
 file. INFO fields are often read by the file explorer and host OS, and used in 
 music library software.
 ```python
 bullet_path = '../tests/test_files/BULLET Impact Plastic LCD TV Screen Shatter Debris 2x.wav'
 bullet = WavInfoReader(bullet_path)
 ```
    print("INFO Artist:",    bullet.info.artist)
    print("INFO Copyright:", bullet.info.copyright)
    print("INFO Comment:",   bullet.info.comment)
--- a/docs/conf.py
+++ b/docs/conf.py
@@ -26,7 +26,7 @@ author = u'Jamie Hardt'
 # The short X.Y version
 version = u''
 # The full version, including alpha/beta/rc tags
-release = u'v1.0'
+release = u'v1.1'
 # -- General configuration ---------------------------------------------------
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -6,9 +6,6 @@
 Welcome to wavinfo's documentation!
 ===================================
 .. toctree::
   :maxdepth: 2
   :caption: Contents:
 .. module:: wavinfo
@@ -24,6 +21,17 @@ Welcome to wavinfo's documentation!
 .. autoclass:: wavinfo.wave_reader.WavDataDescriptor
   :members:
 .. toctree::
   :maxdepth: 2
   :caption: Notes:
   metadata_scopes/bext.rst
   metadata_scopes/ixml.rst
   metadata_scopes/info.rst
 Indices and tables
 ==================
--- a/docs/metadata_scopes/bext.rst
+++ b/docs/metadata_scopes/bext.rst
@@ -0,0 +1,65 @@
 Broadcast WAV Extension
 =======================
 .. module:: wavinfo
 .. autoclass:: wavinfo.wave_bext_reader.WavBextReader
   :members:
 Notes
 -----
 A WAV file produced to Broadcast-WAV specifications will have the broadcast metadata extension,
 which includes a 256-character free text descrption, creating entity identifier (usually the 
 recording application or equipment), the date and time of recording and a time reference for 
 timecode synchronization.
 The `coding_history` is designed to contain a record of every conversion performed on the audio
 file.
 In this example (from a Sound Devices 702T) the bext metadata contains scene/take slating
 information in the `description`. Here also the `originator_ref` is a serial number conforming
 to EBU Rec 99.
 If the bext metadata conforms to EBU 3285 v1, it will contain the WAV's 32 or 64 byte SMPTE 
 330M UMID. The 32-byte version of the UMID is usually just a random number, while the 64-byte 
 UMID will also have information on the recording date and time, recording equipment and entity, 
 and geolocation data.
 If the bext metadata conforms to EBU 3285 v2, it will hold precomputed program loudness values
 as described by EBU Rec 128.
 ..  code:: python
    print(info.bext.description)
    print("----------")
    print("Originator:", info.bext.originator)
    print("Originator Ref:", info.bext.originator_ref)
    print("Originator Date:", info.bext.originator_date)
    print("Originator Time:", info.bext.originator_time)
    print("Time Reference:", info.bext.time_reference)
    print(info.bext.coding_history)
 Result: 
 ::
    sSPEED=023.976-ND
    sTAKE=1
    sUBITS=$12311801
    sSWVER=2.67
    sPROJECT=BMH
    sSCENE=A101
    sFILENAME=A101_1.WAV
    sTAPE=18Y12M31
    sTRK1=MKH516 A
    sTRK2=Boom
    sNOTE=
    ----------
    Originator: Sound Dev: 702T S#GR1112089007
    Originator Ref: USSDVGR1112089007124001008206301
    Originator Date: 2018-12-31
    Originator Time: 12:40:00
    Time Reference: 2190940753
    A=PCM,F=48000,W=24,M=stereo,R=48000,T=2 Ch
--- a/docs/metadata_scopes/info.rst
+++ b/docs/metadata_scopes/info.rst
@@ -0,0 +1,32 @@
 INFO Metadata
 =============
 .. module:: wavinfo
 .. autoclass:: wavinfo.wave_info_reader.WavInfoChunkReader
   :members:
 Notes
 -----
 INFO Metadata is a standard method for saving tagged text data in a WAV or AVI
 file. INFO fields are often read by the file explorer and host OS, and used in 
 music library software.
 ..  code:: python
    bullet_path = '../tests/test_files/BULLET Impact Plastic LCD TV Screen Shatter Debris 2x.wav'
    bullet = WavInfoReader(bullet_path)
    print("INFO Artist:",    bullet.info.artist)
    print("INFO Copyright:", bullet.info.copyright)
    print("INFO Comment:",   bullet.info.comment)
--- a/docs/metadata_scopes/ixml.rst
+++ b/docs/metadata_scopes/ixml.rst
@@ -0,0 +1,44 @@
 iXML Production Recorder Metadata
 =================================
 .. module:: wavinfo
 .. autoclass:: wavinfo.wave_ixml_reader.WavIXMLFormat
   :members:
 Notes
 -----
 iXML allows an XML document to be embedded in a WAV file.
 The iXML website recommends a schema for recorder information but 
 there is no official DTD and vendors mostly do their own thing, apart from 
 hitting a few key xpaths. iXML is used by most location/production recorders 
 to save slating information, timecode and sync points in a reliable way.
 iXML is also used to link "families" of WAV files together, so WAV files
 recorded simultaneously or contiguously can be related by a receiving client.
 ..  code:: python
    print("iXML Project:", info.ixml.project)
    print("iXML Scene:", info.ixml.scene)
    print("iXML Take:", info.ixml.take)
    print("iXML Tape:", info.ixml.tape)
    print("iXML File Family Name:", info.ixml.family_name)
    print("iXML File Family UID:", info.ixml.family_uid)
 Result:
 ::
    iXML Project: BMH
    iXML Scene: A101
    iXML Take: 1
    iXML Tape: 18Y12M31
    iXML File Family Name: None
    iXML File Family UID: USSDVGR1112089007124001008206300
--- a/setup.py
+++ b/setup.py
@@ -4,10 +4,10 @@ with open("README.md", "r") as fh:
    long_description = fh.read()
 setup(name='wavinfo',
-      version='1.1',
+      version='1.3',
      author='Jamie Hardt',
      author_email='jamiehardt@me.com',
-      description='WAVE sound file metadata parser.',
+      description='Probe WAVE Files for iXML, Broadcast-WAVE and other metadata.',
      long_description_content_type="text/markdown",
      long_description=long_description,
      url='https://github.com/iluvcapra/wavinfo',
@@ -16,5 +16,8 @@ setup(name='wavinfo',
          'Topic :: Multimedia',
 	  'Topic :: Multimedia :: Sound/Audio',
          "Programming Language :: Python :: 3.5",
-          "Programming Language :: Python :: 3.6"],
+          "Programming Language :: Python :: 3.6",
-      packages=['wavinfo'])
+          "Programming Language :: Python :: 3.7"],
      packages=['wavinfo'],
      install_requires=['lxml']
      )
--- a/tests/init.py
+++ b/tests/init.py
@@ -1,2 +1,3 @@
 from . import test_wave_parsing
--- a/tests/test_files/pyramix/Testfile_PYR_BWF.wav
+++ b/tests/test_files/pyramix/Testfile_PYR_BWF.wav
--- a/tests/test_files/rf64/Pro_Tools_2019_RF64.wav.gz
+++ b/tests/test_files/rf64/Pro_Tools_2019_RF64.wav.gz
--- a/tests/test_files/rf64/Testfile_SEQ_RF64_Big.wav.gz
+++ b/tests/test_files/rf64/Testfile_SEQ_RF64_Big.wav.gz
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr1.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr1.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr1_D.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr1_D.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr2.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr2.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr2_D.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr2_D.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr3.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr3.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr4.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr4.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr7.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr7.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr8.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_Tr8.WAV
--- a/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_TrL_R.WAV
+++ b/tests/test_files/zoom_F8/DISCRETE/Scene_1_2AG-T001.TAKE/Scene_1_2AG-T001_TrL_R.WAV
--- a/tests/test_files/zoom_F8/MULTI/Scene_1_2AG-T001.WAV
+++ b/tests/test_files/zoom_F8/MULTI/Scene_1_2AG-T001.WAV
--- a/tests/test_wave_parsing.py
+++ b/tests/test_wave_parsing.py
@@ -1,46 +1,20 @@
 import os.path
-import json
+import sys
 import subprocess
 from subprocess import PIPE
 from unittest import TestCase
 from .utils import all_files, ffprobe
 import wavinfo
-
+        
 FFPROBE='ffprobe'
 def ffprobe(path):
    arguments = [ FFPROBE , "-of", "json" , "-show_format", "-show_streams", path ]
    process = subprocess.run(arguments, stdin=None, stdout=PIPE, stderr=PIPE)
    if process.returncode == 0:
        output_str = process.stdout.decode('utf-8')
        return json.loads(output_str)
    else:
        return None
 class TestWaveInfo(TestCase):
    def all_files(self):
        for dirpath, dirnames, filenames in os.walk('tests/test_files'):
            for filename in filenames:
                name, ext = os.path.splitext(filename)
                if ext in ['.wav','.WAV']:
                    yield os.path.join(dirpath, filename)
    def test_sanity(self):
-        for wav_file in self.all_files():
+        for wav_file in all_files():
            info = wavinfo.WavInfoReader(wav_file)
            self.assertTrue(info is not None)
    def test_fmt_against_ffprobe(self):
-        for wav_file in self.all_files():
+        for wav_file in all_files():
            info = wavinfo.WavInfoReader(wav_file)
            ffprobe_info = ffprobe(wav_file)
@@ -56,19 +30,26 @@ class TestWaveInfo(TestCase):
                self.assertEqual( info.fmt.byte_rate     , byte_rate  )
    def test_data_against_ffprobe(self):
-        for wav_file in self.all_files():
+        for wav_file in all_files():
            info = wavinfo.WavInfoReader(wav_file)
            ffprobe_info = ffprobe(wav_file)
            self.assertEqual( info.data.frame_count, int(ffprobe_info['streams'][0]['duration_ts'] ))
    def test_bext_against_ffprobe(self):
-        for wav_file in self.all_files():
+        for wav_file in all_files():
            info = wavinfo.WavInfoReader(wav_file)
            ffprobe_info = ffprobe(wav_file)
            if info.bext:
-                self.assertEqual( info.bext.description, ffprobe_info['format']['tags']['comment']  )
+                if 'comment' in ffprobe_info['format']['tags']:
-                self.assertEqual( info.bext.originator, ffprobe_info['format']['tags']['encoded_by']  )
+                    self.assertEqual( info.bext.description, ffprobe_info['format']['tags']['comment']  )
                else: 
                    self.assertEqual( info.bext.description , '')
                if 'encoded_by' in ffprobe_info['format']['tags']:
                    self.assertEqual( info.bext.originator, ffprobe_info['format']['tags']['encoded_by']  )
                else:
                    self.assertEqual( info.bext.originator, '')
                if 'originator_reference' in ffprobe_info['format']['tags']:
                    self.assertEqual( info.bext.originator_ref, ffprobe_info['format']['tags']['originator_reference']  )
                else:
@@ -95,7 +76,7 @@ class TestWaveInfo(TestCase):
                        'tape': '18Y12M31', 'family_uid': 'USSDVGR1112089007124001008206300'},
                }
-        for wav_file in self.all_files():
+        for wav_file in all_files():
            basename =  os.path.basename(wav_file)
            if basename in expected:
                info = wavinfo.WavInfoReader(wav_file)
@@ -106,3 +87,10 @@ class TestWaveInfo(TestCase):
                self.assertEqual( e['take'], info.ixml.take )
                self.assertEqual( e['tape'], info.ixml.tape )
                self.assertEqual( e['family_uid'], info.ixml.family_uid )
                for track in info.ixml.track_list:
                    self.assertIsNotNone(track.channel_index)
                    if basename == 'A101_4.WAV' and track.channel_index == '1':
                        self.assertTrue(track.name == 'MKH516 A')
--- a/tests/test_zoom_f8.py
+++ b/tests/test_zoom_f8.py
@@ -0,0 +1,12 @@
 import os.path
 import sys
 import json
 import subprocess
 from subprocess import PIPE
 from unittest import TestCase
 import wavinfo
 class TestZoomF8(TestCase):
    pass
--- a/tests/utils.py
+++ b/tests/utils.py
@@ -0,0 +1,39 @@
 import os.path
 import sys
 import subprocess
 from subprocess import PIPE
 import json
 FFPROBE='ffprobe'
 def ffprobe(path):
    arguments = [ FFPROBE , "-of", "json" , "-show_format", "-show_streams", path ]
    if int(sys.version[0]) <  3:
        process = subprocess.Popen(arguments, stdout=PIPE)
        process.wait()
        if process.returncode == 0:
            output = process.communicate()[0]
            if output:
                output_str = output.decode('utf-8')
                return json.loads(output_str)
        else:
            return None
    else: 
        process = subprocess.run(arguments, stdin=None, stdout=PIPE, stderr=PIPE)
        if process.returncode == 0:
            output_str = process.stdout.decode('utf-8')
            return json.loads(output_str)
        else:
            return None
 def all_files():
    for dirpath, _, filenames in os.walk('tests/test_files'):
        for filename in filenames:
            _, ext = os.path.splitext(filename)
            if ext in ['.wav','.WAV']:
                yield os.path.join(dirpath, filename)
--- a/wavinfo/init.py
+++ b/wavinfo/init.py
@@ -10,5 +10,5 @@
 from .wave_reader import WavInfoReader
-__version__ = '1.1'
+__version__ = '1.3'
 __author__ = 'Jamie Hardt'
--- a/wavinfo/rf64_parser.py
+++ b/wavinfo/rf64_parser.py
@@ -0,0 +1,40 @@
 import struct
 from collections import namedtuple
 from . import riff_parser 
 RF64Context = namedtuple('RF64Context','sample_count bigchunk_table')
 def parse_rf64(stream):
    #print("starting parse_rf64")
    start = stream.tell()
    assert( stream.read(4) == b'WAVE' ) 
    ds64_chunk = riff_parser.parse_chunk(stream)
    ds64_field_spec = "<QQQI"
    ds64_fields_size = struct.calcsize(ds64_field_spec)
    assert(ds64_chunk.ident == b'ds64')
    ds64_data = ds64_chunk.read_data(stream)
    assert(len(ds64_data) >= ds64_fields_size )
    #print("Read ds64 chunk: len()",len(ds64_data))
    riff_size, data_size, sample_count, length_lookup_table = struct.unpack( ds64_field_spec , ds64_data[0:ds64_fields_size] )
    bigchunk_table = {}
    chunksize64format = "<4sL"
    chunksize64size = struct.calcsize(chunksize64format)
    #print("Found chunks64s:", length_lookup_table)
    for n in range(length_lookup_table):
        bigname, bigsize = struct.unpack_from( chunksize64format , ds64data, offset= ds64_fields_size ) 
        bigchunk_table[bigname] = bigsize
    bigchunk_table[b'data'] = data_size
    bigchunk_table[b'RF64'] = riff_size
    stream.seek(start, 0)
    #print("returning from parse_rf64, context: ",RF64Context( sample_count=sample_count, bigchunk_table=bigchunk_table ) )
    return RF64Context( sample_count=sample_count, bigchunk_table=bigchunk_table )
--- a/wavinfo/riff_parser.py
+++ b/wavinfo/riff_parser.py
@@ -2,6 +2,7 @@
 import struct
 import pdb
 from collections import namedtuple
 from .rf64_parser import parse_rf64
 class ListChunkDescriptor(namedtuple('ListChunkDescriptor' , 'signature children')):
@@ -18,19 +19,20 @@ class ListChunkDescriptor(namedtuple('ListChunkDescriptor' , 'signature children
                            return chunk
-class ChunkDescriptor(namedtuple('ChunkDescriptor', 'ident start length') ):
+class ChunkDescriptor(namedtuple('ChunkDescriptor', 'ident start length rf64_context') ):
    def read_data(self, from_stream):
        from_stream.seek(self.start)
        return from_stream.read(self.length)
-def parse_list_chunk(stream, length):
+def parse_list_chunk(stream, length, rf64_context =None):
    start = stream.tell()
    signature = stream.read(4)
    #print("Parsing list chunk with siganture: ", signature)
    children = []
    while (stream.tell() - start) < length:
-        child_chunk = parse_chunk(stream)
+        child_chunk = parse_chunk(stream, rf64_context= rf64_context)
        if child_chunk:
             children.append(child_chunk)
        else: 
@@ -38,27 +40,31 @@ def parse_list_chunk(stream, length):
    return ListChunkDescriptor(signature=signature, children=children)
-def parse_chunk(stream):
+def parse_chunk(stream, rf64_context=None):
    #breakpoint()
    ident = stream.read(4)
    if len(ident) != 4:
        return
    sizeb = stream.read(4)
    size  = struct.unpack('<I',sizeb)[0]
-
+    
    if size == 0xFFFFFFFF:
        if rf64_context is None and ident == b'RF64':
            rf64_context = parse_rf64(stream=stream)
        size = rf64_context.bigchunk_table[ident] 
    displacement = size
    if displacement % 2 is not 0:
        displacement = displacement + 1
-    if ident in [b'RIFF',b'LIST']:
+    if ident in [b'RIFF',b'LIST', b'RF64']:
-        return parse_list_chunk(stream=stream, length=size)
+        #print("Parsing list chunk with ident: ", ident)
        return parse_list_chunk(stream=stream, length=size, rf64_context=rf64_context)
    else:
        start = stream.tell()
        stream.seek(displacement,1)
-        return ChunkDescriptor(ident=ident, start=start, length=size)
+        #print("Parsing chunk with start=%i, ident=%s" % (start, ident))
-
+        return ChunkDescriptor(ident=ident, start=start, length=size, rf64_context=rf64_context)
--- a/wavinfo/wave_bext_reader.py
+++ b/wavinfo/wave_bext_reader.py
@@ -2,28 +2,12 @@ import struct
 class WavBextReader:
    def __init__(self,bext_data,encoding):
-        # description[256]
+        """
-
+        Read Broadcast-WAV extended metadata.
-        # originator[32]
+        :param best_data: The bytes-like data.
-        # originatorref[32]
+        "param encoding: The encoding to use when decoding the text fields of the
-        # originatordate[10]   "YYYY:MM:DD"
+                 BEXT metadata scope. According to EBU Rec 3285 this shall be ASCII.
-        # originatortime[8]    "HH:MM:SS"
+        """
        # lowtimeref U32
        # hightimeref U32
        # version U16
        #
        # V1 field
        # umid[64]
        #
        # V2 fields
        # loudnessvalue S16    (in LUFS*100)
        # loudnessrange S16    (in LUFS*100)
        # maxtruepeak   S16    (in dbTB*100)
        # maxmomentaryloudness S16 (LUFS*100)
        # maxshorttermloudness S16 (LUFS*100)
        #
        # reserved[180]
        # codinghistory []
        packstring = "<256s"+ "32s" + "32s" + "10s" + "8s" + "QH" + "64s" + "hhhhh" + "180s"
        rest_starts = struct.calcsize(packstring)
@@ -39,20 +23,38 @@ class WavBextReader:
            decoded = trimmed.decode(encoding)
            return decoded
        #: Description. A free-text field up to 256 characters long.
        self.description     = sanatize_bytes(unpacked[0])
        #: Originator. Usually the name of the encoding application, sometimes
        #: a artist name.
        self.originator      = sanatize_bytes(unpacked[1])
        #: A unique identifer for the file, a serial number.
        self.originator_ref  = sanatize_bytes(unpacked[2])
        #: Date of the recording, in the format YYY-MM-DD
        self.originator_date = sanatize_bytes(unpacked[3])
        #: Time of the recording, in the format HH:MM:SS.
        self.originator_time = sanatize_bytes(unpacked[4])
        #: The sample offset of the start of the file relative to an
        #: epoch, usually midnight the day of the recording. 
        self.time_reference  = unpacked[5]
-        self.version         = unpacked[6]
+        #: A variable-length text field containing a list of processes and
-        self.umid            = None
+        #: and conversions performed on the file.
        self.loudness_value          = None
        self.loudness_range          = None
        self.max_true_peak           = None
        self.max_momentary_loudness  = None
        self.max_shortterm_loudness  = None
        self.coding_history  = sanatize_bytes(bext_data[rest_starts:])
        #: BEXT version. 
        self.version         = unpacked[6]
        #: SMPTE 330M UMID of this audio file, 64 bytes are allocated though the UMID
        #: may only be 32 bytes long.
        self.umid            = None
        #: EBU R128 Integrated loudness, in LUFS.
        self.loudness_value          = None
        #: EBU R128 Loudness rante, in LUFS.
        self.loudness_range          = None
        #: True peak level, in dBFS TP
        self.max_true_peak           = None
        #: EBU R128 Maximum momentary loudness, in LUFS
        self.max_momentary_loudness  = None
        #: EBU R128 Maximum short-term loudness, in LUFS.
        self.max_shortterm_loudness  = None
        if self.version > 0:
            self.umid = unpacked[7]
--- a/wavinfo/wave_info_reader.py
+++ b/wavinfo/wave_info_reader.py
@@ -14,19 +14,42 @@ class WavInfoChunkReader:
        self.info_chunk  = next((chunk for chunk in list_chunks \
                if chunk.signature == b'INFO'), None)
-
+        
        #: 'ICOP' Copyright
        self.copyright      = self._get_field(f,b'ICOP')
        #: 'IPRD' Product
        self.product        = self._get_field(f,b'IPRD')
        #: 'IGNR' Genre
        self.genre          = self._get_field(f,b'IGNR')
        #: 'ISBJ' Supject
        self.subject        = self._get_field(f,b'ISBJ')
        #: 'IART' Artist, composer, author
        self.artist         = self._get_field(f,b'IART')
        #: 'ICMT' Comment
        self.comment        = self._get_field(f,b'ICMT')
        #: 'ISFT' Software, encoding application
        self.software       = self._get_field(f,b'ISFT')
        #: 'ICRD' Created date
        self.created_date   = self._get_field(f,b'ICRD')
        #: 'IENG' Engineer
        self.engineer       = self._get_field(f,b'IENG')
        #: 'ITCH' Technician
        self.technician     = self._get_field(f,b'ITCH')
        #: 'IKEY' Keywords, keyword list
        self.keywords       = self._get_field(f,b'IKEY')
        #: 'INAM' Name, title
        self.title          = self._get_field(f,b'INAM')
        #: 'ISRC' Source
        self.source         = self._get_field(f,b'ISRC')
        #: 'TAPE' Tape
        self.tape           = self._get_field(f,b'TAPE')
        #: 'IARL' Archival Location
        self.archival_location = self._get_field(f,b'IARL')
        #: 'ISFT' Software
        self.software       = self._get_field(f,b'ISFT')
        #: 'ICSM' Commissioned
        self.commissioned   = self._get_field(f,b'ICMS')
    def _get_field(self, f, field_ident):
@@ -43,6 +66,9 @@ class WavInfoChunkReader:
    def to_dict(self):
        """
        A dictionary with all of the key/values read from the INFO scope.
        """
        return {'copyright':    self.copyright,
                'product':  self.product,
                'genre':    self.genre,
@@ -54,7 +80,12 @@ class WavInfoChunkReader:
                'keywords': self.keywords,
                'title':    self.title,
                'source':   self.source,
-                'tape':     self.tape
+                'tape':     self.tape,
                'commissioned': self.commissioned,
                'software': self.software,
                'archival_location':self.archival_location,
                'subject':  self.subject,
                'technician':self.technician
                }
--- a/wavinfo/wave_ixml_reader.py
+++ b/wavinfo/wave_ixml_reader.py
@@ -1,42 +1,97 @@
-import xml.etree.ElementTree as ET
+#import xml.etree.ElementTree as ET
 from lxml import etree as ET
 import io
 from collections import namedtuple
 IXMLTrack = namedtuple('IXMLTrack', ['channel_index', 'interleave_index', 'name', 'function'])
 class WavIXMLFormat:
    """
-    iXML recorder metadata, as defined by iXML 2.0
+    iXML recorder metadata.
    """
    def __init__(self, xml):
        """
        Parse iXML.
        :param xml: A bytes-like object containing the iXML payload.
        """
        self.source = xml
        xmlBytes = io.BytesIO(xml)
-        self.parsed = ET.parse(xmlBytes)
+        try:
            parser = ET.XMLParser(recover=True)
            self.parsed = ET.parse(xmlBytes, parser=parser)
        except ET.ParseError as err:
            print("Error parsing iXML: " + str(err))
            decoded = xml.decode(encoding='utf_8_sig')
            print(decoded)
            self.parsed = ET.parse(io.StringIO(decoded))
    def _get_text_value(self, xpath):
        e = self.parsed.find("./" + xpath)
        if e is not None:
            return e.text
    @property
    def raw_xml(self):
        """
        The root entity of the iXML document.
        """
        return self.parsed
    @property
    def track_list(self):
        """
        A description of each track.
        :return: An Iterator
        """
        for track in self.parsed.find("./TRACK_LIST").iter():
            if track.tag == 'TRACK':
                yield IXMLTrack(channel_index=track.xpath('string(CHANNEL_INDEX/text())'),
                                interleave_index=track.xpath('string(INTERLEAVE_INDEX/text())'),
                                name=track.xpath('string(NAME/text())'),
                                function=track.xpath('string(FUNCTION/text())'))
    @property
    def project(self):
        """
        The project/film name entered for the recording.
        """
        return self._get_text_value("PROJECT")
    @property
    def scene(self):
        """
        Scene/slate.
        """
        return self._get_text_value("SCENE")
    @property
    def take(self):
        """
        Take number.
        """
        return self._get_text_value("TAKE")
    @property
    def tape(self):
        """
        Tape name.
        """
        return self._get_text_value("TAPE")
    @property
    def family_uid(self):
        """
        The globally-unique ID for this file family. This may be in the format
        of a GUID, or an EBU Rec 9 source identifier, or some other dumb number.
        """
        return self._get_text_value("FILE_SET/FAMILY_UID")
    @property
    def family_name(self):
        """
        The name of this file's file family.
        """
        return self._get_text_value("FILE_SET/FAMILY_NAME")
--- a/wavinfo/wave_reader.py
+++ b/wavinfo/wave_reader.py
@@ -1,8 +1,14 @@
 #-*- coding: utf-8 -*-
 import struct
-
+import os
 import sys
 from collections import namedtuple
 if sys.version[0] == '3':
    import pathlib
 else:
    import urlparse, urllib
 from .riff_parser import parse_chunk, ChunkDescriptor, ListChunkDescriptor
 from .wave_ixml_reader import WavIXMLFormat
 from .wave_bext_reader import WavBextReader
@@ -14,7 +20,6 @@ WavDataDescriptor = namedtuple('WavDataDescriptor','byte_count frame_count')
 #: The format of the audio samples.
 WavAudioFormat = namedtuple('WavAudioFormat','audio_format channel_count sample_rate byte_rate block_align bits_per_sample')
 class WavInfoReader():
    """
    Parse a WAV audio file for metadata.
@@ -32,8 +37,14 @@ class WavInfoReader():
        :param bext_encoding: The text encoding to use when decoding the string
          fields of the Broadcast-WAV extension. Per EBU 3285 this is ASCII
          but this parameter is available to you if you encounter a werido.
        """
        absolute_path = os.path.abspath(path)
        if sys.version[0] == '3':
            #: `file://` url for the file.
            self.url = pathlib.Path(absolute_path).as_uri()
        else:
            self.url = urlparse.urljoin('file:', urllib.pathname2url(absolute_path))
        with open(path, 'rb') as f:
            chunks = parse_chunk(f)
@@ -125,7 +136,7 @@ class WavInfoReader():
        if ixml_data is None:
            return None
-        ixml_string = ixml_data
+        ixml_string = ixml_data.rstrip(b'\0')
        return WavIXMLFormat(ixml_string)
    def walk(self):
@@ -133,7 +144,7 @@ class WavInfoReader():
        Walk all of the available metadata fields.
        :yields: a string, the :scope: of the metadatum, the string :name: of the
-        metadata field, and the value
+        metadata field, and the value.
        """
        scopes = ('fmt','data')#,'bext','ixml','info')
Author	SHA1	Message	Date
Jamie Hardt	4dfc1ab33c	Added iXML track list parsing	2019-08-19 11:39:13 -07:00
Jamie Hardt	4770c781b2	Update README.md	2019-06-29 21:55:19 -07:00
Jamie Hardt	45c2aae640	Update wave_ixml_reader.py	2019-06-29 21:50:20 -07:00
Jamie Hardt	792e5505b6	Update wave_info_reader.py	2019-06-25 20:28:40 -07:00
Jamie Hardt	8f575c5131	Update wave_info_reader.py Added some more fields	2019-06-25 20:24:29 -07:00
Jamie Hardt	128dffef4e	Update README.md	2019-06-25 15:05:26 -07:00
Jamie Hardt	ae1594d496	Update test_wave_parsing.py	2019-06-25 14:47:34 -07:00
Jamie Hardt	b5713479cd	Update .travis.yml	2019-06-25 14:37:49 -07:00
Jamie Hardt	f1be6852b6	Update setup.py for v1.2	2019-06-25 13:23:10 -07:00
Jamie Hardt	242fa51b32	Update README.md	2019-06-25 13:20:22 -07:00
Jamie Hardt	177e913c86	Update test_wave_parsing.py	2019-06-25 13:17:18 -07:00
Jamie Hardt	1827b46e34	RF64 implementation	2019-06-25 13:10:26 -07:00
Jamie Hardt	4b6407c1f4	Update .travis.yml Addind 3.7	2019-06-25 11:15:05 -07:00
Jamie Hardt	99bfa99b3b	Renamed Sequoia file	2019-06-25 11:07:55 -07:00
Jamie Hardt	b7aeccacf5	Reorganized some test code	2019-06-25 11:00:07 -07:00
Jamie Hardt	9d5f8899d5	RF64 Test files Adding RF64 Test files, zipped	2019-06-25 10:09:46 -07:00
Jamie Hardt	0ce18d9f13	Update .travis.yml	2019-06-23 22:34:45 -07:00
Jamie Hardt	3f3fbc7632	Update setup.py Added dependency to setup	2019-06-23 22:27:44 -07:00
Jamie Hardt	809ff71bb4	Update wave_ixml_reader.py Switched XML parser from xml.etree to lxml with recovery	2019-06-23 22:23:28 -07:00
Jamie Hardt	a6ed0152db	Update wave_ixml_reader.py Added a brief report and suppressing the iXML exception for now	2019-06-23 20:43:59 -07:00
Jamie Hardt	526bc0c29c	Create Testfile_PYR_BWF.wav A test file from pyramix	2019-06-23 20:30:01 -07:00
Jamie Hardt	d3c8207a33	Update wave_reader.py Strip trailing nulls. Relates to #1	2019-02-14 09:44:49 -08:00
Jamie Hardt	185c810e7c	Create test_zoom_f8.py Stub test	2019-02-14 09:44:16 -08:00
Jamie Hardt	ebc224d5d2	Added Zoom F8 test cases from #1	2019-02-14 09:31:47 -08:00
Jamie Hardt	976d5844e2	Fixed version comparison	2019-01-05 22:06:17 -08:00
Jamie Hardt	f1fb83f208	Update wave_reader.py Fixed file URL code for version 2.7	2019-01-05 22:03:16 -08:00
Jamie Hardt	8d0d51b7fa	Update wave_reader.py Comment	2019-01-05 21:52:05 -08:00
Jamie Hardt	188ba855b2	Update wave_reader.py Added `url` property	2019-01-05 21:49:58 -08:00
Jamie Hardt	2f1511d935	Update .travis.yml	2019-01-05 12:53:09 -08:00
Jamie Hardt	8c8fdc1bb3	Update .travis.yml	2019-01-05 12:48:11 -08:00
Jamie Hardt	8564fd1fd8	Update README.md Codecov badge replaces coveralls	2019-01-05 12:46:16 -08:00
Jamie Hardt	77aadd563f	Update .travis.yml	2019-01-05 12:40:30 -08:00
Jamie Hardt	c2ddee8f6a	Merge branch 'master' of https://github.com/iluvcapra/wavinfo	2019-01-05 12:37:32 -08:00
Jamie Hardt	dc08bd39fe	Create .coveragerc	2019-01-05 12:37:30 -08:00
Jamie Hardt	12de7b078f	Update test_wave_parsing.py	2019-01-05 12:34:54 -08:00
Jamie Hardt	7992640fb8	Update .travis.yml	2019-01-05 12:32:47 -08:00
Jamie Hardt	88da763ca1	Update .travis.yml CODECOV_TOKEN	2019-01-05 12:29:12 -08:00
Jamie Hardt	1e0f31a794	Update test_wave_parsing.py Style	2019-01-05 12:25:34 -08:00
Jamie Hardt	0e1094421e	Update .travis.yml Removing 2.7 for now	2019-01-05 10:28:23 -08:00
Jamie Hardt	4f0c26f5ca	Update test_wave_parsing.py 2.7 test support	2019-01-05 10:27:29 -08:00
Jamie Hardt	71f0aed5ff	Update conf.py Bumped version	2019-01-05 09:52:13 -08:00
Jamie Hardt	f8feec8119	Update setup.py Changed descrption to sell the module a little better	2019-01-05 09:51:39 -08:00
Jamie Hardt	16bc4b016c	Adding 2.7 to tests	2019-01-05 09:51:25 -08:00
Jamie Hardt	83bf656ad3	Documentation	2019-01-04 19:58:03 -08:00
`@@ -1,2 +1,3 @@`
	`from . import test_wave_parsing`	`from . import test_wave_parsing`