Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. Does such a software exist?

Does such a software exist?

Scheduled Pinned Locked Moved The Lounge
question
26 Posts 14 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • A Amarnath S

    Surely there should be something invariant between these two utterances of 'rainy'.

    C Offline
    C Offline
    CodeWomble
    wrote on last edited by
    #21

    Yes and no. The similarities can be washed out by the noise in the sentence, particularly with all of the different accents/varieties of English that are spoken around the world. Also, just identifying word endings can be tricky. Picking out words from "rainy days" can give "rainy days", "ray knee days" or "ray need as'" without changing the sound.

    1 Reply Last reply
    0
    • A Amarnath S

      If I had one file, I wouldn't have asked. I have at least 100 files, and have heard them once at least, and would like to find whether certain keywords occur in them. Something like batch-processing is what I an looking for. That's why I asked.

      L Offline
      L Offline
      Lost User
      wrote on last edited by
      #22

      Run the files through a (batch) speech recognition program and dump the whole thing to a text file. You can (re)search the text without having to go back to the mp3. Looking at a file in its entirety will help in deciding how to "train" the (ML) process. [Transcribe your recordings](https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57)

      "Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I

      1 Reply Last reply
      0
      • R Rage

        Here[^], but it is an API, not a finished software. Don't you have a trainee, or a teenager lying somewhere doing teenaging stuff ? :-D

        Do not escape reality : improve reality !

        R Offline
        R Offline
        r_hyde
        wrote on last edited by
        #23

        The answer in that link leads to the VOSK project[^], which in turn has a page that lists several projects that integrate the VOSK toolkit. One of those projects is pretty much exactly what the OP is asking for: mp4grep[^]. Looks like it's Linux-only, and using it is a two-step process unless you happen to be starting with a 16khz wav file, but it appears to automate the process of using speech recognition to generate a timestamped transcript, and then grep'ing the result to find the word or expression of interest. I didn't actually try to use it, so I can't vouch for its quality, but it looks pretty neat if it works!

        1 Reply Last reply
        0
        • A Amarnath S

          I have an MP3 file. It is an hour-long lecture on a topic about weather. I would like to find out ('seek') the exact time(s) when the word 'rainy' gets played in this MP3 file, without listening to the entire hour-long lecture. Similar to finding a substring in a long string, but finding an audio clip within an audio file. Does such a software functionality exist? My Google search didn't yield much.

          L Offline
          L Offline
          Lost User
          wrote on last edited by
          #24

          Hmmm, Did you find a solution for this? [Whisper](https://openai.com/blog/whisper/) was released just yesterday. [GitHub - OpenAI/whisper](https://github.com/openai/whisper) Looks really easy to use.

          A 1 Reply Last reply
          0
          • L Lost User

            Hmmm, Did you find a solution for this? [Whisper](https://openai.com/blog/whisper/) was released just yesterday. [GitHub - OpenAI/whisper](https://github.com/openai/whisper) Looks really easy to use.

            A Offline
            A Offline
            Amarnath S
            wrote on last edited by
            #25

            Thanks a lot. Will download and take a look.

            L 1 Reply Last reply
            0
            • A Amarnath S

              Thanks a lot. Will download and take a look.

              L Offline
              L Offline
              Lost User
              wrote on last edited by
              #26

              You are welcome. :)

              1 Reply Last reply
              0
              Reply
              • Reply as topic
              Log in to reply
              • Oldest to Newest
              • Newest to Oldest
              • Most Votes


              • Login

              • Don't have an account? Register

              • Login or register to search.
              • First post
                Last post
              0
              • Categories
              • Recent
              • Tags
              • Popular
              • World
              • Users
              • Groups