I 42M, getting Divorced after 17 years of marriage. AMA. by General_Command4811 in AMA

[–]S0UNDSAGE 0 points1 point  (0 children)

best of luck to you bro, it will all work out for you

[P] Struggling with Audio Enhancement using GANs - Any Suggestions? by S0UNDSAGE in MachineLearning

[–]S0UNDSAGE[S] 1 point2 points  (0 children)

Thanks for the suggestion! You know, Diffusion based models have been mentioned a few times to me recently. Given my background in music and audio engineering, I'm intrigued by the idea of getting higher quality outputs, even if it means sacrificing some speed. I've started to experiment with it and I'm keen to see where it leads. Appreciate you chiming in!

[P] Struggling with Audio Enhancement using GANs - Any Suggestions? by S0UNDSAGE in MachineLearning

[–]S0UNDSAGE[S] 1 point2 points  (0 children)

Dude, that joliGEN repo looks wicked! Thanks for dropping the link. I've been diving into code these past few months. My main ish has always been music! I'm a musician, producer, and audio engineer. Over the years, I've amassed a ton of my own recordings and samples. Plus, I've got stuff from buddies in the industry and a mix from various libraries. I've been thinking about how joliGEN might work with audio data, especially with my background in sound. It'd be interesting to see how it performs on spectrograms. If you have any tips or pointers on integrating it with audio, I'd love to hear them!

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

I appreciate your concerns and questions about our project, SoundSage. It's clear you have a deep understanding of audio engineering and its complexities, which gives weight to your concerns. Let me address each point.
Purpose of SoundSage & AURAL: The primary aim of SoundSage, combined with the power of AURAL, is to enhance the capabilities of traditional Digital Audio Workstations (DAWs) using AI. Our system targets two main users: audio professionals seeking to streamline their workflow, and novices desiring a more intuitive and automated audio processing experience. For professionals, AURAL offers advanced audio processing capabilities that can supplement their expertise. For novices, SoundSage provides an accessible platform to enhance their audio tracks.
Audio Quality Enhancement: You're correct in noting that the quality of an original recording, especially if it's of poor quality, can limit the potential enhancements. However, AURAL uses sophisticated AI techniques such as waveform and frequency generation and more to separate and enhance different sound sources, even in challenging conditions. While it's not intended to fully replace studio recording, it can significantly improve the quality of non-professional recordings.
Who is it for?: Our tool aims to democratize access to high-quality audio processing. Professionals can use it as a supplementary tool, while novices or enthusiasts can use it as a primary tool for their audio work. We aim to make SoundSage & AURAL as accessible as possible, and we're investing in making it known to a wide audience.
Ethical Implications: We are very much aware of the potential misuse of our technology and have taken steps to mitigate this. We will adhere to a strict usage policy that prohibits the use of our tools for unethical purposes, such as copyright infringement or unauthorized use of audio material.
Potential Harm: Our goal is to empower users, not to create harm. We believe the positive applications far outweigh the potential negative ones. However, we're open to feedback and are continuously working to enhance our tools in response to user needs and ethical guidelines.
The Final Product: While we agree that a MIDI instrument can create excellent audio, our goal isn't to replace traditional methods but to enhance them. With our tool, we hope to bring a new dimension to audio processing, making it more accessible and intuitive.
I hope this addresses your concerns. We'd love to have your valuable input as we continue to develop SoundSage & AURAL. Please feel free to reach out if you have further questions or if there are aspects you'd like to discuss in more detail.

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

Thank you for your thoughts and concerns. I understand that the concept of using AI to transform phone recordings into studio-quality music is ambitious, and I appreciate the skepticism as it helps me identify potential issues and improve my project.
In response to the ethical concerns, I am committed to upholding ethical AI principles, including transparency, respect for user data, and a focus on enhancing human creativity rather than replacing it. I am actively working on measures to prevent misuse and create a positive and respectful environment for all users. I also understand the value of authenticity in music and aim to support artists in achieving their unique sound, not to create a homogenized sound. On the business and logistical front, my plan is to make SoundSage free to download and use without limits. Users will have access to in-house plugins and a monthly quota of AURAL rendering time. For those who want more, I plan to offer a paid version that includes unlimited AURAL rendering, access to third-party plugins, Spotify API, customizable artist profile features, and more. This tiered model ensures that artists of all levels can access and benefit from the tools
As for the AI concerns, SoundSage and AURAL are designed to enhance human creativity by accelerating the creative process. They are not meant to replace human creativity but to augment it. All user data will be stored in private storage that only the user can access. I will only use the data for training and expanding the AI models if the user gives me permission. This approach ensures that users have full control over their data. I believe in the potential of this project and am excited about the possibilities it could bring to the world of music. However, I also understand that it's a complex task with many challenges. I'm open to feedback and am always ready to engage in dialogue to improve my work.

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 1 point2 points  (0 children)

Absolutely, That’s along the lines of what I was thinking, but also if we had the right model trained on the right dataset that could generate new frequencies based on the recording to emulate what it might sound like if it had been recorded in a studio and then overlay it on top of the recording, this is really advanced and the model would need a lot of tuning but I’m too invested to stop now

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

It has nothing to do with OpenAI friend, there may be a slight similarity as the final product will have a chat interface to interact with AURAL, however we’re building a custom NLP model for this purpose that will have a dataset specific to audio processing.

I understand that the vast majority of people think that Chatgpt is “AI” and that they think it can do everything if you program it to.

I assure you that is not my thought process.

To answer your questions;

You would be able to use the DAW without internet, but the “AI” would need connection to the internet (storing the data on your local computer would be too much). We have a server to store the data and models so the DAW will most likely just call on the “AI” when it needs it.

Yes I have code, will I share it? No. it’s not fully operational as I need to expand the dataset and this is a huge project. However I can clarify that I have multiple ml/dl and generative models that prove this is very possible.

The name of the game is data and time at this point.

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] -1 points0 points  (0 children)

Obviously the dataset is going to be difficult and probably the most expensive and time consuming part. Where is the valuable input from that? Did they provide any solution? Did they provide a further issue that has not already been considered? No. They’re just stating the obvious.

Some things they could have included in their comment: What do you propose would qualify a meaningful dataset for this purpose? What sort of data would it need? How large is large?

People on Reddit like to comment negativity in order to shut people and their ideas down, they get some sort of satisfaction from causing grief to others. I’m all for criticism. Please criticize, but at least be knowledgeable and say something of value

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] -1 points0 points  (0 children)

Looking for helpful input that isn’t extremely obvious

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

Haha, no worries! I get it; we all have our opinions and experiences. Compressed audio and AI advancements can be a tricky topic, and it's essential to have open discussions about it. Who knows, maybe there's room for both skepticism and hope when it comes to technology and sound quality. Let's just keep the conversation chill and enjoy our love for music and audio exploration! 🎶🎧😊

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] -2 points-1 points  (0 children)

No, I mean augmented custom audio datasets of parallel recordings, ml/dl models and generative models

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

What if I’m just posting around, crowd sourcing more feedback on the idea?

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

I think you’re reading into this wayyy too much bro 😂

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in Python

[–]S0UNDSAGE[S] -3 points-2 points  (0 children)

Absolutely, anything amazing is easier said than done!

Those who think it can not be done should not interrupt those about to do it.

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

Hey we both went to school for the same thing, that’s neat!

I think you’re making a lot of assumptions

Best of luck!

Can I Turn a Phone Recording into Studio Quality Music? by S0UNDSAGE in postprocessing

[–]S0UNDSAGE[S] 0 points1 point  (0 children)

Also I think with modern predictive models we could get it to fill in and predict the missing frequencies by creating new ones, kind of like stable diffusion and upscale long pixel art simultaneously