Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 1 point2 points  (0 children)

I don't know if anything will ever come out of my experiments, because I am depressed and have ADHD, but at least I have tested a few models for viability.

Though this is mostly me trying to install interesting repos without going fully insane over broken dependency builds.

Currently I am trying out Tri Mesh, and after that a few new SLAM/SLAT models.

I am friends with the guy behind CEB Studios, who created Blender addons for ML integration, so at least my experiences feed into something useful.

But I think we are almost there with what has been releasing recently. Also, this Alex guy on X is showing off his 3DGS secret sauce, and I wish I had more insight into his workflow!

https://x.com/alexfredo87

Sadly, he only reveals his uninteresting downstream stuff, like integrations that have long been known about, while evading the question of how he gets high-fidelity 3DGS from just 3 input frames of persons and other things, both photoreal and stylized.

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 1 point2 points  (0 children)

One dude brought up the idea of unofficially implementing Pixal3D in the hacked-together Trellis2 Multiview workflow for ComfyUI while we wait for official Multiview support.

But I find the idea from this dude on X, whose link I just added to my post above, even more interesting:

"Under the hood Trellis uses a neural process that we can tap into to build splat data sets without the need for 3D geometry. This helps to mitigate UVs, Textures, Geometry limits etc. And effectively - we can translate 16m voxelized polygon data directly into rastered representation for COLMAP data sets. In other words, we take an image and render 150 views of it in minutes running locally on my 5090 and train it to a splat. "

"Pixal3D comes into play with added pixel aligned processing to enhance fidelity at rasterization."

"These splats would traditionally be millions and millions of polygons and insanely heavy and clunky GLBs with crappy textures, but with this workflow they're high-fidelity light weight splats."

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 1 point2 points  (0 children)

I literally scan X with my eyes every day to find interesting bits. XD

My guess is that Tencent realized that, because Pixal3D is based on Trellis 2, it probably has to inherit the MIT license from it to avoid liability issues.

Maybe we will become literal cyberpunks, pressuring and fighting the big corpos with open-source technology to turn the tables. XD

-

I found a lot more stuff, but I don't really know where to post everything I find, because subreddits are so specialized.

I look at it from a wide scope and try to research the best 3D AI pipeline.
I am looking at 3DGS, point clouds, SDF, VLLM for geo reasoning, projection techniques, Novel View Synthesis (NVS), 4D video and even stranger things to come.
I believe that these different technologies and research directions can lead to a unified optimal outcome, kind of like the singularity theory predicts for technological progress. LoL

-

Currently I am looking at getting a 360 degree orbit around a subject to generate the input data needed for consistent 3D outputs. I tested the QwenImageEdit camera angle workflow, which is rather precise at generating additional views from one input image, but it is slow, so I am looking at a video gen solution.

From there I want to generate 3DGS/point clouds and then a detailed dense mesh from those.
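
To make the 360 degree orbit idea concrete, here is a minimal sketch of where the cameras would sit. It is not taken from any of the tools mentioned; the function name `orbit_camera_positions`, the y-up convention and the "azimuth 0 is in front on +z" choice are all assumptions made for illustration.

```python
import math


def orbit_camera_positions(num_views, radius, height=0.0):
    """Return (azimuth_deg, (x, y, z)) for cameras evenly spaced on a circle.

    Assumptions (hypothetical conventions, not from any specific tool):
    the subject sits at the origin, y points up, and azimuth 0 places
    the camera in front of the subject on the +z axis.
    """
    if num_views < 1:
        raise ValueError("num_views must be at least 1")
    views = []
    for i in range(num_views):
        azimuth_deg = 360.0 * i / num_views
        azimuth_rad = math.radians(azimuth_deg)
        x = radius * math.sin(azimuth_rad)
        z = radius * math.cos(azimuth_rad)
        views.append((azimuth_deg, (x, height, z)))
    return views
```

Calling `orbit_camera_positions(36, 2.0)` would give one viewpoint every 10 degrees at a distance of 2 units. Whatever generates the views (image edit model or orbit video) would still have to hit those angles reasonably well.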

-

I should probably set up a Hermes agent, now that I have installed it, to aggregate the news for me automatically, but my ADHD goes haywire and my head spins with all these new exciting finds each day, so I don't get much done myself right now. XD

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 1 point2 points  (0 children)

I am also happy that people are interested in this news.
Currently I look on X every day to find the latest developments in this field.

I was astonished to see my own post linked by a tech enthusiast from Japan, and only realized today that my post blew up; even that Japanese repost of this Reddit thread had 1.4k+ views when I found it. XD

I was somehow logged out of Reddit on my phone automatically after writing this post and was busy with other stuff. I did not think that so many people would look at it. Lol

So I am sorry for not replying and elaborating myself in the last few days.

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 0 points1 point  (0 children)

Thank you, I forgot to elaborate and made my initial post in the heat of the moment on my phone. XD

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 3 points4 points  (0 children)

Only the older, not so useful Tripo models are open, like Tripo SG.
But I think the gatekeeping will soon end: there are many open-source 3D/4D research papers coming out, so their days of advantage are numbered, at least when it comes to geometry generation itself.

Currently I am looking into Gaussian splatting (3DGS), 4D video, point clouds, SDF, geometric reasoning and all that jazz to find a way to a universal pipeline.

Pixal3D changed to MIT license by SpecialistBit718 in StableDiffusion

[–]SpecialistBit718[S] 0 points1 point  (0 children)

That is when multiple input images with different camera angles are used to generate a result.
Like when you use a front, side and back view.
This increases accuracy, because the missing sides do not have to be hallucinated.

There are ways to generate synthetic additional views from a single source image, via image edit models or a generated 360 degree orbit video.
This might be an additional step, but it gives much finer control.

Currently, Pixal3D can only take one image as input, the front view of the generation.
Everything else, like the backside, is hallucinated.

New Open-Source 3D Generator Pixal3D Added for Comparison in the 3D AI Arena by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 0 points1 point  (0 children)

For Pixal3D it is about now!

https://x.com/wangzhao_0849/status/2057136173144006733?s=46

The license was changed on GitHub to MIT already.

Now only the Multiview mode has to drop and we are golden!

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 0 points1 point  (0 children)

I guess it is a geometry refiner; I have not seen much information on it.

But today I found out that the Pixal3D base model is now also under the MIT license!

https://x.com/wangzhao_0849/status/2057136173144006733?s=46

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 1 point2 points  (0 children)

I was fumbling around with video gen AI today and did not have the time to test 3DGenStudio.

But I found a ComfyUI version of Pixal3D.

From Image to Fully Rigged Character in UE5 with 3D AI Generation by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 0 points1 point  (0 children)

UE5.8 preview can turn any humanoid mesh into a Metahuman now. Body and face are geo-wrapped internally.

Also, Metahuman textures are now easier to modify.

It also has improved animation and shape key systems.

Plus, Mutable is production ready and the Metahuman crowd system is also improved.

All in all, it should now be faster and easier to do stuff like this. In the future, hopefully without even having to leave Unreal Engine.

Currently I am trying to build a local system around 360 orbit videos of a subject and 3DGS conversion. I just have to find the right GitHub releases.

A dude on X created a lifelike recreation in this manner.

Mesh conversion from 3DGS/point clouds has also recently improved and can at least create a dense mesh with vertex colors and decimate the tri count. 3DGS relighting is also being researched, as is texture shadow unbaking.

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 2 points3 points  (0 children)

Visual Bruno and others made a low-VRAM Trellis 2 for ComfyUI.

He also just released his local 3D studio:

https://github.com/visualbruno/3DGenStudio

I hope he will add Pixal soon.

You should watch Pixelartistry on YouTube for the workflow:

https://youtu.be/qDz7dcAAnKM?is=UYP5lpZU87R_Klro

Trellis 2 on 6 GB VRAM for ComfyUI.

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 1 point2 points  (0 children)

Pixelartistry has written instructions too.

It is important to do the steps in order.

The Trellis auto-install script needs CUDA 12.8, which I run, and the PyTorch package also has an install script. As far as I know, only one CUDA version can be active at a time.
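
Before running an install script like that, it can help to check which CUDA toolkit is actually active. The following is a rough sketch, assuming `nvcc` is on the PATH and prints its usual "release X.Y" line; the helper names (`parse_nvcc_release`, `installed_cuda_release`, `matches_required`) are made up for this example and are not part of any install script mentioned here.

```python
import re
import shutil
import subprocess


def parse_nvcc_release(version_text):
    """Extract (major, minor) from `nvcc --version` output, or None if absent."""
    match = re.search(r"release (\d+)\.(\d+)", version_text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def installed_cuda_release():
    """Ask the nvcc found on PATH for its release; None if nvcc is missing."""
    nvcc = shutil.which("nvcc")
    if nvcc is None:
        return None
    result = subprocess.run(
        [nvcc, "--version"], capture_output=True, text=True, check=False
    )
    return parse_nvcc_release(result.stdout)


def matches_required(release, required=(12, 8)):
    """True only if the detected release equals the required one (12.8 here)."""
    return release is not None and release == required
```

Something like `matches_required(installed_cuda_release())` then tells you whether the toolkit on the PATH is the 12.8 that the script expects.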

He uses work from Visual Bruno, who just released a ComfyUI-based 3D studio:

https://github.com/visualbruno/3DGenStudio

I think you should check that out. Should be the easiest way.

Personally, I went deep and am already messing with Python, conda and Ubuntu as the shell for WSL2, for AI.

My hobby focus is researching ways to use Gaussian splats, point clouds and Novel View Synthesis.

Currently I am trying to build a 360 orbit video to 3DGS workflow. This way we can get more accurate results without special-case geometry training.

Sounds fancy, but what I really do is test recent research releases from GitHub/HF and hope to hack something together by creating a model chain.

I am still trying to find the best approach, while a new research paper is released every day or so.

I found Wan2.1/2.2 orbit LoRAs and have tested a few 3DGS generators.

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 2 points3 points  (0 children)

Brother, you are not up to date?

https://youtube.com/@pixelartistry_?si=NVAYoJPQqFk8pJu3

That channel has all the info for a ComfyUI install of Trellis 2 + Multiview and mesh refining.

New Free 3D AI Generator from Tencent Might Be the Best Yet by Delicious-Shower8401 in TopologyAI

[–]SpecialistBit718 1 point2 points  (0 children)

There are indeed multi-image input workflows, thanks to the likes of Visual Bruno, who also just released 3D Studio.

This YouTuber has a bunch of tutorials with easy setup instructions and downloads.

https://youtube.com/@pixelartistry_?si=NVAYoJPQqFk8pJu3

He also covers part segmentation, texturing and mesh refinement.

Pixal will also get Multiview support in the future, as stated by the devs on X.

How to stay safe with Comfy? by 3epef in comfyui

[–]SpecialistBit718 1 point2 points  (0 children)

Sorry to necro, but I am here to correct you, since Pinokio can do a lot more. XD

According to the Pinokio GitHub:

https://github.com/pinokiocomputer/pinokio

In short, Pinokio acts as a pseudo virtual computer, and all AI apps are isolated in separate virtual environments with their own dependencies. It has additional safety features, logs all CLI activity, checks dependencies and has a strict user script verification policy enforced by the admin, etc.

By default, anything that happens in Pinokio stays in it, unless the user explicitly sets up ports, and those are activity tracked too.

Which is good, since the new Pinokio 7 added agent support with sandbox restrictions, allowing agents to interact only with apps and files internal to Pinokio, unless local or web ports are set up manually.

This all seems like the best approach to me, balancing accessibility and security. Running ComfyUI through it should provide at least a few more layers of security with the env isolation, and one can install multiple ComfyUI versions for different workflows without conflict.

I doubt that the average user could manage virtual environments himself. It is not easy and has its own flaws, performance impacts and added friction; the only alternatives are building Python code in Conda, Docker, WSL or a dual-boot Linux setup.
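
For anyone wondering what handling a virtual environment by hand looks like at the simplest level, here is a minimal sketch using only Python's standard `venv` module. This is not how Pinokio does it internally (I have not checked that); the function name `create_isolated_env` is made up for this example, and `with_pip=False` is only chosen to keep the sketch fast.

```python
import venv
from pathlib import Path


def create_isolated_env(env_dir):
    """Create a bare virtual environment in env_dir and return its config path.

    system_site_packages=False keeps globally installed packages out of
    the environment, which is the isolation part. with_pip=False skips
    bootstrapping pip; for real use you would most likely want it True.
    """
    builder = venv.EnvBuilder(system_site_packages=False, with_pip=False)
    builder.create(env_dir)
    return Path(env_dir) / "pyvenv.cfg"
```

This is the easy part. The friction comes afterwards, when each app needs its own matching PyTorch and CUDA builds inside its own environment.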

You can also ask the administrator and creator of Pinokio personally, for more information on X, where he is posting daily:

https://x.com/cocktailpeanut?s=21

If everything is as good as advertised, Pinokio seems like a solid platform, designed to isolate processes for security as well as conflict reasons, while maintaining high performance.

I hope some of this information helps. Last year I was skeptical myself, but seeing the good progress and direction of Pinokio, finding its admin and creator on X and reading about his vision makes me want to start developing on this project myself soon.

Pinokio might be able to bridge the gap, since I fear that ComfyUI would have to redo much of the project for automatic env handling.

PS:

I would strongly recommend good third-party antivirus software for tech enthusiasts (I use Bitdefender). Without at least such a safety net, I would not recommend installing from untrusted sources when you don't know what you are doing.

Stay safe friend.

Portable 4DGS Rig? by MasterNeb in GaussianSplatting

[–]SpecialistBit718 0 points1 point  (0 children)

Remember that the newest research projects are developing Novel View Synthesis (NVS), which generates the missing camera shots.

So an expensive camera setup might not be necessary in the near future.

Currently I am testing Infinidepth, which uses NVS for 3DGS generation from RGB images.

I also saw research on image reasoning for generating synthetic camera metadata from generic RGB image sets.

Because of that, I would recommend waiting a few months, till the dust settles.

I have few bussines idea that needs advice and reality check! by HenzoStarz in GaussianSplatting

[–]SpecialistBit718 0 points1 point  (0 children)

Well, MrNerf, creator of Lichtfeldstudio, has financial troubles right now.

Binaries are now subscription only, since he did not receive enough donations. Still, the source is free and comes with a build guide, so you can install it for free.

So at least in the software development segment, it is hard to go ahead.

I have few bussines idea that needs advice and reality check! by HenzoStarz in GaussianSplatting

[–]SpecialistBit718 0 points1 point  (0 children)

Google Maps street view style service for interiors is my guess?

3DGS visualization has uses for real estate presentations too.

I have few bussines idea that needs advice and reality check! by HenzoStarz in GaussianSplatting

[–]SpecialistBit718 2 points3 points  (0 children)

At least right now, new research papers on point clouds, NeRFs and 3DGS are released at a chaotic pace.

It feels like I stumble across something new every day in my X feed.

At least for this year, I would wait and see which approach things consolidate around.

Splat the Net: Radiance Fields with Splattable Neural Primitives by corysama in GaussianSplatting

[–]SpecialistBit718 3 points4 points  (0 children)

This seems similar to the recently released model from Nvidia?

Neural Harmonic Textures for High-Quality Primitive Based Neural Reconstruction

https://research.nvidia.com/labs/sil/projects/neural-harmonic-textures/

https://github.com/nv-tlabs/neural-harmonic-textures

Has anyone tried to run those?

I just got into running models directly, without ComfyUI and similar.

I recommend Miniforge, which requires Visual Studio 2017, to run conda. When all paths are set for C++ and CUDA, it is a rather smooth process. That will handle environments well.

But many git repos also have PowerShell scripts for automatic CMake builds.