Introducing Shape Tokens, a continuous, compact, and easy-to-integrate 3D representation in machine learning models. Shape Tokens act as shape vectors, representing shape information within a 3D flow simulation model. This flow simulation model is trained to estimate probability density functions corresponding to delta functions embedded in 3D shape space. By combining Shape Tokens with various machine learning models, we can generate new shapes, convert images to 3D, match 3D shapes to text and graphics, and assign shapes directly to flexible, user-defined decisions. Additionally, Shape Tokens enable systematic analysis of geometric properties, including normal, density, and transition fields. In all tasks and tests, the use of Shape Tokens shows strong performance compared to existing frameworks.
Figure 1: Our representation of Shape Tokens can be easily used as input/output to machine learning models in various applications, including one-to-3D image (left), neural rendering of standard maps (top right) and 3D-CLIP alignment (bottom right). The resulting models obtain robust performance compared to the baselines for each function.
Video 1: The video shows one of our image in 3D point cloud effects. Images are objects that can be seen in the Objaverse test set. Each video first shows the input image, then the generated point cloud. [Credits]
Video 2: From the same abstract image, we generate multiple point clouds independently. [Credits]
Figure 2: Overview of our architecture. (Left) We model the 3D shape as a probability density function centered on the surface, creating a delta function in 3D. (Right) Our tokenizer uses cross-sectional attention to integrate information about the point cloud derived from the shape in ST. Velocity estimation uses only cross-correlation and MLP to preserve independence between points.
Figure 3: Reconstruction, density, and normalization of invisible point clouds in the GSO dataset. For each row, we are given a point cloud containing 16,384 points (xyz only), we calculate the ST and the iid sample of the result p(x|s) to get 262,144 points. Different columns give input and sample point clouds from different viewpoints. Shown with a label in parentheses, we color the input points according to their xyz coordinates and the sample points according to their original uvw noise coordinates and their estimated normality (last two columns). Note that we do not generally provide such input to the shape token.
Video 4: We calculate Shape Tokens from input point clouds (16,384 points) from Google Scanned Objects. Then we sample 16x more points (262,144 points). The video shows the uvw-to-xyz trajectory of the sample flow matching process, that is, the ODE trajectory. We color the points by their starting position in the noise space (uvw). [Credits]
Figure 4: The integration path of the ODE defines the map from xyz (data) to uvw (noise).
Video 3: The video shows the latest one-to-3D image modes in Google Scanned Objects, which are not visible in all modes.
From left to right:
Installation image
Spatter-image (CVPR 2024): trained on Objaverse
Point-e (2022): trained on several million 3D identity meshes.
Make-a-shape (ICML 2024): trained on 18 datasets, including Objaverse
Ours: trained in Objaverse
Note that this video is not intended to compare individual methods — these models differ in their training data (e.g., Point-e was trained for the identity of 3D meshes) and methods (e.g., Splatter image is not a generative model, our method. takes a model of a known camera). We provide viewer reference results. [Credits]
Video 5: The page shows the results of neural rendering on invisible clouds. From the Shape Tokens, we use a neural network to independently estimate the intersection point of each ray and its normal point.
From left to right:
A common ground truth environment
Pointersect (CVPR 2023)
Ours
[Credits]
Mesh/Image Credits: Google Scanned Objects, fedomo.ru, Jacob.Elhatmi, WrenArt, undeadfae, Monicag97, STK_produktion, Andi R, xabi, th_jabba, johnnokomis, LasquetiSpice, AdiXXioN, taplinhvipmarknsao, KotronyID, VeppalID. JacksonSanders, remdwaas, GRAPHTEC AMERICA, iiircha, despinozavi, AstrumProjects, asleshka, ulmsklv, S.Duce, idcim, Darkkostas25, CREATRBOI, steam2020, fedomo.ru, AnirudhRao, 3DFoxtarrians, katitpaxtarrians, iicepade, iicepade, A109082012, RyanCrosby, Armen Gevorgyan, EnjoyLife_Tlt, Fong Chen, WHA Arquitectos, andreagonzalez28, YouSaveTime, Cutestormy, amy3d, daand, EfrenR, Poppy, MARTINICE GROUP, julianCheelletz, Whatsartkart, Whatsartskar, rednice, whatsart LuDiChRiS, mbilalsiddique1, Frybrix, defnotdan, invisiprim, Brent Loncher, MrMaxICT , Stevie_66, Jesse Van Norman, WuhuAirline, anyachan, Lustron.ru, КУКАЛЕВ, Maxmalow, Karolina K Bienkowska, James Fraarkowska, James Fraarkowska, James show, Christopher Cox, apoiocad, Padraig Daly, CurveCreativeStudio, DennisGray, grantbowlds, YouniqueĪdeaStudio, nobodyroo, dinomaster, pattarrian, rodrigo.ferrada , tamaliteitor123, George B, Csaba Baity (tsabszy), tim.a. , DJMaesen, agglover, Adrian Carter, mohamedsuspeito, Kevin Bond, faizn0rdin, SpaceCowBoy, Giravolt, NukedGames, bhrf, mscla1r3, ScannerDev, Vikrama Raghuraman, NoobiePie, prostair.pl, Rzyahlevid, Phiafl Goschhuta, digital, Rzyahlevid, Phiafl Gookufa pixelsquare, SketchingSushi , Mateus Schwaab, archmix, jacob_kenndey, lidija.simo, Jessica Peterson, Ltcolscotty, 3Dystopia, Vincent Laberge, frdifrn, Frédérick Pagé, camlaneve, Matt, IronEqual, Tursito, DavidkLearn, Mr. guseu, Guilhermino, dieterreinert, Mattyew, natalimedeiros, leopro, Trappemakeren, beehn, alisachen69, Chrifuf, cncbrasil, zuzana vajdova, nguyenlouis32, DarksProducer, globalshizaku, louayleo, semmert179, Hainesruemma Nick_Sherman, chaosexcell, ssarinareza, aveli.ladva, Tomas Rubianes, RainerWahnsinn , Lucas Jaenisch, cs_adam, trinityscsp, a109082026, JasseeNFT, Cowdi, Kisielev Mikhail, kay Quobad, secretariatep,9, me160 Concert. Beroepsonderwijs, fedomo.ru, PatelDev, bipolarbear, Emm (Scenario), De Oliveira M., Naruto, Keita-sama, RodierGabrielle, mizuhi, shughes, Gregory Khodyrev, millerj449, Marko31, David_Holiday, fedouard, Artem, Edouard, Artem. Shamsuarov, Alan IGrice Staircase Co Ltd, THESTIG03, vamsikrishna.v, Dundee Howff Conservation Group, sinhoroto, jia100, 10668285, Born_Canadian, jashma82, aki.karppinen, DarkAaron999, Luckster, julius, RtrolosBillf, Rj. fedomo.ru, MOHMAX1, jamesdeantv1, moxmoin, Adrian21, andrea bocchini geometra, Re3xyyz, Binkley-Spacetrucker, FeralMan, unownlord, pigfinite, duperonvincent, ayekerik, 140813, antonio. Stairs Ltd, breezeca, kishi, 97jana, Sogomonyan_Vaagn, peachybunny, gb.prof.69, milen.margaryan2003, nguyenhuydang, andysmiles4games, Aorie, jonamanz9673, mommy long legs, buckygaming, Guygaming2019, XXs arakiminoru, Tatiana Sumarokova, potaato, Lustron.ru , jhseok8927, Xillute | Dev, re1monsen, c4n, Ceat, joseph.terronez, matusekfoto, Max Wittig, rltw, lsbergin, KIΣITO, Aiden Huxley, 3Dystopia, MartyUkovGBS, Jamie Rose, Mihail.Burduja, ashpatz845, Schack-Excel-Trapper, Stairs Ltd, Behets, Noemi.Mancilla.Serrano, madison319478, Drake, xeratdragons, timpugh44, GSMRF, Lauren Hasegawa, Ca7chi, dewathoem, schaffsp, newfields-3dprinting, Dikart, MariaMam, Micayla Spiros, silvinomc00, Neutqrie untia, J9rie, J2000, Neutqrie, Neutq000, Neutqrie. Robwaah007, shakiller, newfields-3dprinting, -Slash-, Saumleid, DreamSail Games – Graham, Jingbari, sualogo3d, maypassamon, Uğur Yakışık, Caitlin, LynSalvador, lanvalond, TheDesigner, e90r96, guilherme, FEVIEMB, ZONIEMB, ZONIEMB, GUILHERME, FE. TroyMay21, Qubx.3D
Source link