parabellum630
parabellum630 t1_ixvapdu wrote
Reply to comment by JustOneAvailableName in [D] Pytorch or TensorFlow for development and deployment? by CodaholicCorgi
I see. But can they be used as a good starting point to build upon?
parabellum630 t1_ixuno8d wrote
Reply to comment by Deep-Station-1746 in [D] Pytorch or TensorFlow for development and deployment? by CodaholicCorgi
Oh no! I had no idea about ONNX and just transfered a huge model from tf to pyt manually.
parabellum630 OP t1_ix5yyv8 wrote
Reply to comment by trashacount12345 in [R] Tips on training Transformers by parabellum630
Oh my God. I used to do this too! I am happy I am not the only one!! But my monkey brain learned not to do this eventually. I have managed to get it to GRU performance by applying more warmup steps, learning rate scheduling, decreasing model size, using Pre-LN, doubling the batch size, and reducing the sequence length.
parabellum630 OP t1_ix4flxr wrote
Reply to comment by fasttosmile in [R] Tips on training Transformers by parabellum630
Interesting, I will try this out as well.
parabellum630 OP t1_ix4fc49 wrote
Reply to comment by ChangingHats in [R] Tips on training Transformers by parabellum630
Thank you!! I was experimenting with off-the-shelf implementation with little customization. I am using the transformer in an encoder fashion with 800 hidden dimensions due to the constraints of other models surrounding it. I will try out varying all these hyper parameters. Looks like it's going to be a long week.
parabellum630 OP t1_ix4eozs wrote
Reply to comment by suflaj in [R] Tips on training Transformers by parabellum630
Thank you so much for these insights!! I will try these out.
Submitted by parabellum630 t3_z088fo in MachineLearning
parabellum630 t1_itmm0ei wrote
Reply to comment by trikortreat123 in [D] Neural Avatar Community by trikortreat123
I worked on eyes and upper head shape while the paper I built upon (DECA) tackles personalized fine details. But I do agree hair modeling and finer details and texturing is better with volumetric approaches. Maybe a fusion of the two would be a good research direction? Like the end goal is not only the modeling but also using it for real tasks.
parabellum630 t1_itmihvd wrote
Reply to [D] Neural Avatar Community by trikortreat123
What are your thoughts about 3DMM based models. I am doing research on avatar generation from monocular images using 3DMM based approaches and found them to be better in terms of inference speed, animatability and ease of integration into existing creative workflows.
parabellum630 t1_j9dcup7 wrote
Reply to comment by Valachio in [D] What's the best way to capture a person's 3D likeness right now? by Valachio
3D morphable models are the way to go for image of face to 3d model. I have previously done a lot of research work on this. The deep learning model takes an image as input and outputs 3d model as an obj or fbx file. This can be used in any 3d editing software. Take a look at DECA on github.