Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You could train that architecture end-to-end though. You just have to run both models and backprop through both of them in training. Sort of like mixture of experts but with two very different experts.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: