Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

How is this built? What'd be the approach if I'd like to achieve similar results against proprietary data.

References article speak of RAG and RIG - but I wonder if they factor into fine-tuning the models. AFAIK, RAG doesn't play nicely with structured data.




I haven't read it in great detail, but it looks like there's documentation for self-hosting[1] (on Google Cloud).

[1] https://docs.datacommons.org/custom_dc/




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: