Gemma 3 4B can answer questions about 80% of the time when given access to a ZIM file of Wikipedia.
Unfortunately, it still takes 20 seconds to run on a CPU, so I can't think of many practical uses until we get cheap, low-power AI accelerators that are a bit easier to develop for.