It seems to be small language model (2.2 gigabytes in memory) and it runs on GPU (at least on M1 Ultra) predictions are pretty bad compared to LLAMA 2/3 or GPT 3.5.
Here some short video of some minimal swift project with single struct that i asked to free memory of. (It sadly failed I tested in on my bigger projects with a lot of Metal and HPC and it never make usable code completion for my comments. On the other hand it made some usable (?) documentation for my methods inside some structs)
Need Text
Play
Current Time 0:00
/
Duration Time 0:00
Remaining Time -0:00
Stream TypeLIVE
Loaded: 0%
Progress: 0%
0:00
Fullscreen
00:00
Mute
Playback Rate
1x
- 2x
- 1.5x
- 1.25x
- 1x
- 0.75x
- 0.5x
Subtitles
- subtitles off
Captions
- captions off
Chapters
- Chapters
Anyone got some good results with it (some tricks maybe :?)