Goliath is a very big model; you need two NVIDIA X090 cards to run it smoothly!
If you have 8, 10 or 12 GB of VRAM on your graphics card:
Context = 4096. Quantization: with 8 GB use Q4_K_M; with 10 or 12 GB use Q5_K_M.
Try one of these models:
TheBloke/MythoMax-L2-Kimiko-v2-13B-GGUF
TheBloke/LLaMA2-13B-Psyfighter2-GGUF
TheBloke/LLaMA2-13B-Tiefighter-GGUF
If you have 16, 20 or 24 GB of VRAM:
Context = 8192, quantization Q5_K_M.
Try this model, which is a little better for long roleplay: TheBloke/MLewd-ReMM-L2-Chat-20B-GGUF
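
If you want the rules of thumb above as a quick lookup, here is a rough Python sketch. The function name `recommend` and the `Recommendation` type are made up for illustration. The post only lists 8, 10, 12, 16, 20 and 24 GB, so treating in-between sizes as the tier below them, and anything over 24 GB as the top tier, is my assumption and not a measured result.

```python
from typing import NamedTuple, Tuple


class Recommendation(NamedTuple):
    context: int             # context length in tokens
    quant: str               # GGUF quantization name
    models: Tuple[str, ...]  # suggested Hugging Face repos


# Model lists copied from the post.
MODELS_13B = (
    "TheBloke/MythoMax-L2-Kimiko-v2-13B-GGUF",
    "TheBloke/LLaMA2-13B-Psyfighter2-GGUF",
    "TheBloke/LLaMA2-13B-Tiefighter-GGUF",
)
MODELS_20B = ("TheBloke/MLewd-ReMM-L2-Chat-20B-GGUF",)


def recommend(vram_gb: int) -> Recommendation:
    """Map a VRAM size in GB to the settings suggested in the post.

    Assumption: sizes between the listed tiers fall back to the tier
    below, and sizes above 24 GB reuse the 16-24 GB settings.
    """
    if vram_gb < 8:
        # The post gives no advice below 8 GB, so refuse to guess.
        raise ValueError("no recommendation below 8 GB of VRAM")
    if vram_gb < 10:
        return Recommendation(4096, "Q4_K_M", MODELS_13B)
    if vram_gb < 16:
        return Recommendation(4096, "Q5_K_M", MODELS_13B)
    return Recommendation(8192, "Q5_K_M", MODELS_20B)


if __name__ == "__main__":
    for size in (8, 12, 24):
        rec = recommend(size)
        print(size, "GB:", rec.context, rec.quant, ", ".join(rec.models))
```

Change the thresholds if your own testing says otherwise; they are only a starting point.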