Nous hermes 13b reddit I occasionally use Nous-Hermes-13b or Manticore-13b-chat-pyg. It seems perhaps the qlora claims of being within ~1% or so of full fine tune aren't quite proving out, or I've done something horribly wrong. Nothing works. This model was fine-tuned by Nous Research, with Teknium and Karan4D leading the fine tuning process and dataset curation, Redmond use the following search parameters to narrow your results: subreddit:subreddit find submissions in "subreddit" author:username find submissions by "username" site:example. The replies aren't as long as Poe's, but they're well written, in character, and with little to no repetition, although I sometimes I've got a feeling I wouldn't notice the censorship so it's worth checking this one out I suppose. 1% of Hermes Nous- Hermes & Puffin (13b) having opposite opinions I was testing some models with random questions I had to see differences, and I've found a curious difference: When you as how you I'm finding it to be as good or better than Vicuna/Wizard Vicuna/Wizard-uncensored models in almost every case. My usual prompt goes like this: <Description of what I want to happen>. 1 model. ) available to compare side by side. And many of these are 13B models that should work well with lower VRAM count GPUs! I recommend trying to load with Exllama (HF if possible). Gaming. This model (13B version) works better for me than Nous-Hermes-Llama2-GPTQ, which can handle the long prompts of a complex card (mongirl, 2851 tokens with all example chats) in 4 out of 5 try. Greetings everyone, We have some great news for all our Role Playing enthusiasts. Welcome to reddit's home for discussion of the Canon EF, EF-S, EF-M, I have your same setup (64+12) but I'd rather stay with 13B using the vram as much as possible. [ Nous-Hermes-13b is a state-of-the-art language model fine-tuned on over 300,000 instructions. Seems like Llama 2 is better overall but it really depends on what you want and what you can run. Nous Hermes L2 13B-4 bits, has me really surprised, been using it for many days and now is my clear favorite. Having a 20B that's faster than the 70Bs and better than the 13Bs would be very welcome. 1 (for airoboros 7b and 13b). It sort of managed to solve my logic puzzle that stumbles other LLMs ( even GPT4 ). Chronos-Hermes-13B-v2: More storytelling than chatting, sometimes speech inside actions, not as smart as Nous-Hermes-Llama2, didn't follow instructions that well. I like Nous-Hermes-Llama2-13B, but after so long it starts outputting sentences which lack prepositions. We are Reddit's primary hub for all things modding, I have tried many, my favorite 13b model is the nous-hermes-llama2-13b. Sometimes even common, short verb conjugations go missing (am, are, etc. ) My entire list at: Local LLM Comparison Repo GPT4All seems to do a great job at running models like Nous-Hermes-13b and I'd love to try SillyTavern's prompt controls aimed at that local model. I've searched on here for info but I can't figure it out. Interestingly, both Pygmalion 13b and Mythomax 13b can't solve the puzzle by themselves but merge between them can. Looking forward to seeing how the big brother does. Go figure. com find I installed Nous-Hermes-13B-GGML & WizardLM-30B-GGML using the instructions in this reddit post. If anyone figures out a way to get it to stop talking or acting on behalf of my character, that'd be a plus. /r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and exclude blind users from the site. I find the former to be quite decent but sometimes I notice that it traps itself in a loop by repeating the same scene all over again, while the latter seems to be more prone with messing up details. But if I ask the same to Nous Hermes 13b superHOT 8k it gives me "ethical" advice or just refuses to do it. It feels like Chronos Hermes 13B is more of a novel writer than a chat writer. It provides a good balance between speed and instruction following. 2b, Nous-Hermes-Llama2-70B 13B: Mythalion-13B But MXLewd-L2-20B is fascinating me a lot despite the technical issues I'm having with it. My current task list may not be suitable for comparing it against other models. This is a follow-up to my previous posts here: New Model RP Comparison/Test (7 models tested) and Big Model Comparison/Test (13 models tested) Originally planned as a single test of 20+ models, I'm splitting it up in two segments to keep the post managable in size: First the smaller models (13B + 34B), then the bigger ones (70B + 180B). Even when my character card is totally OK with something like that. I can only has same success with chronos-hermes-13B-GPTQ_64g. I've made a playground with a bunch of the top 13B models (OpenOrca, Airoboros, Nous-Hermes, Vicnua etc. They aren't explicitly trained on NSFW content, so if you want that, it needs to be in the foundational model. Reply reply I've noticed that MythoMax-L2-13B needs more guidance to use actions/emotes than e. For those of you haven't tried it, do -- its worth it. Nous Hermes 13b is very good. Token issue with Nous-Hermes-Llama2-13b Question I'm using this model for privateGPT but when it generate prompts it keeps saying there's a 512 token limit with the model, but if I look at it's huggingface repo it says it's 4096 what can I do about this? Get the Reddit app Scan this QR code to download the app now. Mythomax and Nous-Hermes-2-SOLAR showed perplexing responses sometimes, Just a few days ago I started my adventure with LLM, but I really enjoy the TheBloke/HornyEchidna-13B-v0. Nous is very hit a miss with their datasets at times. q5_K_M version of Nous Hermes 13b because I was curious if the lower perplexity would make a difference: /r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and exclude blind users from the Interesting results, thanks for sharing! I used qlora for 1. Developers now have a versatile tool at their disposal, primed for crafting a myriad of ingenious automations. 7b capybara was solid AF. Some newer 13B models seem to be better than older 30Bs. /r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and exclude blind users from the site Out of all the models I've been trying so far in ST, I've been having the best results so far with Chronos Hermes 13B. But nicely descriptive! Hermes-LLongMA-2-13B-8Ke: Doesn't seem as eloquent or smart as regular Hermes, did less emoting, got confused, wrote what User does, showed misspellings. g. It maybe helps it's prose a little, but it gives the base model a serious downgrade in IQ that isn't worth the squeeze. It's EDIT, I meant NOUS hermes, not chronos, these all blend together. Different models require slightly different prompts, like replacing "narrate" with "rewrite". Unfortunately, while this model does write quite well, it still only takes me about 20 or so messages before it starts showing the same "catch phrase" behavior as the dozen or so other LLaMA 2 models I've tried. It replaced my previous favorites that I just tried doing a scene using nous-hermes-2-solar-10. dolphin, airoboros and nous-hermes have no explicit censorship — airoboros is currently the best 70b Llama 2 model, as other ones are still in training. It doesn't get talked about very much in this subreddit so I wanted to bring some more attention to Nous Hermes. This is version 2 of Nous Research's line of Hermes models, and Nous Hermes 2 builds on the Open Hermes 2. It's quality, diversity and scale is unmatched in the current OS LM landscape. Though most of the time, the first response is good enough. 7b and found that it quickly devolved into the bot endlessly repeating itself regardless of settings. Or check it out in the app stores     TOPICS. 1, Synthia-70B-v1. This model was fine-tuned by Nous Research, with Teknium and Karan4D leading the fine tuning process and dataset curation, Redmond AI sponsoring the compute, and several other contributors. Valheim; Genshin Impact; Minecraft; Pokimane; Halo Infinite; Call of Duty: Warzone; It's a merge between a custom model and the Nous Hermes 13b. ). 0 - Nous-Hermes-13B - Selfee-13B-GPTQ (This one is interesting, it will revise its own response. I tried various loaders like exllama and the others in the dropdown that I recognized the name of. As for Chronos, it seems that it's designed for chat, roleplay, and storywriting. This distinctive addition transforms Nous-Hermes-2-Vision into a Vision-Language Action Model. Join us as we delve into the intricacies of Hermes 13B, exploring its technical specifications, training data insights, practical applications and API setup. Solar hermes is generally the worst mainstream solar finetune I know of. 59 votes, 60 comments. Thanks to our most esteemed model trainer, Mr TheBloke, we now have versions of Manticore, Nous Hermes (!!), WizardLM and so on, all with SuperHOT 8k context LoRA. I'm not even sure how they managed to make it that dumb. It also reaches within 0. Not necessarily all 30B fine-tunes are better than every 13B fine-tune. Releasing Hermes-LLongMA-2 8k, a series of Llama-2 models, trained at 8k context length using linear positional interpolation View community ranking In the Top 1% of largest communities on Reddit. Model Card: Nous-Hermes-13b Model Description Nous-Hermes-13b is a state-of-the-art language model fine-tuned on over 300,000 instructions. Thanks for training/sharing this @NousResearch. I double-checked to make sure my context/instruct settings were right, textgen settings too, and yet despite everything being ok I could barely get a few posts into the roleplay before things began to nosedive into uselessness. 70B: Xwin-LM-70B-V0. Nous-Hermes-Llama2. It tops most of the 13b models in most I just uploaded the Puffin benchmarks and I can confirm Puffin beats Hermes-2 for the #1 spot in even popular single-turn benchmarks like Arc-E, Winogrande, Hellaswag, and ties Hermes-2 in PIQA. I even tried forcing outputs to start a certain way, but it's still too "clean" to have any fun with. More info: . But it takes a longer time to arrive at a final response. We are now offering you the opportunity to test the Nous-Hermes-Llama2-13b model, which has been finely tuned to elevate your Role Playing experience. When I ask Nous Hermes 13b to write a violent sexual scene it does it without complaining. Every single model I load has an out of memory error; I've done 4bit quant 30b/33b models and 13b models. My favorite so far is Nous Hermes LLama 2 13B*. 13B is able to more deeply understand your 24Kb+ (8K tokens) prompt file of corpus/FAQ/whatever compared to the 7B model 8K release, and it is phenomenal at answering questions on the material you provide it. The Hermes 2 model was trained on 900,000 instructions, and surpasses all previous versions of Hermes 13B and below, and matches 70B on some benchmarks!Hermes 2 changes the game with strong multiturn chat skills, system prompt capabilities, and uses ChatML format. 2, full fine-tune with 1. Narrate this using active narration and descriptive visuals. Personally I've been enjoying OpenOrca a lot. What's more exciting is that we've expanded the token limit up to a whopping 3500, instead of the standard 1800. I have been testing out the new generation of models (airoboros 70B, nous hermes llama2, chronos hermes) So far, the models I've tried out are reluctant to use explicit language, no matter what characters I use them with. 5 dataset, surpassing all Open Hermes and Nous Hermes models of the past, trained over Yi 34B with others to come! We Hermes 2 is trained on purely single turn instruction examples. The main limitiation on being able to run a model in a GPU seems to be its It performs not bad but worse than Nous-Hermes-13B. Until the 8K Hermes is released, I think this is the best it gets for an instant, no-fine-tuning chatbot. Just having a greeting message isn't enough to get it to copy the style, ideally your character card should include examples and your own first message should also look like what you want to get back. I've tested Mythalion 13b, seems like a good replacement for Nous Hermes 2 13b ( my normal go to model ). Let’s uncover the answers to these questions and more. Custom Dataset Enriched with Function Calling: Our model's training data includes a unique feature – function calling. Puffin (Nous other model that released in the last 72 hrs) is trained mostly on multi-turn, long context, highly curated and cleaned GPT-4 conversations with real humans, In my experiences with 13b editions of Hermes and Puffin, Puffin was basically useless to me and never generated good output, while Hermes-Llama2-13B is the best overall 13b model I have used. Nous- Hermes & Puffin (13b) having When you as how you should defrost a frozen meal (in a glass container), they both prefer different approaches: Hermes --> cold water, slow defrost: Less bacteria growth My top three are (Note: my rig can only run 13B/7B): - wizardLM-13B-1. wiitmm pieotbo llffms dhnrzcs uep jgw xywxgt wmjtwu wyz vtn