Submitted by davidmezzetti t3_11bk12r in MachineLearning

​

https://i.redd.it/4yli3zalvbka1.gif

txtchat is a framework for building conversational search and workflows. txtchat is open source under Apache 2.0 license and available on GitHub.

GitHub | Article

A set of intelligent agents are available to integrate with messaging platforms. These agents or personas are associated with an automated account and respond to messages with AI-powered responses. Workflows can use large language models (LLMs), small models or both.

https://preview.redd.it/uhypbdu0wbka1.png?width=1301&format=png&auto=webp&v=enabled&s=ccdfc7063cfc5a26086c869a7f539fa82c6fea73

A persona is a combination of a chat agent and workflow that determines the type of responses. Each agent is tied to an account in the messaging platform. Persona workflows are messaging-platform agnostic.

Examples

The following is a list of YouTube videos that shows how txtchat works. These videos run a series of queries with the Wikitalk persona. Wikitalk is a combination of a Wikipedia embeddings index and a LLM prompt to answer questions.

Every answer shows an associated reference with where the data came from. Wikitalk will say "I don't have data on that" when it doesn't have an answer.

History

Conversation with Wikitalk about history.

https://www.youtube.com/watch?v=ROyess8dLoA

Sports

Talk about sports.

https://youtube.com/watch?v=LXRB-iruKSc

Culture

Arts and culture questions.

https://www.youtube.com/watch?v=OkObkNhJIgk

Science

Let's quiz Wikitalk on science.

https://youtube.com/watch?v=-rsYDsZc9Wo

Summary

Not all workflows need a LLM. There are plenty of great small models available to perform a specific task. The Summary persona simply reads the input URL and summarizes the text.

https://youtube.com/watch?v=PBJm9aDqkn0

Mr. French

Like the summary persona, Mr. French is a simple persona that translates input text to French.

https://youtube.com/watch?v=4x8pOIm4rbo

10

Comments

You must log in or register to comment.

visarga t1_j9y7624 wrote

Does it do only one round of retrieval?

3

davidmezzetti OP t1_j9y7tmq wrote

With the current version, yes it runs an embeddings query for each message. I plan to handle threaded conversations shortly. In that scenario, the chat history will be provided to the prompt.

1

dancingnightly t1_ja2lfup wrote

Is this current version mostly RAG + WebGPT semantic search to GPT answer, then?

Big fan of your recent work.

2

davidmezzetti OP t1_ja345mn wrote

Thank you.

This application is RAG with a local vector index combined with a LLM from the FLAN-T5 series of models.

The whole solution can be locally hosted with no remote runtime API dependencies.

1

SatoshiNotMe t1_ja2swra wrote

This is very interesting. Ignore the down-voters. Thank you for sharing 🙏

2

davidmezzetti OP t1_ja34bm6 wrote

Thanks, appreciate it. Not much I can do with down votes unless someone provides their rationale, which no one ever does.

1

SatoshiNotMe t1_ja3z3f2 wrote

You may get better discussion on HN. Speaking of which, I have a trove of HN discussion links bookmarked, and these are a goldmine of info. Would something like your approach work for “chatting” with these and getting useful answers?

2