Companies routinely use tools that collect data about them and their customers (Slack, Teams, etc.). Should they also drop those tools and build everything in-house?
It is easy to opt out of OpenAI's data collection when using ChatGPT, and when using the API you are opted out by default.
Now, if you are really worried about data privacy, I don't see how LangChain is a solution. LangChain is just an orchestrator that connects different LLM providers (OpenAI, Cohere, Anthropic, etc.) with other tools, such as vector stores. It is not built for data privacy per se: using OpenAI, Cohere, or Anthropic through LangChain leads to the same data privacy issues as calling them directly.
So if data privacy is really the issue, the solution is to run open-source models in-house, not to use LangChain!
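To make the point concrete, here is a rough sketch in plain Python. It does not use LangChain's actual API: `Orchestrator`, `HostedBackend`, and `LocalBackend` are hypothetical names I made up for illustration, and the "hosted" backend just records what would be sent over the network. The idea is only that an orchestration layer forwards your prompt to whatever backend you plug in, so where your data ends up is decided by the backend.

```python
from dataclasses import dataclass, field
from typing import Protocol


class Backend(Protocol):
    """Anything that can turn a prompt into a completion (hypothetical interface)."""
    name: str

    def complete(self, prompt: str) -> str: ...


@dataclass
class HostedBackend:
    """Stand-in for a third-party API; records every prompt that would leave your network."""
    name: str
    sent_to_provider: list[str] = field(default_factory=list)

    def complete(self, prompt: str) -> str:
        # In real life this would be an HTTPS call: the provider sees the full prompt.
        self.sent_to_provider.append(prompt)
        return f"[{self.name}] reply"


@dataclass
class LocalBackend:
    """Stand-in for an open-source model running on your own hardware."""
    name: str = "local-open-model"

    def complete(self, prompt: str) -> str:
        # Nothing is sent anywhere; the prompt stays in this process.
        return f"[{self.name}] reply"


@dataclass
class Orchestrator:
    """Toy orchestration layer: builds a prompt and hands it to the backend unchanged."""
    backend: Backend

    def run(self, question: str, context: str) -> str:
        prompt = f"Context: {context}\nQuestion: {question}"
        return self.backend.complete(prompt)


if __name__ == "__main__":
    hosted = HostedBackend(name="hosted-provider")
    Orchestrator(hosted).run("Who is our biggest customer?", "internal CRM notes")
    # The hosted backend received the internal context verbatim.
    print(hosted.sent_to_provider)

    # Same orchestrator code, but nothing leaves the machine.
    print(Orchestrator(LocalBackend()).run("Who is our biggest customer?", "internal CRM notes"))
```

Swapping `HostedBackend` for `LocalBackend` changes the privacy story; keeping or removing the orchestrator does not.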