Comment by taegee
8 months ago
If you have a bunch of coders, just scrape the data. Then turn your back on this greedy maw.
We recently moved to Mattermost for the same reason. Not looking back.
8 months ago
If you have a bunch of coders, just scrape the data. Then turn your back on this greedy maw.
We recently moved to Mattermost for the same reason. Not looking back.
If you try to use the Slack APIs to scrape the data you will *quickly* run face-first into the insanely restrictive rate limiting they recently enacted to combat their customers using AI tools they aren't providing and able to monetize.
That being said, we were able to get full data exports in the past when we were merging two companies into a single slack instance. YMMV
Slack lets you do a full export which even includes DMs depending on which plan you have.
https://slack.com/help/articles/201658943-Export-your-worksp...
When the org I was at moved away from Slack (due to costs) we used this method and wrote a little Python script to convert the main channels' JSON dumps into PDFs so we had a usable backup of channels.
Please do not include PDF and usable in one sentence. Setting up some simple Postgres with sonic for fuzzy search would be _usable_, but PDF is like migrating from Slack to Teams.
In this case we didn't need a long term solution for searchable data on Slack.
We did the migration in stages, basically this:
In the end, most people never even needed to use the PDFs because they got everything they needed out of Slack before access was removed, but they are there for peace of mind and a last resort.
We also took this as an opportunity to stop using chat as a source of truth for long lived information. Anything that should be stored long term made its way somewhere else (Jira, Confluence, etc.).
5 replies →
PDF is the worse, it has its use-cases but is so painful to use programatically.
1 reply →
I'd bake them into a Sphinx static site. That gives you a free client side search index along with better navigability than sheets of paper. And you can target PDF if you still want it.
Does that break DM’s privacy or does it only let you export your own DM’s?
Well as per the article (and my own experience), the free tier only gives you public channels. The paid tier gives you everything: public/private channels, group chats (called MPIMs), and one-to-one DMs.
So yes, it breaks "privacy" (not that you should expect privacy when using a work Slack account).
Admins can break DM privacy on most company accounts.
IIRC, you have to do something called a "compliance export," which just like any other compliance feature (SSO, HIPAA BAA, audit logs, etc.) usually requires the highest plan. It's designed to add some extra friction so admins can't just add themselves to a DM from the main UI like they could with a channel, but it is possible.
If you think anything you do on your work computer or saas accounts is private I’ve got a bridge to sell you.
why the extra step of making them into PDFs?
We had dozens of channels with almost 10 years of business information in them.
Over time the business gravitated towards putting anything long lived into other sources but since migrating off Slack was essentially a kill switch on our data we wanted to make sure we had ways to access this historic data if needed.
There's no way non-developers were going to parse JSON files for text. We wanted a quick and dirty way to attach the archived PDF file for a channel as a file attachment to the new Teams channel. It gave everyone peace of mind that they could find anything later.
It all worked out in the end and was worth the few hours of dev time to make the 1 off script.
Btw I wasn't the one responsible for making the tech choice to use or leave Slack for Teams. I was the one who was tasked to help with the migration and help make things as streamlined as possible for the business to switch.
One of the biggest pain points was going back to a bunch of Google Drive, Jira, Confluence, etc. sources and finding + updating the links to Slack to be screenshots of the conversation. Another one was converting a bunch of Slack app / webhook integrations over. Teams is absolutely horrendous for this compared to Slack.
Human readable format at rest, I assume.
3 replies →
Mattermost is great, we've used it at a few places and it's very flexible.
Extensibility and integrations with learning management systems, as well as owning all your data, makes it sound like a great option in particular for an education-oriented organization.
And I imaging the AWS or GCP costs for hosting it won't be as high as what Slack wants.
Zulip is awesome too. On prem.
What do you like about Zulip? Any drawbacks that you tolerate? Considering it for on-prem but would love to have some real-world feedback.
Used it for about 2 years at a growing startup. What I liked:
* Threads are way better than in Slack. They're top level instead of just a bolted on afterthought. This means all your various conversations scale way better and are way easier to find than in Slack. I can't overstate this enough, it's a killer feature and genuinely improves the overall organisation of your communications
* Font size is just slightly smaller. My eyesight isn't what it used to be, but I still think they get the balance of legibility and information density spot on, whereas Slack feels cartoony in comparison.
* Search felt a little better, I can't exactly put my finger on why or how, just that finding things in Slack always feels comparable to the iOS Mail search feature: very basic.
Drawbacks:
* There are less out of the box integrations I think.
Yeah, my first thought was Mattermost, it’s pretty straightforward to set up and then your data’s nobody’s hostage.
this is what we're doing :)
Mattermost adheres to the same tactics as Salesforce: group calls in v10 only with paid tiers whereas free before. Have you considered alternatives?
See discussions below in this HN thread.
[0] https://old.reddit.com/r/selfhosted/comments/1fp76f0/matterm...
[1] https://forum.mattermost.com/t/solved-is-there-any-limitatio...
We're selfhosting it! If it runs on our infrastructure, we can't be extorted like this again
3 replies →
Is that using Jitsi or whatever it is? I thought that was third party? I actually set that up for work before realising that we all hate voice/video calls and would just prefer to type. :P