Automod for #294 #478

ghost · 2020-06-27T23:27:58Z

Started adding automod functionality much like dyno. #294 (comment)

Things that still need to be added:

- ~~Anti fast messages~~ (may need tweaking)
- Auto mute after 3 infractions
- Create a banned word list (and decide where it goes)
- Add logging

This still needs some optimisation but I think the basis is there.

Bottersnike · 2020-07-09T20:10:18Z

cdbot/cogs/admin.py

+    @Cog.listener()
+    async def on_message(self, message):
+        def check(m):
+            return m.author == message.author and (datetime.datetime.utcnow() - m.created_at).seconds < 4


This will make 1000 calls to datetime.utcnow() for every message sent. This is a bad thing.

The time window, 4 seconds, should probably also be configurable.
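A minimal sketch of both points, assuming a stand-in `CachedMessage` type and a hypothetical `FAST_MESSAGE_WINDOW` setting (neither exists in the PR): the current time is read once by the caller and turned into a single cutoff, and the window is a parameter. Comparing against a cutoff also avoids `timedelta.seconds`, which holds only the seconds component and ignores whole days.

```python
import datetime
from typing import Iterable, NamedTuple

# Hypothetical config value; the PR hard-codes 4 seconds.
FAST_MESSAGE_WINDOW = datetime.timedelta(seconds=4)


class CachedMessage(NamedTuple):
    """Stand-in for a cached message; not the discord.py class."""

    author_id: int
    created_at: datetime.datetime


def count_recent_messages(
    author_id: int,
    messages: Iterable[CachedMessage],
    now: datetime.datetime,
    window: datetime.timedelta = FAST_MESSAGE_WINDOW,
) -> int:
    """Count messages by `author_id` created within `window` before `now`."""
    cutoff = now - window  # computed once, not once per cached message
    return sum(
        1 for m in messages if m.author_id == author_id and m.created_at > cutoff
    )
```

The listener would call `datetime.datetime.utcnow()` a single time and pass the result in as `now`.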

Bottersnike · 2020-07-09T20:10:47Z

cdbot/cogs/admin.py

+
+        if not message.author.bot:
+            # Checks if message contains banned word.
+            if any([word in message.content for word in BANNED_WORDS]):


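This will also block innocent words such as snigger, because it matches substrings. Consider using a regex with word boundaries, such as \bword\b (\Wword\W would miss a word at the very start or end of the message).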

Surrounding the generator in [] is redundant, and is marginally slower than just leaving it as a generator: the whole list is built before any sees it, which defeats any's early exit. The same applies to the other places where any([...]) has been used.
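A rough sketch of the word-boundary approach, using `\b` on both sides and compiling the pattern once. The word list here is a placeholder and `compile_banned_pattern` / `contains_banned_word` are hypothetical names, not part of the PR.

```python
import re
from typing import Iterable


def compile_banned_pattern(words: Iterable[str]) -> "re.Pattern[str]":
    """Build one case-insensitive pattern matching any whole banned word.

    Assumes `words` is non-empty.
    """
    alternatives = "|".join(re.escape(word) for word in words)
    return re.compile(rf"\b(?:{alternatives})\b", re.IGNORECASE)


# Placeholder entries; the real list would live in the bot's constants.
BANNED = compile_banned_pattern(["badword", "spam"])


def contains_banned_word(content: str) -> bool:
    """True if `content` contains a banned word as a whole word."""
    return BANNED.search(content) is not None
```

With this, "spammer" no longer trips the filter while "Spam!" still does.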

Bottersnike · 2020-07-09T20:11:05Z

cdbot/cogs/admin.py

+                await message.delete()
+                await message.channel.send(f"{message.author.mention} Watch your language!", delete_after=5)
+            # Checks if message contains banned domain.
+            if any([word in message.content for word in BANNED_DOMAINS]):


elif should be used here, not if: we may have already deleted this message. Likewise, there's the same issue of matching partial words.

Bottersnike · 2020-07-09T20:14:42Z

cdbot/cogs/admin.py

+                await message.delete()
+                await message.channel.send(f"{message.author.mention} Don't spam mentions!", delete_after=5)
+            # Checks if message was sent too quickly.
+            if len(list(filter(lambda m: check(m), self.client.cached_messages))) >= 4:


self.client.cached_messages is a cache of up to 1000 messages. Ignoring the fact that the lambda is redundant, this will call check up to 1000 times for every message sent to the bot.

Rather than checking every message retroactively, it would be better to track timestamps of sent messages as a dictionary of deques, each limited to 5 items, indexed by user ID. Each deque would contain the timestamps of messages as the bot receives them. Checking for spam is then a case of pulling up the corresponding deque for a given user and, once it is full, checking if the oldest item in the deque is younger than n seconds.

At this moment, spamming is more likely to DoS the bot than be deleted.
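The dictionary-of-deques idea could look roughly like this. `SpamTracker`, its parameter names, and the default values are assumptions for illustration; only the data structure comes from the comment above.

```python
import time
from collections import defaultdict, deque
from typing import DefaultDict, Deque, Optional


class SpamTracker:
    """Track the last `count` message timestamps per user ID."""

    def __init__(self, count: int = 5, window: float = 4.0) -> None:
        self._count = count
        self._window = window
        # Each deque silently drops its oldest entry once it holds `count` items.
        self._recent: DefaultDict[int, Deque[float]] = defaultdict(
            lambda: deque(maxlen=count)
        )

    def record(self, user_id: int, now: Optional[float] = None) -> bool:
        """Record one message; return True if the user is sending too fast."""
        now = time.monotonic() if now is None else now
        timestamps = self._recent[user_id]
        timestamps.append(now)
        # Spam only when the deque is full and its oldest entry is still recent.
        return len(timestamps) == self._count and now - timestamps[0] < self._window
```

Each incoming message then costs one append and one comparison, regardless of how many messages are cached.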

Bottersnike · 2020-07-09T20:18:44Z

cdbot/cogs/admin.py

+                await message.channel.send(f"{message.author.mention} Don't spam mentions!", delete_after=5)
+            # Checks if message was sent too quickly.
+            if len(list(filter(lambda m: check(m), self.client.cached_messages))) >= 4:
+                await message.channel.purge(limit=4, check=who)


The magic number 4 has just been used twice here. For starters, it should be defined in a config somewhere. Secondly, because you're checking if it's >= 4, rather than == 4, we need to handle the case where it's > 4 as well.

Additionally, messages will come into the bot faster than the deletion event for the purge. This means that if I spam 5 messages, the first 4 will be caught by the check and deleted, but then messages 2 through 5 will also match the check, as the deletion response for 1 through 4 has not yet arrived. This causes the bot to double-warn and attempt to delete messages 2 through 4 twice.

Furthermore, limit is not the number of deleted messages. As the docs say:

limit (Optional[int]) – The number of messages to search through. This is not the number of messages that will be deleted, though it can be.

This means this filter can be trivially bypassed by having two or more members spam in unison. The bot will attempt to delete the most recent 4 messages, however if the check fails for all 4, it will delete none and will not continue searching.
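To make the bypass concrete, here is a simplified model of the `limit` semantics described in the quoted docs. `purge_model` and `Msg` are illustrative stand-ins, not discord.py code: only the newest `limit` messages are examined, and only those passing `check` are deleted.

```python
from typing import Callable, List, NamedTuple


class Msg(NamedTuple):
    author: str
    text: str


def purge_model(
    history: List[Msg], limit: int, check: Callable[[Msg], bool]
) -> List[Msg]:
    """Return the messages a purge would delete; `history` is oldest-first."""
    searched = history[-limit:] if limit > 0 else []
    return [m for m in searched if check(m)]


# alice spams 4 messages, then bob posts 4 before the purge runs.
history = [Msg("alice", f"spam {i}") for i in range(4)]
history += [Msg("bob", f"spam {i}") for i in range(4)]

deleted = purge_model(history, limit=4, check=lambda m: m.author == "alice")
```

Here `deleted` is empty, because the 4 messages searched all belong to bob. Raising `limit` widens the search, but the number of messages actually deleted still depends on `check`.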

Bottersnike · 2020-07-09T20:19:49Z

cdbot/cogs/admin.py

+            # Checks if message contains mass mention.
+            if len(message.mentions) > 8:
+                await message.delete()
+                await message.channel.send(f"{message.author.mention} Don't spam mentions!", delete_after=5)


At this point, the repetition is starting to violate DRY. It may be better to encapsulate the checks, then have a single call to message.delete() and to message.channel.send() after the encapsulated checks.
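One possible shape for the encapsulated checks, as a sketch only. `IncomingMessage`, `RULES`, `first_violation`, the placeholder word and domain, and the link warning text are all hypothetical; the two other warning strings and the mention threshold of 8 come from the diff.

```python
from typing import Callable, List, NamedTuple, Optional, Tuple

MAX_MENTIONS = 8  # threshold used in the diff; ideally a config value


class IncomingMessage(NamedTuple):
    """Stand-in carrying only the fields the rules need."""

    content: str
    mention_count: int


Rule = Tuple[Callable[[IncomingMessage], bool], str]

# Each rule pairs a predicate with the warning to send; order sets priority.
RULES: List[Rule] = [
    (lambda m: "badword" in m.content.lower(), "Watch your language!"),
    (lambda m: "bad.example" in m.content.lower(), "That link is not allowed!"),
    (lambda m: m.mention_count > MAX_MENTIONS, "Don't spam mentions!"),
]


def first_violation(message: IncomingMessage) -> Optional[str]:
    """Return the warning for the first rule the message breaks, else None."""
    for is_violation, warning in RULES:
        if is_violation(message):
            return warning
    return None


# In the listener, one delete/send pair would then cover every rule:
#     warning = first_violation(...)
#     if warning is not None:
#         await message.delete()
#         await message.channel.send(f"{message.author.mention} {warning}", delete_after=5)
```

Because only the first matching rule fires, a message is deleted and warned about at most once, which also covers the if/elif concern raised earlier.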

Bottersnike · 2020-07-09T20:20:28Z

cdbot/cogs/admin.py

+        def check(m):
+            return m.author == message.author and (datetime.datetime.utcnow() - m.created_at).seconds < 4
+
+        def who(m):


This isn't really a very suitable function name. is_original_author or just is_author would be a far more descriptive name.

Bottersnike · 2020-07-09T20:20:59Z

cdbot/cogs/admin.py

@@ -89,6 +92,32 @@ async def on_member_join(self, member: Member):
            # assign placeholder nickname
            await member.edit(nick=PLACEHOLDER_NICKNAME)

+    @Cog.listener()
+    async def on_message(self, message):
+        def check(m):


Check? Check for what? Give this function a name that tells me what it actually is; it's performing a very specific task, so name it like it is.

Docstrings, typehints, etc.

Bottersnike · 2020-07-09T20:21:46Z

cdbot/cogs/admin.py

+            # Checks if message contains banned word.
+            if any([word in message.content for word in BANNED_WORDS]):
+                await message.delete()
+                await message.channel.send(f"{message.author.mention} Watch your language!", delete_after=5)


delete_after=5 is a bit of a magic number here, and it is going to be a pain to replace every occurrence of it if it needs changing.

Bottersnike · 2020-07-09T20:27:11Z

cdbot/constants.py

@@ -143,6 +144,7 @@ class Exchange:
 }

 # Admin Constants
+BANNED_WORDS = []


There's already a banned words constant defined here:

cyberdisc-bot/cdbot/constants.py

Lines 149 to 153 in dc4f20a

NICKNAME_PATTERNS = [
    r"(discord\.gg/)",  # invite links
    r"(nigg|ligma|fag|nazi|hitler|\bpaki\b)",  # banned words
    r"(http(s)?:\/\/.)?(www\.)?[-a-zA-Z0-9@:%._\+~#=]{2,256}\.[a-z]{2,6}\b([-a-zA-Z0-9@:%_\+.~#?&//=]*)",  # hyperlinks
]

It would probably be better to pull this out into its own constant and use it, rather than having two copies of banned words.
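A sketch of what pulling the pattern out might look like. The pattern contents are placeholders, and `BANNED_WORDS_PATTERN`, `nickname_is_allowed`, and `message_is_allowed` are hypothetical names; how the bot actually consumes NICKNAME_PATTERNS is an assumption here.

```python
import re

# Single source of truth for banned words (placeholder entries).
BANNED_WORDS_PATTERN = r"(foo|\bbar\b)"

NICKNAME_PATTERNS = [
    r"(discord\.gg/)",  # invite links
    BANNED_WORDS_PATTERN,  # banned words, shared with the message filter
]

BANNED_WORDS_RE = re.compile(BANNED_WORDS_PATTERN, re.IGNORECASE)


def nickname_is_allowed(nick: str) -> bool:
    """Assumed nickname check: reject if any nickname pattern matches."""
    return not any(re.search(p, nick, re.IGNORECASE) for p in NICKNAME_PATTERNS)


def message_is_allowed(content: str) -> bool:
    """Message check reusing the same banned-words pattern."""
    return BANNED_WORDS_RE.search(content) is None
```

Adding a word then means editing one constant rather than two lists.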

thebeanogamer · 2020-07-09T21:58:32Z

I'd like to repeat most of @Bottersnike's points, but from an infra perspective: this will be hilariously inefficient computationally, it will be slow, and it will murder the instance the bot runs on (which until now has been fine as a 1-core, 512 MB container).

ghost · 2020-07-09T22:07:59Z

Thanks for the suggestions @Bottersnike. I will implement them soon.

lightspeedana · 2020-07-09T22:11:51Z

Taking a brief look through, @H4ckerJ4cker, I understand you've tried hard to build this solution, but I suggest scrapping a lot of it and leaving it to someone else to try for now. I'd like to think this works as a fairly heuristic approach, but it is inefficient in many ways, and this is less of a side project and more of an actual job that needs to be done with no room for error. I'm happy to ensure someone in the mod team takes this role on and fulfils it, and I'm sure you'll be able to learn from the final solution, which will include more efficient solutions to the problems you've tried to solve.

Thanks so much for the help, and we'll be taking some of these ideas forward, but I think from now on it's best that comdev/sudo/root take this on 😄

ghost · 2020-07-09T22:13:40Z

@lightspeedana this was never meant to be even thought about getting pushed yet. It is a draft.

HackerJacker added 4 commits June 28, 2020 00:26

Automod for #294

07a928e

Added anti fast messages

0aacbe7

This still needs some optimisation but I think the basis is there.

fixed lint issues

e91eb8e

Fixed a few grammar issues

dc4f20a

github-actions bot added the labels "admin" (Changes to the admin cog) and "module" (Changes to the bot module) on Jun 28, 2020

Bottersnike suggested changes Jul 9, 2020

Bottersnike reviewed Jul 9, 2020

lightspeedana closed this Jul 9, 2020
