javascript - regular expression to match hashtags in both left to right and right to left languages - Stack Overflow

admin•2025-04-21 22:01:08•questions•阅读2

I use the following code to find words that start with hashtags:var regex = (?:^|W)#(w+)(?!w)g;bu

I use the following code to find words that start with hashtags:

var regex = /(?:^|\W)#(\w+)(?!\w)/g;

but it only matches the English words and it can not match hashtags in other languages such as arabic. so, how can I find hashtags in a text like this:

this is a simple #text
هذا #نص بسیط

I use the following code to find words that start with hashtags:

var regex = /(?:^|\W)#(\w+)(?!\w)/g;

but it only matches the English words and it can not match hashtags in other languages such as arabic. so, how can I find hashtags in a text like this:

this is a simple #text
هذا #نص بسیط

Share Improve this question asked Oct 10, 2020 at 9:37 user6931342 1553 silver badges13 bronze badges

Add a ment |

3 Answers 3

Sorted by: Reset to default 5

If the value after the # should not contain a # itself, you could use a negated character class [^\s#] matching any character except # either way around using an alternation |

The value is in capture group 1.

(?:^|\s)(#[^\s#]+|[^\s#]+#)(?=$|\s)

Regex demo

const pattern = /(?:^|\s)(#[^\s#]+|[^\s#]+#)(?=$|\s)/;
[
  "this is a simple #test1",
  "هذا #نص بسیط",
  "test #test2#",
  "test #test3#test3",
  "test ##test4",
  "test test5##",
].forEach(s => {
  const m = s.match(pattern);
  if (m) console.log(m[1]);
});

You may use the following regex alternation:

(?<!\S)#\S+|\S+#(?!\S)

Demo

Bearing in mind that a Unicode aware \w can be represented with [\p{Alphabetic}\p{Mark}\p{Decimal_Number}\p{Connector_Punctuation}\p{Join_Control}] (see What's the correct regex range for javascript's regexes to match all the non word characters in any script?), the direct Unicode equivalent of your pattern is

const uw = String.raw`[\p{Alphabetic}\p{Mark}\p{Decimal_Number}\p{Connector_Punctuation}\p{Join_Control}]`; // uw = Unicode \w
const regex = new RegExp(`(?<!${uw})#(${uw}+)(?!${uw})`, "gu");

Now, to match both directions, you may use

const regex = new RegExp(`(?<!${uw})(?:#(${uw}+)|${uw}+#)(?!${uw})`, "gu");
                                  ^_________^_______^

That is, a non-capturing group with an alternation | char is used with two alernatives, that match # + Unicode word chars on the right, or Unicode word chars and then a # on the right. Details:

(?<!${uw}) - a negative lookbehind that fails the match if there is a Unicode word char immediately on the left
(?:#(${uw}+)|${uw}+#) - a non-capturing group that matches either
- #(${uw}+) - a # char followed with one or more Unicode word chars
- | - or
- ${uw}+# - one or more Unicode word chars followed with a # char
(?!${uw}) - a negative lookahead that fails the match if there is a Unicode word char immediately on the right.

The g flag ensures multiple matches and u enables the Unicode property classes support in the pattern.

A JavaScript demo:

const strings = ["this is a simple #text #text2", "هذا #نن*&ص بسیط","#نص2 هذا #نص بسیط"];
const uw = String.raw`[\p{Alphabetic}\p{Mark}\p{Decimal_Number}\p{Connector_Punctuation}\p{Join_Control}]`; // uw = Unicode \w
const regex = new RegExp(`(?<!${uw})(?:#(${uw}+)|${uw}+#)(?!${uw})`, "gu");
strings.forEach( string => console.log(string, '=>', string.match(regex)))

发布者：admin，转转请注明出处：http://www.yc00.com/questions/1745223125a4617341.html

admin

questions
homepage - Is it possible to use a single custom post as the site front page
I know I can set a static page as the homepage but is it possible to set a single custom post as the site front page?I c
admin
33分钟前
10
questions
Javascript Regex for all words not between certain characters - Stack Overflow
I'm trying to return a count of all words NOT between square brackets. So given ..[don't matc
admin
31分钟前
10
questions
javascript - Testing Google One Tap - closed and now getting "suppressed-by-user" message - Stack Overflow
I am adding the Google One Tap api to a React application. I am correctly getting the one tap login mod
admin
30分钟前
10
questions
javascript - JS - Check if all an object's own properties are true - Stack Overflow
I have an object that has several fields that could potentially get shifted to true for a user (think l
admin
29分钟前
10
questions
javascript - How do i access childNode of <label> using document.getElementsByClassName()? - Stack Overflow
<html><head><head><body><span class="mtb-price"><label
admin
27分钟前
10
questions
javascript - How can I change the color of a changed cell in Handsontable? - Stack Overflow
I am using the Handsontable plugin and when the user changes the values in the cell, it should turn yel
admin
25分钟前
00
questions
javascript - Web Direct Print Plug-in - Stack Overflow
We all know that it's impossible to do native print in a browser that bypasses the browser's
admin
23分钟前
00
questions
asp.net - Use Javascript to copy Text from Label - Stack Overflow
Label1 (asp control) is located inside Panel1 of my webpage and I have a button called bt.What is the
admin
21分钟前
10
questions
javascript - Audio continuously playing across all pages? - Stack Overflow
Is this even possible? To have an mp3 play where it left off when you navigate to a different page on t
admin
21分钟前
00
questions
How to make a custom button that redirects to a "user specified link while entering product details" woocommer
Tried this solution but its not workingfunction wc_shop_demo_button() {echo '<a class="button demo_button&
admin
21分钟前
10
questions
hosting - Adding video to a Wordpress website
Closed. This question is off-topic. It is not currently accepting answers.Asking to recommend a product (plugin, theme,
admin
19分钟前
00
questions
php - Using JavascriptjQuery to get Query String in Segment-based URL - Stack Overflow
I'm using a PHP framework Codeigniter that uses segment based urls likeinstead of the usual quer
admin
18分钟前
10
questions
javascript - Clear a specific context of a canvas - Stack Overflow
I'm working on a project with HTML canvas and I'm in trouble with a few things.I want to be a
admin
15分钟前
10
questions
How to update a custom field in all posts with the value of another custom field in the same post?
I try to get the value of the source field 'enddate', save it in the variable $enddatevar and write it in the
admin
11分钟前
00
questions
Force iOS to download image from HTML5 Canvas (using pure javascript) - Stack Overflow
This question has been asked before and the general response is that it can't be done on iOS. Howe
admin
9分钟前
00
questions
javascript - html2canvas resetting letter spacing - Stack Overflow
$('#cypher-branding-letter-spacing').change(function(e) {$('#cypher-branding-main-edit-ri
admin
8分钟前
00
questions
Dynamic sidebar rendered in another place than i would like
I'm trying include dynamic sidebar to Uncode Theme. I created child template, then I created all files and now I ha
admin
7分钟前
00
questions
Best practice: JavascriptJquery saving variable for later use - Stack Overflow
I'm sure, this question has been answered somewhere before but I just couldn't find it.If wit
admin
6分钟前
00
questions
javascript - Cannot use 'in' operator to search for 'model' - Stack Overflow
There is many questions from the same object. But mine is little different. The difference is that in m
admin
5分钟前
00
questions
javascript - Rails4 - why hidden.bs.modal is not firing? - Stack Overflow
I have bootstrap 3.3.1 in my gemfile. Did bundle install.I have the following in my view<div class=&
admin
11秒前
00

发表回复

评论列表（0条）

暂无评论

javascript - regular expression to match hashtags in both left to right and right to left languages - Stack Overflow

3 Answers 3

Demo

发表回复

评论列表（0条）

联系我们

400-800-8888

javascript - regular expression to match hashtags in both left to right and right to left languages - Stack Overflow

3 Answers 3

Demo

相关推荐

发表回复

评论列表（0条）

联系我们

400-800-8888