Explore chapters and articles related to this topic
Natural Language Processing
Published in Vishal Jain, Akash Tayal, Jaspreet Singh, Arun Solanki, Cognitive Computing Systems, 2021
V. Vishnuprabha, Lino Murali, Daleesha M. Viswanathan
Rule-based approaches are very efficient for rule-based tasks. Natural language has a specific rule for syntax and semantics, and hence, those tasks are rule based. Therefore, the analysis of syntax and semantics can be quickly done using rule-based approaches. The main disadvantage of a rule-based system is that the efficiency of the system depends on the programmer who creates rules. If he is efficient, the system will also be efficient. A skilled programmer can build the system within a limited period. Rule-based methods used in computers are regular expressions and FA. Regular expressions are very fast and easy to use. Most of the preprocessing and morphological tasks are done using regular expressions. One of the everyday NLP tasks called string matching can be done quickly using regular expressions.
Ideation
Published in Walter R. Paczkowski, Deep Data Analytics for New Product Development, 2020
Tokenization is done with a text manipulation software language called regular expressions. Regular expressions are powerful, widely used, but yet arcane and difficult to understand and interpret, not to mention difficult to write. It consists of defining a pattern using metacharacters (e.g., the asterisk, question mark, and period or dot are three metacharacters) and regular characters (e.g., letters of the alphabet and the digits 0–9) which are meant to match corresponding symbols in a character string. Each metacharacter is a special pattern matching instruction. The question mark metacharacter, for example, instructs the software to match zero or one occurrence of a preceding character in the pattern. The character string could contain letters, symbols, and digits. White spaces are also possible and it is these white spaces that break a string into tokens. For example, the classic string “The quick brown fox” has three separating white spaces and would be decomposed into four tokens. Regular expression capabilities are found in many programming languages such as Python, Pearl, and R to mention a few. See Watt [2005] and Frield [2002] for excellent introductions to regular expressions.
Speaking Naturally: Text and Natural Language Processing
Published in Jesús Rogel-Salazar, Advanced Data Science and Analytics with Python, 2020
In Python, the re module enables us to use regular expressions. When working with Python strings which will be parsed with regular expressions, it is recommended to use raw strings. This is because in raw strings, backslashes have no special meaning as an escape character. Anyway, we mentioned above that a simple search and replace function can be used to substitute a desired pattern, let us take a look at implementing this use case with re. A raw string in Python is preceded by an r as such r ‘This is a raw string.’
Crashworthiness analysis of circumferential sinusoidal hierarchical tubes (CSHTs): Experiment, simulation and theoretical prediction
Published in Mechanics of Advanced Materials and Structures, 2022
Zi Yang, Shuguang Yao, Chengxing Yang
The geometrical parameters of two layers of sinusoidal tubes are the same. Double layers of sinusoidal tubes intersect at an angle when the center coincides. First of all, the first layer of sinusoidal tube is axially rotated degrees about the center point to obtain the second layer of sinusoidal tube. The third layer of tube is a regular polygon with 2 N sides. The radius of the outer circle of the regular polygon is R0, which is the basic nominal radius of the sinusoidal curve. CSHT is obtained by superimposing the center of three layers as shown in Figure 4. CSHT is intersected by two layers of sinusoidal curves, forming 2 N hollow holes in the periphery. As an intermediate layer, the regular polygon is sandwiched between two sinusoidal curves. The vertex of the regular polygon is just the intersection point of two sinusoidal curves.
Modelling virus contact mechanics under atomic force imaging conditions
Published in Applicable Analysis, 2022
Paolo Piersanti, Kristen White, Bogdan Dragnea, Roger Temam
Beyond the shape does not change anymore. At this point, a continuum description will be adequate. The dotted line circle in Figure 5 corresponds to the circle that circumscribes the initial, regular polyhedron, added to the polygonal shape in a way that superposes the apex point of the circle and the polygon and preserves the left-right symmetry. Below the continuum threshold, there are some variations of the polygonal shape with respect to the circumscribing circle, but they are rather minor in the vicinity of the apex point. However, the apex point height varies significantly below the continuum threshold. The practical outcome of this observation is that the AFM measurement of the top part of the virus, which is very accurate, could be used in fitting it at every map point with a spherical shell. It is not unreasonable to hypothesize that, from the radius and the center of curvature for that shell one could deduce the elastic constants of the virus particle at an unprecedented level of accuracy.
High school prospective teachers’ problem-solving reasoning that involves the coordinated use of digital technologies
Published in International Journal of Mathematical Education in Science and Technology, 2019
Manuel Santos-Trigo, Isaid Reyes-Martínez
They reported three findings based the exploration of the extended model: When the position of the interior point coincides with the centre of polygon, then the generated polygonal region is a regular polygon with the same number of sides.When the number of sides of the initial regular polygon increases (Figure 14 shows a polygon with 200 sides), then the intersection of the corresponding perpendicular bisectors (red points) seems to form an ellipse and when point G is outside of the circle the intersection points generate a hyperbola.When the number of side of the regular polygon increases and involves more than 200 sides, it is observed that the generated polygon becomes a circle. Figure 15 shows a circle with centre at point A, D is any point on the circle, f is the perpendicular bisector of segment DG that intersects line AD at point E. Then the locus of point E when point D is moved along the circle is an ellipse. This is true because segment ED and GE are congruent (E is on the perpendicular bisector) the radius AD is constant. Then it holds that d(A, E) + d(E, G) is always constant (definition of an ellipse) (Assessment dimension).